Skip to main content
Top
Published in: Cluster Computing 2/2019

09-03-2018

RETRACTED ARTICLE: Research on semi supervised K-means clustering algorithm in data mining

Published in: Cluster Computing | Special Issue 2/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

K-means clustering has become an important tool for the analysis of gene expression data, which can also look for the expression of cluster with the same fluctuation from two directions of genes and conditions. But the K-means clustering is a multi-objective local search algorithm, which is easy to fall into local optimum when dealing with complex data of the gene. In order to improve the global search capability of the algorithm, this paper presents a semi supervised K clustering algorithm. Firstly, the K—means clustering algorithm is used to deal with gene data. Then the improved semi supervised K mean clustering is used for the greedy iteration to find the K mean clustering, so as to achieve better results. Through the simulation experiment, the results prove the global semi supervised K clustering algorithm has better optimization ability and better cluster effect compared with MDO algorithm.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Lausch, A., Schmidt, A., Tischendorf, L.: Data mining and linked open data—new perspectives for data analysis in environmental research. Ecol. Model. 295, 5–17 (2015)CrossRef Lausch, A., Schmidt, A., Tischendorf, L.: Data mining and linked open data—new perspectives for data analysis in environmental research. Ecol. Model. 295, 5–17 (2015)CrossRef
2.
go back to reference Santos, I., Brezo, F., Ugarte-Pedrero, X., Bringas, P.G.: Opcode sequences as representation of executables for data-mining-based unknown malware detection. Inf. Sci. 231(9), 64–82 (2013)MathSciNetCrossRef Santos, I., Brezo, F., Ugarte-Pedrero, X., Bringas, P.G.: Opcode sequences as representation of executables for data-mining-based unknown malware detection. Inf. Sci. 231(9), 64–82 (2013)MathSciNetCrossRef
3.
go back to reference Astolfi, D., Castellani, F., Garinei, A., Terzi, L.: Data mining techniques for performance analysis of onshore wind farms. Appl. Energy 148, 220–233 (2015)CrossRef Astolfi, D., Castellani, F., Garinei, A., Terzi, L.: Data mining techniques for performance analysis of onshore wind farms. Appl. Energy 148, 220–233 (2015)CrossRef
4.
go back to reference Grigoras, G., Scarlatache, F.: An assessment of the renewable energy potential using a clustering based data mining method. Case study in romania. Energy 81, 416–429 (2015)CrossRef Grigoras, G., Scarlatache, F.: An assessment of the renewable energy potential using a clustering based data mining method. Case study in romania. Energy 81, 416–429 (2015)CrossRef
5.
go back to reference Schneider, A.: Monitoring land cover change in urban and peri-urban areas using dense time stacks of landsat satellite data and a data mining approach. Remote Sens. Environ. 124, 689–704 (2012)CrossRef Schneider, A.: Monitoring land cover change in urban and peri-urban areas using dense time stacks of landsat satellite data and a data mining approach. Remote Sens. Environ. 124, 689–704 (2012)CrossRef
6.
go back to reference Ferreira, J.C., Almeida, J.D., Silva, A.R.D.: The impact of driving styles on fuel consumption: a data-warehouse-and-data-mining-based discovery process. IEEE Trans. Intell. Transp. Syst. 16(5), 2653–2662 (2015)CrossRef Ferreira, J.C., Almeida, J.D., Silva, A.R.D.: The impact of driving styles on fuel consumption: a data-warehouse-and-data-mining-based discovery process. IEEE Trans. Intell. Transp. Syst. 16(5), 2653–2662 (2015)CrossRef
7.
go back to reference Yang, Y., Tan, W., Li, T., Da, R.: Consensus clustering based on constrained self-organizing map and improved cop-kmeans ensemble in intelligent decision support systems. Knowl.-Based Syst. 32(32), 101–115 (2012)CrossRef Yang, Y., Tan, W., Li, T., Da, R.: Consensus clustering based on constrained self-organizing map and improved cop-kmeans ensemble in intelligent decision support systems. Knowl.-Based Syst. 32(32), 101–115 (2012)CrossRef
8.
go back to reference Taghizadeh-Mehrjardi, R., Nabiollahi, K., Minasny, B., Triantafilis, J.: Comparing data mining classifiers to predict spatial distribution of usda-family soil groups in Baneh region, Iran. Geoderma 253–254, 67–77 (2015)CrossRef Taghizadeh-Mehrjardi, R., Nabiollahi, K., Minasny, B., Triantafilis, J.: Comparing data mining classifiers to predict spatial distribution of usda-family soil groups in Baneh region, Iran. Geoderma 253–254, 67–77 (2015)CrossRef
9.
go back to reference Zhang, S., Jin, W., Huang, Y., Su, W., Yang, J., Feng, Z.: Profiling a caenorhabditis elegans behavioral parametric dataset with a supervised k-means clustering algorithm identifies genetic networks regulating locomotion. J. Neurosci. Methods 197(2), 315–323 (2011)CrossRef Zhang, S., Jin, W., Huang, Y., Su, W., Yang, J., Feng, Z.: Profiling a caenorhabditis elegans behavioral parametric dataset with a supervised k-means clustering algorithm identifies genetic networks regulating locomotion. J. Neurosci. Methods 197(2), 315–323 (2011)CrossRef
Metadata
Title
RETRACTED ARTICLE: Research on semi supervised K-means clustering algorithm in data mining
Publication date
09-03-2018
Published in
Cluster Computing / Issue Special Issue 2/2019
Print ISSN: 1386-7857
Electronic ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-018-2199-7

Other articles of this Special Issue 2/2019

Cluster Computing 2/2019 Go to the issue

Premium Partner