Skip to main content
Top
Published in: Pattern Analysis and Applications 4/2018

01-03-2018 | Industrial and Commercial Application

Gene clustering with hidden Markov model optimized by PSO algorithm

Authors: Mohammad Soruri, Javad Sadri, S. Hamid Zahiri

Published in: Pattern Analysis and Applications | Issue 4/2018

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Gene clustering is one of the most important problems in bioinformatics. In the sequential data clustering, hidden Markov models (HMMs) have been widely used to find similarity between sequences, due to their capability of handling sequence patterns with various lengths. In this paper, a novel gene clustering scheme based on HMMs optimized by particle swarm optimization algorithm is introduced. In this approach, each gene sequence is described by a specific HMM, and then for each model, its probability to generate individual sequence is evaluated. A hierarchical clustering algorithm based on a new definition of a distance measure has been applied to find the best clusters. Experiments carried out on lung cancer-related genes dataset show that the proposed approach can be successfully utilized for gene clustering.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
2.
go back to reference Krogh A, Brown M, Mian I.S, Sjolander K, Haussler D (1993) Hidden Markov models in computational biology: application to protein modeling. UCSC-CRL-93-32 Krogh A, Brown M, Mian I.S, Sjolander K, Haussler D (1993) Hidden Markov models in computational biology: application to protein modeling. UCSC-CRL-93-32
3.
go back to reference Zhang ZY, Li T, Ding C, Ren XW, Zhang XS (2010) Binary matrix factorization for analyzing gene expression data. Data Min Knowl Discov 20:28–52MathSciNetCrossRef Zhang ZY, Li T, Ding C, Ren XW, Zhang XS (2010) Binary matrix factorization for analyzing gene expression data. Data Min Knowl Discov 20:28–52MathSciNetCrossRef
4.
go back to reference Vignes M, Forbes F (2009) Gene clustering via integrated Markov models combining individual and pairwise features. IEEE/ACM Trans Comput Biol Bioinform 6:260–270CrossRef Vignes M, Forbes F (2009) Gene clustering via integrated Markov models combining individual and pairwise features. IEEE/ACM Trans Comput Biol Bioinform 6:260–270CrossRef
5.
go back to reference Durbin R, Eddy SR, Krogh A, Mitchison G (1998) Biological sequence analysis. Cambridge University Press, CambridgeCrossRef Durbin R, Eddy SR, Krogh A, Mitchison G (1998) Biological sequence analysis. Cambridge University Press, CambridgeCrossRef
6.
go back to reference Rabiner LR (1989) A tutorial on hidden Markov models and selected applications in speech recognition. Proc IEEE 77:257–286CrossRef Rabiner LR (1989) A tutorial on hidden Markov models and selected applications in speech recognition. Proc IEEE 77:257–286CrossRef
7.
go back to reference Rabiner LR, Lee CH, Juang BH, Wilpon JG (1989) HMM clustering for connected word recognition. In: Proceedings of IEEE ICASSP, pp 405–408 Rabiner LR, Lee CH, Juang BH, Wilpon JG (1989) HMM clustering for connected word recognition. In: Proceedings of IEEE ICASSP, pp 405–408
8.
go back to reference Lee KF (1990) Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition. IEEE Trans Acoust Speech Signal Process 38:599–609CrossRef Lee KF (1990) Context-dependent phonetic hidden Markov models for speaker-independent continuous speech recognition. IEEE Trans Acoust Speech Signal Process 38:599–609CrossRef
9.
go back to reference Al-Hajj R, Mokbel C, Likforman-Sulem L (2007) Combination of HMM-based classifiers for the recognition of arabic handwritten words. In: 9th International conference on document analysis and recognition, pp 959–963 Al-Hajj R, Mokbel C, Likforman-Sulem L (2007) Combination of HMM-based classifiers for the recognition of arabic handwritten words. In: 9th International conference on document analysis and recognition, pp 959–963
10.
go back to reference Panuccio A, Bicego M, Murino V (2002) A hidden Markov model-based approach to sequential data clustering. Struct Synt Stat Pattern Recognit 2396:734–743MATH Panuccio A, Bicego M, Murino V (2002) A hidden Markov model-based approach to sequential data clustering. Struct Synt Stat Pattern Recognit 2396:734–743MATH
11.
go back to reference Bicego M, Murino V, Figueiredo MAT (2004) Similarity-based classification of sequences using hidden Markov models. Pattern Recognit Soc 37:2281–2291CrossRef Bicego M, Murino V, Figueiredo MAT (2004) Similarity-based classification of sequences using hidden Markov models. Pattern Recognit Soc 37:2281–2291CrossRef
12.
go back to reference Li C, Biswas G (2000) A Bayesian approach to temporal data clustering using hidden Markov models. In: Proceedings of the 17th international conference on machine learning, pp 543–550 Li C, Biswas G (2000) A Bayesian approach to temporal data clustering using hidden Markov models. In: Proceedings of the 17th international conference on machine learning, pp 543–550
13.
go back to reference Ferles C, Stafylopatis A (2008) Sequence clustering with the self-organizing hidden Markov model map. In: 8th IEEE international conference on bioinformatics and bioengineering, pp 1–7 Ferles C, Stafylopatis A (2008) Sequence clustering with the self-organizing hidden Markov model map. In: 8th IEEE international conference on bioinformatics and bioengineering, pp 1–7
14.
go back to reference Mesa A, Basterrech S, Guerberoff G, Alveraz-Valin F (2015) Hidden Markov models for gene sequence classification. Pattern Anal Appl 19:793–805CrossRef Mesa A, Basterrech S, Guerberoff G, Alveraz-Valin F (2015) Hidden Markov models for gene sequence classification. Pattern Anal Appl 19:793–805CrossRef
15.
go back to reference Kennedy J, Eberhart RC (1995) Particle swarm optimization. Process IEEE Int Conf Neural Netw 4:1942–1948CrossRef Kennedy J, Eberhart RC (1995) Particle swarm optimization. Process IEEE Int Conf Neural Netw 4:1942–1948CrossRef
16.
go back to reference Goldberg DE, Holland JH (1988) Genetic algorithms and machine learning. Mach Learn 3:95–99CrossRef Goldberg DE, Holland JH (1988) Genetic algorithms and machine learning. Mach Learn 3:95–99CrossRef
17.
go back to reference Angeline PJ (1998) Evolutionary optimization versus particle swarm optimization: philosophy and performance differences. Evolut Program VII 1447:601–610 Angeline PJ (1998) Evolutionary optimization versus particle swarm optimization: philosophy and performance differences. Evolut Program VII 1447:601–610
18.
go back to reference Xue L, Yin J, Ji Z, Jiang L (2006) A particle swarm optimization for hidden Markov model training. In: Proceedings of 8th international conference on signal processing Xue L, Yin J, Ji Z, Jiang L (2006) A particle swarm optimization for hidden Markov model training. In: Proceedings of 8th international conference on signal processing
19.
go back to reference Banu PK, Andrews S (2015) Gene clustering using metaheuristic optimization algorithms. Int J Appl Metaheur Comput 6(4):14–38CrossRef Banu PK, Andrews S (2015) Gene clustering using metaheuristic optimization algorithms. Int J Appl Metaheur Comput 6(4):14–38CrossRef
20.
go back to reference Theodoridis S, Koutroumbas K (1999) Pattern recognition. Academic Press, CambridgeMATH Theodoridis S, Koutroumbas K (1999) Pattern recognition. Academic Press, CambridgeMATH
Metadata
Title
Gene clustering with hidden Markov model optimized by PSO algorithm
Authors
Mohammad Soruri
Javad Sadri
S. Hamid Zahiri
Publication date
01-03-2018
Publisher
Springer London
Published in
Pattern Analysis and Applications / Issue 4/2018
Print ISSN: 1433-7541
Electronic ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-018-0680-9

Other articles of this Issue 4/2018

Pattern Analysis and Applications 4/2018 Go to the issue

Premium Partner