Skip to main content
Top

2016 | OriginalPaper | Chapter

RNA Secondary Structure Prediction Using Extreme Learning Machine with Clustering Under-Sampling Technique

Authors : Tianhang Liu, Jiarun Lin, Chengkun Wu, Jianping Yin

Published in: Proceedings of ELM-2015 Volume 2

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper gives a machine learning method for the subject of RNA secondary structure prediction. The method is based on extreme learning machine for its outstanding performance in classification problem, and use under-sampling technique to solve the problem of data imbalance. Feature vector in the classifier includes covariation score and inconsistent sequence penalty. The proposed method is compared with SVM and ELM without under-sampling, as well as classical method RNAalifold in terms of sensitivity, specificity, Matthews correlation coefficient and G-mean. The training and testing data are 68 RNA aligned families from Rfam, version 11.0. The results show that the proposed method can achieve highest scores in sensitivity, MCC and G-mean, which means that it is an effective method for RNA secondary structure prediction.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Lodish, H.: Molecular Cell Biology. Macmillan (2008) Lodish, H.: Molecular Cell Biology. Macmillan (2008)
2.
go back to reference Schultes, E.A., Bartel, D.P.: One sequence, two ribozymes: implications for the emergence of new ribozyme folds. Science 289(5478), 448–452 (2000)CrossRef Schultes, E.A., Bartel, D.P.: One sequence, two ribozymes: implications for the emergence of new ribozyme folds. Science 289(5478), 448–452 (2000)CrossRef
3.
go back to reference Tinoco, I., Bustamante, C.: How RNA folds. J. Mol. Biol. 293(2), 271–281 (1999)CrossRef Tinoco, I., Bustamante, C.: How RNA folds. J. Mol. Biol. 293(2), 271–281 (1999)CrossRef
4.
go back to reference Varani, G., McClain, W.: The g x u wobble base pair. a fundamental building block of RNA structure crucial to RNA function in diverse biological systems. EMBO Rep. 1(1), 18–23 (2000)CrossRef Varani, G., McClain, W.: The g x u wobble base pair. a fundamental building block of RNA structure crucial to RNA function in diverse biological systems. EMBO Rep. 1(1), 18–23 (2000)CrossRef
5.
go back to reference Batey, R.T., Gilbert, S.D., Montange, R.K.: Structure of a natural guanine-responsive riboswitch complexed with the metabolite hypoxanthine. Nature 432(7015), 411–415 (2004)CrossRef Batey, R.T., Gilbert, S.D., Montange, R.K.: Structure of a natural guanine-responsive riboswitch complexed with the metabolite hypoxanthine. Nature 432(7015), 411–415 (2004)CrossRef
6.
go back to reference Kim, S.H., Quigley, G., Suddath, F., Rich, A.: High-resolution x-ray diffraction patterns of crystalline transfer RNA that show helical regions. Proc. Natl. Acad. Sci. 68(4), 841–845 (1971)CrossRef Kim, S.H., Quigley, G., Suddath, F., Rich, A.: High-resolution x-ray diffraction patterns of crystalline transfer RNA that show helical regions. Proc. Natl. Acad. Sci. 68(4), 841–845 (1971)CrossRef
7.
go back to reference Ferentz, A.E., Wagner, G.: NMR spectroscopy: a multifaceted approach to macromolecular structure. Q. Rev. Biophys. 33(1), 29–65 (2000)CrossRef Ferentz, A.E., Wagner, G.: NMR spectroscopy: a multifaceted approach to macromolecular structure. Q. Rev. Biophys. 33(1), 29–65 (2000)CrossRef
8.
go back to reference Ray, S.S., Pal, S.K.: RNA secondary structure prediction using soft computing. IEEE/ACM Trans. Comput. Biol. Bioinform. 10(1), 0002–17 (2013)CrossRef Ray, S.S., Pal, S.K.: RNA secondary structure prediction using soft computing. IEEE/ACM Trans. Comput. Biol. Bioinform. 10(1), 0002–17 (2013)CrossRef
9.
go back to reference Bindewald, E., Shapiro, B.A.: RNA secondary structure prediction from sequence alignments using a network of k-nearest neighbor classifiers. RNA 12(3), 342–352 (2006)CrossRef Bindewald, E., Shapiro, B.A.: RNA secondary structure prediction from sequence alignments using a network of k-nearest neighbor classifiers. RNA 12(3), 342–352 (2006)CrossRef
10.
go back to reference Benedetti, G., Morosetti, S.: A genetic algorithm to search for optimal and suboptimal RNA secondary structures. Biophys. Chem. 55(3), 253–259 (1995)CrossRef Benedetti, G., Morosetti, S.: A genetic algorithm to search for optimal and suboptimal RNA secondary structures. Biophys. Chem. 55(3), 253–259 (1995)CrossRef
11.
go back to reference Chen, J.-H., Le, S.-Y., Maizel, J.V.: Prediction of common secondary structures of RNAs: a genetic algorithm approach. Nucleic Acids Res. 28(4), 991–999 (2000)CrossRef Chen, J.-H., Le, S.-Y., Maizel, J.V.: Prediction of common secondary structures of RNAs: a genetic algorithm approach. Nucleic Acids Res. 28(4), 991–999 (2000)CrossRef
12.
go back to reference Yingjie, Z., Zhengzhi, W.: Consensus RNA secondary structure prediction based on SVMS. In: The 2nd International Conference on Bioinformatics and Biomedical Engineering, 2008. ICBBE 2008, pp. 101–104, IEEE (2008) Yingjie, Z., Zhengzhi, W.: Consensus RNA secondary structure prediction based on SVMS. In: The 2nd International Conference on Bioinformatics and Biomedical Engineering, 2008. ICBBE 2008, pp. 101–104, IEEE (2008)
13.
go back to reference Li, K., Kong, X., Lu, Z., Wenyin, L., Yin, J.: Boosting weighted elm for imbalanced learning. Neurocomputing 128, 15–21 (2014)CrossRef Li, K., Kong, X., Lu, Z., Wenyin, L., Yin, J.: Boosting weighted elm for imbalanced learning. Neurocomputing 128, 15–21 (2014)CrossRef
14.
go back to reference Liu, Q., He, Q., Shi, Z.: Extreme support vector machine classiffier. In: Advances in Knowledge Discovery and Data Mining, pp. 222–233. Springer (2008) Liu, Q., He, Q., Shi, Z.: Extreme support vector machine classiffier. In: Advances in Knowledge Discovery and Data Mining, pp. 222–233. Springer (2008)
15.
go back to reference Tianhang, L.I.U., Jianping, Y.I.N.: RNA secondary structure prediction using self-adaptive evolutionary extreme learning machine. J. Comput. Inform. Syst. 11(3), 995–1004 (2015) Tianhang, L.I.U., Jianping, Y.I.N.: RNA secondary structure prediction using self-adaptive evolutionary extreme learning machine. J. Comput. Inform. Syst. 11(3), 995–1004 (2015)
16.
go back to reference Burge, S.W., Daub, J., Eberhardt, R., Tate, J., Barquist, L., Nawrocki, E.P., Eddy, S.R., Gardner, P.P., Bateman, A.: Rfam 11.0: 10 years of RNA families. Nucleic Acids Res. 1005 (2012) Burge, S.W., Daub, J., Eberhardt, R., Tate, J., Barquist, L., Nawrocki, E.P., Eddy, S.R., Gardner, P.P., Bateman, A.: Rfam 11.0: 10 years of RNA families. Nucleic Acids Res. 1005 (2012)
17.
go back to reference Griffiths-Jones, S., Moxon, S., Marshall, M., Khanna, A., Eddy, S.R., Bateman, A.: Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res. 33(suppl 1), 121–124 (2005) Griffiths-Jones, S., Moxon, S., Marshall, M., Khanna, A., Eddy, S.R., Bateman, A.: Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res. 33(suppl 1), 121–124 (2005)
18.
go back to reference Watson, J.D., Crick, F.H., et al.: Molecular structure of nucleic acids. Nature 171(4356), 737–738 (1953)CrossRef Watson, J.D., Crick, F.H., et al.: Molecular structure of nucleic acids. Nature 171(4356), 737–738 (1953)CrossRef
19.
go back to reference Crick, F.: Codon-anticodon pairing: the wobble hypothesis (1965) Crick, F.: Codon-anticodon pairing: the wobble hypothesis (1965)
20.
go back to reference Leontis, N.B., Westhof, E.: Geometric nomenclature and classiffication of RNA base pairs. RNA 7(04), 499–512 (2001)CrossRef Leontis, N.B., Westhof, E.: Geometric nomenclature and classiffication of RNA base pairs. RNA 7(04), 499–512 (2001)CrossRef
21.
go back to reference Hofacker, I.L., Fekete, M., Stadler, P.F.: Secondary structure prediction for aligned RNA sequences. J. Mol. Biol. 319(5), 1059–1066 (2002)CrossRef Hofacker, I.L., Fekete, M., Stadler, P.F.: Secondary structure prediction for aligned RNA sequences. J. Mol. Biol. 319(5), 1059–1066 (2002)CrossRef
22.
go back to reference Gardner, P.P., Giegerich, R.: A comprehensive comparison of comparative RNA structure prediction approaches. BMC Bioinform. 5(1), 140 (2004)CrossRef Gardner, P.P., Giegerich, R.: A comprehensive comparison of comparative RNA structure prediction approaches. BMC Bioinform. 5(1), 140 (2004)CrossRef
23.
go back to reference Cambria, E., Huang, G.-B., Kasun, L.L.C., Zhou, H., Vong, C.M., Lin, J., Yin, J., Cai, Z., Liu, Q., Li, K., et al.: Extreme learning machines trends and controversies. IEEE Intell. Syst. 28(6), 30–59 (2013)CrossRef Cambria, E., Huang, G.-B., Kasun, L.L.C., Zhou, H., Vong, C.M., Lin, J., Yin, J., Cai, Z., Liu, Q., Li, K., et al.: Extreme learning machines trends and controversies. IEEE Intell. Syst. 28(6), 30–59 (2013)CrossRef
24.
go back to reference Huang, G.-B., Zhu, Q.-Y., Siew, C.-K.: Extreme learning machine: theory and applications. Neurocomputing 70(1), 489–501 (2006)CrossRef Huang, G.-B., Zhu, Q.-Y., Siew, C.-K.: Extreme learning machine: theory and applications. Neurocomputing 70(1), 489–501 (2006)CrossRef
25.
go back to reference Huang, G.-B., Ding, X., Zhou, H.: Optimization method based extreme learning machine for classification. Neurocomputing 74(1), 155–163 (2010)CrossRef Huang, G.-B., Ding, X., Zhou, H.: Optimization method based extreme learning machine for classification. Neurocomputing 74(1), 155–163 (2010)CrossRef
26.
go back to reference Huang, G.-B., Zhou, H., Ding, X., Zhang, R.: Extreme learning machine for regression and multiclass classification. IEEE Trans. Syst. Man Cybern. Part B: Cybern. 42(2), 513–529 (2012)CrossRef Huang, G.-B., Zhou, H., Ding, X., Zhang, R.: Extreme learning machine for regression and multiclass classification. IEEE Trans. Syst. Man Cybern. Part B: Cybern. 42(2), 513–529 (2012)CrossRef
Metadata
Title
RNA Secondary Structure Prediction Using Extreme Learning Machine with Clustering Under-Sampling Technique
Authors
Tianhang Liu
Jiarun Lin
Chengkun Wu
Jianping Yin
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-28373-9_27

Premium Partner