Skip to main content

2015 | OriginalPaper | Buchkapitel

Rapid Annotation of Non-coding RNA Structures with a Parameterized Filtering Approach

verfasst von : Yinglei Song, Junfeng Qu, Chunmei Liu

Erschienen in: Intelligent Computing Theories and Methodologies

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

An important problem in structural bioinformatics is to search genomes for RNA sequences with known secondary structures. Most of the existing approaches use sequence-structure alignment to evaluate the probability for a sequence segment to be a member of the searched RNA family. Due to the large amount of computation time needed by accurate sequence-structure alignments, filtering approaches have been developed to rapidly eliminate most part of the genome that are unlikely to contain sequences from the desired family. Most of the existing filtering tools construct and select filters based on the recognition ability of filters and do not consider the computation time needed by the filtering process. In this paper, we develop a parameterized filter selection approach that considers both the recognition ability and the computational efficiency of filters. As the first step, the approach constructs a set of filters that contain highly conserved parts in a searched family. A dynamic programming approach is then used to evaluate the recognition ability of each filter constructed in the first step. Finally, a set of filters are selected by solving a variant of the 0-1 knapsack problem that considers both their sizes and recognition ability. Our testing results showed that this new filtering approach can significantly speed up the search procedure without adversely affecting the search accuracy. It therefore can be combined with most of the existing search tools to significantly reduce the computation time needed for search.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Bafna, V., Zhang, S.: FastR: fast database search tool for non-coding RNA. In: Proceedings of the 3rd IEEE Computational Systems Bioinformatics Conference, pp. 52–61 (2004) Bafna, V., Zhang, S.: FastR: fast database search tool for non-coding RNA. In: Proceedings of the 3rd IEEE Computational Systems Bioinformatics Conference, pp. 52–61 (2004)
2.
Zurück zum Zitat Burge, S.W., et al.: Rfam 11.0: 10 years of RNA families. Nucleic Acids Res. 41, D226–D232 (2013)CrossRef Burge, S.W., et al.: Rfam 11.0: 10 years of RNA families. Nucleic Acids Res. 41, D226–D232 (2013)CrossRef
3.
Zurück zum Zitat Frank, D.N., Pace, N.R.: Ribonuclease P: unity and diversity in a tRNA processing ribozyme. Annu. Rev. Biochem. 67, 153–180 (1998)CrossRef Frank, D.N., Pace, N.R.: Ribonuclease P: unity and diversity in a tRNA processing ribozyme. Annu. Rev. Biochem. 67, 153–180 (1998)CrossRef
4.
Zurück zum Zitat Huang, D.S., Yu, H.: Normalized feature vectors: a novel alignment-free sequence comparison method based on the numbers of adjacent amino acids. IEEE/ACM Trans. Comput. Biol. Bioinform. 10(2), 457–467 (2013)CrossRef Huang, D.S., Yu, H.: Normalized feature vectors: a novel alignment-free sequence comparison method based on the numbers of adjacent amino acids. IEEE/ACM Trans. Comput. Biol. Bioinform. 10(2), 457–467 (2013)CrossRef
5.
Zurück zum Zitat Kolbe, D.L., Eddy, S.R.: Fast filtering for RNA homology search. Bioinformatics 27, 3102–3109 (2011)CrossRef Kolbe, D.L., Eddy, S.R.: Fast filtering for RNA homology search. Bioinformatics 27, 3102–3109 (2011)CrossRef
6.
Zurück zum Zitat Meyer, F., Kurtz, S., Backofen, R., Will, S., Beckstette, M.: Structator: fast index-based search for RNA sequence-structure patterns. BMC Bioinformatics 12, 214 (2011)CrossRef Meyer, F., Kurtz, S., Backofen, R., Will, S., Beckstette, M.: Structator: fast index-based search for RNA sequence-structure patterns. BMC Bioinformatics 12, 214 (2011)CrossRef
7.
Zurück zum Zitat Liu, C., Song, Y., Malmberg, R.L., Cai, L.: Profiling and searching for RNA pseudoknot structures in genomes. In: Sunderam, V.S., van Albada, G.D., Sloot, P.M.A., Dongarra, J. (eds.) ICCS 2005. LNCS, vol. 3515, pp. 968–975. Springer, Heidelberg (2005)CrossRef Liu, C., Song, Y., Malmberg, R.L., Cai, L.: Profiling and searching for RNA pseudoknot structures in genomes. In: Sunderam, V.S., van Albada, G.D., Sloot, P.M.A., Dongarra, J. (eds.) ICCS 2005. LNCS, vol. 3515, pp. 968–975. Springer, Heidelberg (2005)CrossRef
8.
Zurück zum Zitat Liu, C., Song, Y., Hu, P., Malmberg, R.L., Cai, L.: Efficient annotation of non-coding RNA structures including pseudoknots via automated filters. In: Proceedings of the 2006 Computational Systems Bioinformatics Conference, pp. 99–110 (2006) Liu, C., Song, Y., Hu, P., Malmberg, R.L., Cai, L.: Efficient annotation of non-coding RNA structures including pseudoknots via automated filters. In: Proceedings of the 2006 Computational Systems Bioinformatics Conference, pp. 99–110 (2006)
9.
Zurück zum Zitat Lowe, T.M., Eddy, S.R.: tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964 (1997)CrossRef Lowe, T.M., Eddy, S.R.: tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964 (1997)CrossRef
10.
Zurück zum Zitat Mistry, J., Finn, R.D., Eddy, S.R., Bateman, A., Puna, M.: Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions. Nucleic Acids Res. 41, e123 (2013)CrossRef Mistry, J., Finn, R.D., Eddy, S.R., Bateman, A., Puna, M.: Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions. Nucleic Acids Res. 41, e123 (2013)CrossRef
11.
Zurück zum Zitat Nawrocki, E.P., Eddy, S.R.: Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 29, 2933–2935 (2013)CrossRef Nawrocki, E.P., Eddy, S.R.: Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 29, 2933–2935 (2013)CrossRef
12.
Zurück zum Zitat Nguyen, V.T., Kiss, T., Michels, A.A., Bensaude, O.: 7SK small nuclear RNA binds to and inhibits the activity of CDK9/cyclin T complexes. Nature 414, 322–325 (2001)CrossRef Nguyen, V.T., Kiss, T., Michels, A.A., Bensaude, O.: 7SK small nuclear RNA binds to and inhibits the activity of CDK9/cyclin T complexes. Nature 414, 322–325 (2001)CrossRef
13.
Zurück zum Zitat Song, Y., Liu, C., Malmberg, R.L., Pan, F., Cai, L.: Tree decomposition based fast search of rna structures including pseudoknots in genomes. In: Proceedings of the 2005 IEEE Computational Systems Bioinformatics Conference, pp. 223–234 (2005) Song, Y., Liu, C., Malmberg, R.L., Pan, F., Cai, L.: Tree decomposition based fast search of rna structures including pseudoknots in genomes. In: Proceedings of the 2005 IEEE Computational Systems Bioinformatics Conference, pp. 223–234 (2005)
14.
Zurück zum Zitat Song, Y., Yu, M.: On finding the longest antisymmetric path in directed acyclic graphs. Inf. Process. Lett. 115(2), 377–381 (2015)MathSciNetCrossRefMATH Song, Y., Yu, M.: On finding the longest antisymmetric path in directed acyclic graphs. Inf. Process. Lett. 115(2), 377–381 (2015)MathSciNetCrossRefMATH
15.
16.
Zurück zum Zitat Song, Y.: An improved parameterized algorithm for the independent feedback vertex set problem. Theor. Comput. Sci. 535, 25–30 (2014)CrossRefMATH Song, Y.: An improved parameterized algorithm for the independent feedback vertex set problem. Theor. Comput. Sci. 535, 25–30 (2014)CrossRefMATH
17.
Zurück zum Zitat Song, Y.: A new parameterized algorithm for rapid peptide sequencing. PLoS ONE 9(2), e87476 (2014)CrossRef Song, Y.: A new parameterized algorithm for rapid peptide sequencing. PLoS ONE 9(2), e87476 (2014)CrossRef
18.
Zurück zum Zitat Song, Y., Yu, M.: On the treewidths of graphs of bounded degree. PLoS ONE 10(4), e0120880 (2015)CrossRef Song, Y., Yu, M.: On the treewidths of graphs of bounded degree. PLoS ONE 10(4), e0120880 (2015)CrossRef
19.
Zurück zum Zitat Song, Y., Chi, A.Y.: A new approach for parameter estimation in the sequence-structure alignment of non-coding RNAs. J. Inf. Sci. Eng. 31(2), 593–607 (2015) Song, Y., Chi, A.Y.: A new approach for parameter estimation in the sequence-structure alignment of non-coding RNAs. J. Inf. Sci. Eng. 31(2), 593–607 (2015)
20.
Zurück zum Zitat Song, Y., Liu, C., Huang, X., Malmberg, R.L., Xu, Y., Cai, L.: Efficient parameterized algorithms for biopolymer structure-sequence alignment. IEEE/ACM Trans. Comput. Biol. Bioinform. 3(4), 423–432 (2006)CrossRef Song, Y., Liu, C., Huang, X., Malmberg, R.L., Xu, Y., Cai, L.: Efficient parameterized algorithms for biopolymer structure-sequence alignment. IEEE/ACM Trans. Comput. Biol. Bioinform. 3(4), 423–432 (2006)CrossRef
21.
Zurück zum Zitat Song, Y., Liu, C., Wang, Z.: A machine learning based approach for accurate annotation of noncoding RNAs. IEEE/ACM Trans. Comput. Biol. Bioinform. (to appear) Song, Y., Liu, C., Wang, Z.: A machine learning based approach for accurate annotation of noncoding RNAs. IEEE/ACM Trans. Comput. Biol. Bioinform. (to appear)
22.
Zurück zum Zitat Wang, B., Huang, D.S., Jiang, C.: A new strategy for protein interface identification using manifold learning method. IEEE Trans. Nanobiosci. 13(2), 118–123 (2014)MathSciNetCrossRef Wang, B., Huang, D.S., Jiang, C.: A new strategy for protein interface identification using manifold learning method. IEEE Trans. Nanobiosci. 13(2), 118–123 (2014)MathSciNetCrossRef
23.
Zurück zum Zitat Weinberg, Z., Ruzzo, W.L.: Faster genome annotation of non-coding RNA families without loss of accuracy. In: Proceedings of the Eighth Annual International Conference on Computational Molecular Biology, pp. 243–251 (2004) Weinberg, Z., Ruzzo, W.L.: Faster genome annotation of non-coding RNA families without loss of accuracy. In: Proceedings of the Eighth Annual International Conference on Computational Molecular Biology, pp. 243–251 (2004)
24.
Zurück zum Zitat Wheeler, T.J., Eddy, S.R.: NHMMER: DNA homology search with profile HMMs. Bioinformatics 29, 2487–2489 (2013)CrossRef Wheeler, T.J., Eddy, S.R.: NHMMER: DNA homology search with profile HMMs. Bioinformatics 29, 2487–2489 (2013)CrossRef
25.
Zurück zum Zitat Yang, Z., Zhu, Q., Luo, K., Zhou, Q.: The 7SK small nuclear RNA inhibits the Cdk9/cyclin T1 kinase to control transcription. Nature 414, 317–322 (2001)CrossRef Yang, Z., Zhu, Q., Luo, K., Zhou, Q.: The 7SK small nuclear RNA inhibits the Cdk9/cyclin T1 kinase to control transcription. Nature 414, 317–322 (2001)CrossRef
26.
Zurück zum Zitat Zhu, L., You, Z.H., Huang, D.S., Wang, B.: t-LSE: a novel robust geometric approach for modeling protein-protein interaction networks. PLoS ONE 8(4), e58368 (2013)CrossRef Zhu, L., You, Z.H., Huang, D.S., Wang, B.: t-LSE: a novel robust geometric approach for modeling protein-protein interaction networks. PLoS ONE 8(4), e58368 (2013)CrossRef
Metadaten
Titel
Rapid Annotation of Non-coding RNA Structures with a Parameterized Filtering Approach
verfasst von
Yinglei Song
Junfeng Qu
Chunmei Liu
Copyright-Jahr
2015
DOI
https://doi.org/10.1007/978-3-319-22186-1_54

Premium Partner