Skip to main content
Top

2024 | OriginalPaper | Chapter

Plant Protein Classification Using K-mer Encoding

Authors : K. Veningston, P. V. Venkateswara Rao, M. Pravallika Devi, S. Pranitha Reddy, M. Ronalda

Published in: Computational Intelligence and Network Systems

Publisher: Springer Nature Switzerland

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Proteins play an important role in the human body and in plants. A lack of expertise in protein labeling in plants can make it extremely difficult to characterize and comprehend the precise roles and activities of different proteins. Furthermore, it restricts development in fields like biotechnology, disease resistance, and crop enhancement. The presented project focuses on plant protein classification, aiming to overcome the challenges arising from limited protein labeling knowledge. Advanced machine learning techniques, including various classification algorithms such as Logistic Regression, Decision Tree, K-nearest neighbors (KNN), Support Vector Machines (SVM), Random Forest (RF), Multinomial Naive Bayes (NB), AdaBoost, and XGBoost, are employed to accurately classify protein sequences into their respective families. This classification approach provides valuable insights into the functions and roles of proteins within plants, ultimately advancing our understanding of plant biology. This attempt offers new possibilities for advancement in critical sectors such as agriculture, drug discovery, and genomic research by eliminating the limitations associated with limited protein labeling knowledge.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Öncül, A.B., Çelik, Y.: A hybrid deep learning model for classification of plant transcription factor proteins. SIViP 17, 2055–2061 (2023)CrossRef Öncül, A.B., Çelik, Y.: A hybrid deep learning model for classification of plant transcription factor proteins. SIViP 17, 2055–2061 (2023)CrossRef
4.
go back to reference Yadav, A.K., Singla, D.: VacPred: Sequence-based prediction of plant vacuole proteins using machine-learning techniques. J. Biosci. 45, 1–9 (2020)CrossRef Yadav, A.K., Singla, D.: VacPred: Sequence-based prediction of plant vacuole proteins using machine-learning techniques. J. Biosci. 45, 1–9 (2020)CrossRef
5.
go back to reference Simon, O.A., et al.: K-mer-based machine learning method to classify LTR-retrotransposons in plant genomes. PeerJ 9, e11456 (2021) Simon, O.A., et al.: K-mer-based machine learning method to classify LTR-retrotransposons in plant genomes. PeerJ 9, e11456 (2021)
6.
go back to reference Warin, W., et al. Ensemble of multiple classifiers for multilabel classification of plant protein subcellular localization. Life 11.4, 293 (2021) Warin, W., et al. Ensemble of multiple classifiers for multilabel classification of plant protein subcellular localization. Life 11.4, 293 (2021)
7.
go back to reference Guo, Y., Hou, L., Zhu, W., Wang, P.: Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes. Front. Genet. 12, 797641 (2021)CrossRef Guo, Y., Hou, L., Zhu, W., Wang, P.: Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes. Front. Genet. 12, 797641 (2021)CrossRef
8.
go back to reference Juneja, S., Dhankhar, A., Juneja, A., Bali, S.: An approach to DNA sequence classification through machine learning: DNA sequencing, K-Mer counting, thresholding, sequence analysis. Int. J. Reliable Qual. E-Healthcare (IJRQEH) 11(2), 1–15 (2022)CrossRef Juneja, S., Dhankhar, A., Juneja, A., Bali, S.: An approach to DNA sequence classification through machine learning: DNA sequencing, K-Mer counting, thresholding, sequence analysis. Int. J. Reliable Qual. E-Healthcare (IJRQEH) 11(2), 1–15 (2022)CrossRef
9.
go back to reference Sangphukieo, A., Laomettachit, T., Ruengjitchatchawalya, M.: Photosynthetic protein classification using genome neighborhood-based machine learning feature. Sci. Rep. 10(1), 7108 (2020)CrossRef Sangphukieo, A., Laomettachit, T., Ruengjitchatchawalya, M.: Photosynthetic protein classification using genome neighborhood-based machine learning feature. Sci. Rep. 10(1), 7108 (2020)CrossRef
10.
go back to reference Gotoh, O., Morita, M., Nelson, D.R.: Assessment and refinement of eukaryotic gene structure prediction with gene-structure-aware multiple protein sequence alignment. BMC Bioinform. 15(1), 1–13 (2014)CrossRef Gotoh, O., Morita, M., Nelson, D.R.: Assessment and refinement of eukaryotic gene structure prediction with gene-structure-aware multiple protein sequence alignment. BMC Bioinform. 15(1), 1–13 (2014)CrossRef
12.
go back to reference Pan, J., et al.: DWPPI: a deep learning approach for predicting protein-protein interactions in plants based on multi-source information with a large-scale biological network. Front. Bioeng. Biotechnol. 10, 807522 (2022)CrossRef Pan, J., et al.: DWPPI: a deep learning approach for predicting protein-protein interactions in plants based on multi-source information with a large-scale biological network. Front. Bioeng. Biotechnol. 10, 807522 (2022)CrossRef
13.
go back to reference Li, L.-P., Zhang, B., Cheng, L.: CPIELA: computational prediction of plant protein-protein interactions by ensemble learning approach from protein sequences and evolutionary information. Front. Genet. 13, 857839 (2022)CrossRef Li, L.-P., Zhang, B., Cheng, L.: CPIELA: computational prediction of plant protein-protein interactions by ensemble learning approach from protein sequences and evolutionary information. Front. Genet. 13, 857839 (2022)CrossRef
Metadata
Title
Plant Protein Classification Using K-mer Encoding
Authors
K. Veningston
P. V. Venkateswara Rao
M. Pravallika Devi
S. Pranitha Reddy
M. Ronalda
Copyright Year
2024
DOI
https://doi.org/10.1007/978-3-031-48984-6_8

Premium Partner