Skip to main content
Top

2016 | OriginalPaper | Chapter

Clustering-Based Weighted Extreme Learning Machine for Classification in Drug Discovery Process

Authors : Wasu Kudisthalert, Kitsuchart Pasupa

Published in: Neural Information Processing

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Extreme Learning Machine (ELM) is a universal approximation method that is extremely fast and easy to implement, but the weights of the model are normally randomly selected so they can lead to poor prediction performance. In this work, we applied Weighted Similarity Extreme Learning Machine in combination with Jaccard/Tanimoto (WELM-JT) and cluster analysis (namely, k-means clustering and Support Vector Clustering) on similarity and distance measures (i.e., Jaccard/Tanimoto and Euclidean) in order to predict which compounds with not-so-different chemical structures have an activity for treating a certain symptom or disease. The proposed method was experimented on one of the most challenging datasets named Maximum Unbiased Validation (MUV) dataset with 4 different types of fingerprints (i.e. ECFP_4, ECFP_6, FCFP_4 and FCFP_6). The experimental results show that WELM-JT in combination with k-means-ED gave the best performance. It retrieved the highest number of active molecules and used the lowest number of nodes. Meanwhile, WELM-JT with k-means-JT and ECFP_6 encoding proved to be a robust contender for most of the activity classes.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Liu, X., Song, H., Zhang, J., Han, B., Wei, X., Ma, X., Cui, W., Chen, Y.: Identifying novel type ZBGs and nonhydroxamate HDAC inhibitors through a SVM based virtual screening approach. Mol. Inf. 29(5), 407–420 (2010)CrossRef Liu, X., Song, H., Zhang, J., Han, B., Wei, X., Ma, X., Cui, W., Chen, Y.: Identifying novel type ZBGs and nonhydroxamate HDAC inhibitors through a SVM based virtual screening approach. Mol. Inf. 29(5), 407–420 (2010)CrossRef
2.
go back to reference Chen, B., Harrison, R.F., Pasupa, K., Willett, P., Wilton, D.J., Wood, D.J., Lewell, X.Q.: Virtual screening using binary kernel discrimination: effect of noisy training data and the optimization of performance. J. Chem. Inf. Model. 46(2), 478–486 (2006)CrossRef Chen, B., Harrison, R.F., Pasupa, K., Willett, P., Wilton, D.J., Wood, D.J., Lewell, X.Q.: Virtual screening using binary kernel discrimination: effect of noisy training data and the optimization of performance. J. Chem. Inf. Model. 46(2), 478–486 (2006)CrossRef
3.
go back to reference Czarnecki, W.M.: Weighted tanimoto extreme learning machine with case study in drug discovery. IEEE Comput. Intell. Mag. 10(3), 19–29 (2015)CrossRef Czarnecki, W.M.: Weighted tanimoto extreme learning machine with case study in drug discovery. IEEE Comput. Intell. Mag. 10(3), 19–29 (2015)CrossRef
4.
go back to reference Kudisthalert, W., Pasupa, K.: A coefficient comparison of weighted similarity extreme learning machine for drug screening. In: 2016 8th International Conference on Knowledge and Smart Technology (KST), pp. 43–48. IEEE (2016) Kudisthalert, W., Pasupa, K.: A coefficient comparison of weighted similarity extreme learning machine for drug screening. In: 2016 8th International Conference on Knowledge and Smart Technology (KST), pp. 43–48. IEEE (2016)
5.
go back to reference Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: a new learning scheme of feedforward neural networks. In: Proceedings of the 2004 IEEE International Joint Conference on Neural Networks, vol. 2, pp. 985–990. IEEE (2004) Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: a new learning scheme of feedforward neural networks. In: Proceedings of the 2004 IEEE International Joint Conference on Neural Networks, vol. 2, pp. 985–990. IEEE (2004)
6.
go back to reference Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: theory and applications. Neurocomputing 70(1), 489–501 (2006)CrossRef Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: theory and applications. Neurocomputing 70(1), 489–501 (2006)CrossRef
7.
go back to reference Jain, A.K.: Data clustering: 50 years beyond K-means. Pattern Recogn. Lett. 31(8), 651–666 (2010)CrossRef Jain, A.K.: Data clustering: 50 years beyond K-means. Pattern Recogn. Lett. 31(8), 651–666 (2010)CrossRef
8.
go back to reference Hartigan, J.A., Wong, M.A.: Algorithm as 136: a k-means clustering algorithm. J. Roy. Stat. Soc. Ser. C (Appl. Stat.) 28(1), 100–108 (1979)MATH Hartigan, J.A., Wong, M.A.: Algorithm as 136: a k-means clustering algorithm. J. Roy. Stat. Soc. Ser. C (Appl. Stat.) 28(1), 100–108 (1979)MATH
9.
go back to reference Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (2013)MATH Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (2013)MATH
10.
go back to reference Ben-Hur, A., Horn, D., Siegelmann, H.T., Vapnik, V.: Support vector clustering. J. Mach. Learn. Res. 2, 125–137 (2002)MATH Ben-Hur, A., Horn, D., Siegelmann, H.T., Vapnik, V.: Support vector clustering. J. Mach. Learn. Res. 2, 125–137 (2002)MATH
11.
go back to reference Cereto-Massagué, A., Ojeda, M.J., Valls, C., Mulero, M., Garcia-Vallvé, S., Pujadas, G.: Molecular fingerprint similarity search in virtual screening. Methods 71, 58–63 (2015)CrossRef Cereto-Massagué, A., Ojeda, M.J., Valls, C., Mulero, M., Garcia-Vallvé, S., Pujadas, G.: Molecular fingerprint similarity search in virtual screening. Methods 71, 58–63 (2015)CrossRef
12.
go back to reference Gardiner, E.J., Holliday, J.D., O’Dowd, C., Willett, P.: Effectiveness of 2D fingerprints for scaffold hopping. Future Med. Chem. 3(4), 405–414 (2011)CrossRef Gardiner, E.J., Holliday, J.D., O’Dowd, C., Willett, P.: Effectiveness of 2D fingerprints for scaffold hopping. Future Med. Chem. 3(4), 405–414 (2011)CrossRef
13.
go back to reference Rohrer, S.G., Baumann, K.: Maximum unbiased validation (MUV) data sets for virtual screening based on PubChem bioactivity data. J. Chem. Inf. Model. 49(2), 169–184 (2009)CrossRef Rohrer, S.G., Baumann, K.: Maximum unbiased validation (MUV) data sets for virtual screening based on PubChem bioactivity data. J. Chem. Inf. Model. 49(2), 169–184 (2009)CrossRef
14.
go back to reference Wang, Y., Xiao, J., Suzek, T.O., Zhang, J., Wang, J., Bryant, S.H.: PubChem: a public information system for analyzing bioactivities of small molecules. Nucleic Acids Res. 37, W623–W633 (2009). gkp456CrossRef Wang, Y., Xiao, J., Suzek, T.O., Zhang, J., Wang, J., Bryant, S.H.: PubChem: a public information system for analyzing bioactivities of small molecules. Nucleic Acids Res. 37, W623–W633 (2009). gkp456CrossRef
15.
go back to reference Lee, D., Lee, J.: Support vector clustering (SVC) toolbox Lee, D., Lee, J.: Support vector clustering (SVC) toolbox
16.
go back to reference Siegel, S.: Nonparametric statistics for the behavioral sciences (1956) Siegel, S.: Nonparametric statistics for the behavioral sciences (1956)
Metadata
Title
Clustering-Based Weighted Extreme Learning Machine for Classification in Drug Discovery Process
Authors
Wasu Kudisthalert
Kitsuchart Pasupa
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-46687-3_49

Premium Partner