2017 | OriginalPaper | Book chapter

The Application of Naive Bayes Classifier in Name Disambiguation

Written by: Na Li, Jin Han

Published in: Cloud Computing and Security

Publisher: Springer International Publishing

Abstract

Name repetition exists in academic resource management systems, which complicates academic evaluation, information retrieval, citation analysis, and related tasks. Because different authors have different habits in their use of function words, this paper applies a Naive Bayes classifier to the problem. Under the assumption of feature independence, the paper selects 26 common, high-frequency function words as the basis for frequency statistics and uses a Naive Bayes classifier to classify texts. Experiments show that the method achieves a high accuracy rate.
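The approach described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a multinomial Naive Bayes model with Laplace smoothing, and the word list `FUNCTION_WORDS` is a hypothetical ten-word stand-in, since the abstract does not list the 26 function words used in the paper. The function names (`function_word_counts`, `train`, `classify`) are likewise illustrative.

```python
import math
import re
from collections import Counter

# Hypothetical stand-in list; the paper's 26 function words are not
# given in the abstract.
FUNCTION_WORDS = ["the", "of", "and", "to", "in", "a", "that", "is", "for", "with"]


def function_word_counts(text):
    """Return the count of each function word in the text, in list order."""
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter(t for t in tokens if t in FUNCTION_WORDS)
    return [counts[w] for w in FUNCTION_WORDS]


def train(labelled_texts):
    """Estimate a log prior and smoothed log likelihoods for each author.

    labelled_texts is an iterable of (author_label, text) pairs.
    """
    totals = {}       # author -> summed function-word count vector
    docs = Counter()  # author -> number of training texts
    for author, text in labelled_texts:
        vec = function_word_counts(text)
        docs[author] += 1
        prev = totals.setdefault(author, [0] * len(FUNCTION_WORDS))
        totals[author] = [p + v for p, v in zip(prev, vec)]

    n_docs = sum(docs.values())
    model = {}
    for author, vec in totals.items():
        # Laplace (add-one) smoothing keeps unseen words from zeroing a class.
        denom = sum(vec) + len(FUNCTION_WORDS)
        log_prior = math.log(docs[author] / n_docs)
        log_likelihoods = [math.log((c + 1) / denom) for c in vec]
        model[author] = (log_prior, log_likelihoods)
    return model


def classify(model, text):
    """Return the author label with the highest posterior log score."""
    vec = function_word_counts(text)

    def score(author):
        log_prior, log_likelihoods = model[author]
        # Feature independence: per-word log likelihoods simply add up.
        return log_prior + sum(c * l for c, l in zip(vec, log_likelihoods))

    return max(model, key=score)
```

To use the sketch, pass `train` a list of `(author_label, text)` pairs for the candidate authors sharing a name, then call `classify(model, new_text)` on an unattributed text. Working in log space avoids numeric underflow when many word counts are multiplied together.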

Metadata
Title
The Application of Naive Bayes Classifier in Name Disambiguation
Authors
Na Li
Jin Han
Copyright year
2017
DOI
https://doi.org/10.1007/978-3-319-68542-7_52