2016 | OriginalPaper | Chapter

Deep Belief Network Based Part-of-Speech Tagger for Telugu Language

Authors: M. Jagadeesh, M. Anand Kumar, K. P. Soman

Published in: Proceedings of the Second International Conference on Computer and Communication Technologies

Publisher: Springer India

Abstract

Indian languages have very few linguistic resources despite their large speaker base. They are morphologically rich, which makes sequential tagging, or any other type of language analysis, difficult. In natural language processing, part-of-speech (POS) tagging is a basic tool that makes it possible to extract terminology using linguistic patterns. The main aim of this research is to perform sequential tagging for Indian languages based on unsupervised features and the distributional information of a word with its neighboring words. The results of machine learning algorithms depend on the data representation. Not all of the data contributes to the creation of the model, so some of it goes unused, and the extent of this depends on the descriptive factors of data disparity. Data representations are usually designed using domain-specific knowledge, but an aim of artificial intelligence is to reduce this dependence on domain-specific representations so that methods can be applied to new domains. Recently, deep learning algorithms have attracted substantial interest for reducing the dimensionality of features and extracting latent features. Recent developments and applications of deep learning algorithms have given impressive results in several areas, mostly in image and text applications.
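
To make the phrase "distributional information of a word with its neighboring words" concrete, here is a minimal Python sketch that counts which words occur immediately to the left and right of each word. It is only an illustration and not the authors' implementation: the function name `distributional_features`, the `window` parameter, and the toy English corpus are all assumptions introduced here. In a pipeline of the kind the abstract describes, count vectors like these could serve as the unlabeled input from which a model learns lower-dimensional features.

```python
from collections import Counter, defaultdict


def distributional_features(sentences, window=1):
    """Count, for each word, the words seen within `window` positions
    to its left (prefix "L:") and to its right (prefix "R:").

    Hypothetical helper for illustration; not taken from the paper.
    """
    features = defaultdict(Counter)
    for tokens in sentences:
        for i, word in enumerate(tokens):
            for offset in range(-window, window + 1):
                j = i + offset
                # Skip the word itself and positions outside the sentence.
                if offset == 0 or j < 0 or j >= len(tokens):
                    continue
                side = "L" if offset < 0 else "R"
                features[word][f"{side}:{tokens[j]}"] += 1
    return features


# Toy, made-up corpus (placeholder tokens, not data from the paper).
corpus = [
    ["the", "dog", "runs"],
    ["the", "cat", "runs"],
    ["a", "dog", "sleeps"],
]

feats = distributional_features(corpus)
print(dict(feats["dog"]))
print(dict(feats["cat"]))
```

In this toy corpus "dog" and "cat" share the contexts `L:the` and `R:runs`, which is the kind of overlap that lets words with similar syntactic behavior be grouped without labeled data.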

Metadata

Copyright Year: 2016
DOI: https://doi.org/10.1007/978-81-322-2526-3_9