2015 | OriginalPaper | Book chapter

A Supervised Framework for Classifying Dependency Relations from Bengali Shallow Parsed Sentences

Written by: Anupam Mondal, Dipankar Das

Published in: Mining Intelligence and Knowledge Exploration

Publisher: Springer International Publishing

Abstract

Natural Language Processing, one of the contemporary research areas, has adopted parsing technologies for various languages across the world for different objectives. In the present task, a new approach is introduced for classifying dependency parsed relations for Bengali, a morphologically rich Indian language with free phrase order. Pairs of dependency parsed relations (also referred to as kaarakas, 'cases') are classified based on features such as vibhaktis (inflections), Part-of-Speech (POS), punctuation, gender, number and post-position. It is observed that the consecutive and non-consecutive occurrences of such relations play a vital role in the classification. We employed three machine-learning classifiers, namely NaiveBayes, Sequential Minimal Optimization (SMO) and Conditional Random Field (CRF), which obtained average F-Scores of 0.895, 0.869 and 0.697, respectively, for classifying relation pairs of three primary kaarakas and one primary vibhakti relation. We have also conducted an error analysis for these primary relations using confusion matrices.
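To make the classification setup in the abstract more concrete, the following is a minimal sketch of a Naive Bayes classifier over categorical features such as vibhakti and POS. It is not the authors' implementation: the class name CategoricalNaiveBayes, the feature names, the feature values, the relation labels "k1" and "k2", and the four training rows are all hypothetical and invented purely for illustration. Add-one smoothing is an assumption of this sketch, not something the abstract states.

```python
import math
from collections import Counter, defaultdict


class CategoricalNaiveBayes:
    """Naive Bayes over categorical features with add-one smoothing.

    Hypothetical illustration only; not the classifier setup used in the paper.
    """

    def fit(self, rows, labels):
        self.label_counts = Counter(labels)
        # counts[label][feature_name][value] = occurrences in the training data
        self.counts = defaultdict(lambda: defaultdict(Counter))
        # values[feature_name] = set of values seen for that feature
        self.values = defaultdict(set)
        for row, label in zip(rows, labels):
            for name, value in row.items():
                self.counts[label][name][value] += 1
                self.values[name].add(value)
        return self

    def predict(self, row):
        total = sum(self.label_counts.values())
        best_label, best_score = None, -math.inf
        for label, n in self.label_counts.items():
            # Log prior plus the sum of smoothed log likelihoods per feature.
            score = math.log(n / total)
            for name, value in row.items():
                seen = self.counts[label][name][value]
                vocab = len(self.values[name]) + 1  # one extra slot for unseen values
                score += math.log((seen + 1) / (n + vocab))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Invented toy data: every feature value and label below is a placeholder.
train_rows = [
    {"vibhakti": "0", "pos": "NN", "postposition": "none"},
    {"vibhakti": "0", "pos": "PRP", "postposition": "none"},
    {"vibhakti": "ke", "pos": "NN", "postposition": "none"},
    {"vibhakti": "ke", "pos": "PRP", "postposition": "none"},
]
train_labels = ["k1", "k1", "k2", "k2"]

model = CategoricalNaiveBayes().fit(train_rows, train_labels)
print(model.predict({"vibhakti": "ke", "pos": "NN", "postposition": "none"}))  # k2
```

In this toy setting the inflection feature alone separates the two placeholder labels. The paper's feature set is richer, covering punctuation, gender, number, and the consecutive or non-consecutive occurrence of relations.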

Metadata
Title
A Supervised Framework for Classifying Dependency Relations from Bengali Shallow Parsed Sentences
Authors
Anupam Mondal
Dipankar Das
Copyright year
2015
DOI
https://doi.org/10.1007/978-3-319-26832-3_56