Skip to main content
Top

Investigating Part-of-Speech Tagging in Khasi Using Naïve Bayes and Support Vector Machine

  • 2023
  • OriginalPaper
  • Chapter
Published in:

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This chapter delves into the initial efforts of applying Naïve Bayes and Support Vector Machine models for part-of-speech tagging in the Khasi language, spoken by the indigenous people of Meghalaya, India. The research involves the creation of a manually annotated Khasi PoS corpus and the evaluation of these models on this corpus, yielding impressive accuracy rates. The study highlights the challenges of ambiguities and orthographic issues in the Khasi language and compares the results with state-of-the-art approaches. The chapter also discusses the potential for future work in expanding the corpus and addressing the linguistic challenges specific to Khasi. The experimental results and comparisons with other Indian languages underscore the significance of this work in advancing the computational linguistics of under-resourced languages.

Not a customer yet? Then find out more about our access models now:

Individual Access

Start your personal individual access now. Get instant access to more than 164,000 books and 540 journals – including PDF downloads and new releases.

Starting from 54,00 € per month!    

Get access

Access for Businesses

Utilise Springer Professional in your company and provide your employees with sound specialist knowledge. Request information about corporate access now.

Find out how Springer Professional can uplift your work!

Contact us now
Title
Investigating Part-of-Speech Tagging in Khasi Using Naïve Bayes and Support Vector Machine
Authors
Sunita Warjri
Partha Pakray
Saralin A. Lyngdoh
Arnab Kumar Maji
Copyright Year
2023
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-19-3679-1_11
This content is only visible if you are logged in and have the appropriate permissions.