Skip to main content
Top

Context-Based Bigram Model for POS Tagging in Hindi: A Heuristic Approach

  • 16-08-2022
Published in:

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The article introduces a context-based bigram model for POS tagging in Hindi, addressing the complexities of language ambiguity and morphological structure. It leverages a heuristic approach to predict POS tags by considering the context of words in both forward and reverse directions. The model is trained on a large corpus of Hindi words and demonstrates superior performance compared to existing methods. The research highlights the importance of considering both preceding and following tags for accurate POS tagging in Hindi. The proposed approach uses a heuristic method to calculate probability distributions for word-tag and tag-tag pairs, leading to improved accuracy in tagging. The article also discusses the limitations and potential future improvements of the proposed model, emphasizing the need for more sophisticated algorithms to handle the rich morphological structure of Indian languages.

Not a customer yet? Then find out more about our access models now:

Individual Access

Start your personal individual access now. Get instant access to more than 164,000 books and 540 journals – including PDF downloads and new releases.

Starting from 54,00 € per month!    

Get access

Access for Businesses

Utilise Springer Professional in your company and provide your employees with sound specialist knowledge. Request information about corporate access now.

Find out how Springer Professional can uplift your work!

Contact us now
Title
Context-Based Bigram Model for POS Tagging in Hindi: A Heuristic Approach
Authors
Santosh Kumar Bharti
Rajeev Kumar Gupta
Samir Patel
Manan Shah
Publication date
16-08-2022
Publisher
Springer Berlin Heidelberg
Published in
Annals of Data Science / Issue 1/2024
Print ISSN: 2198-5804
Electronic ISSN: 2198-5812
DOI
https://doi.org/10.1007/s40745-022-00434-4
This content is only visible if you are logged in and have the appropriate permissions.
    Image Credits
    Schmalkalden/© Schmalkalden, NTT Data/© NTT Data, Verlagsgruppe Beltz/© Verlagsgruppe Beltz, ibo Software GmbH/© ibo Software GmbH, Sovero/© Sovero, Axians Infoma GmbH/© Axians Infoma GmbH, Prosoz Herten GmbH/© Prosoz Herten GmbH, Stormshield/© Stormshield, MACH AG/© MACH AG, OEDIV KG/© OEDIV KG, Rundstedt & Partner GmbH/© Rundstedt & Partner GmbH, Doxee AT GmbH/© Doxee AT GmbH , Governikus GmbH & Co. KG/© Governikus GmbH & Co. KG, Vendosoft/© Vendosoft