Skip to main content

2020 | OriginalPaper | Buchkapitel

Sanskrit Stopword Analysis Through Morphological Analyzer and Its Gujarati Equivalent for MT System

verfasst von : Jaideepsinh Raulji, Jatinderkumar R. Saini

Erschienen in: ICT Analysis and Applications

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The identification and removal of a stopword is a common preprocessing task in many natural language processing implementations. The morphologically parsed information of stopword is also relevant in analysis of various NLP tasks. The list of most common seventy-five Sanskrit stopwords are evaluated using rule-based morphological analyzer. Most stopwords were classified as indeclinables and pronouns. The Gujarati equivalent of stopwords is retrieved using bilingual dictionary so as to cache the data for faster retrieval during MT process.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Raulji J, Saini J (2017) Generating stopword list for Sanskrit language. In: 2017 IEEE 7th international advance computing conference (IACC), Jan 2017. IEEE. pp. 799–802 Raulji J, Saini J (2017) Generating stopword list for Sanskrit language. In: 2017 IEEE 7th international advance computing conference (IACC), Jan 2017. IEEE. pp. 799–802
2.
Zurück zum Zitat Ghosh K, Bhattacharya A (2017) Stopword removal: Why bother? A case study on verbose queries. In: Proceedings of the 10th annual ACM India compute conference on ZZZ, pp 99–102. ACM Ghosh K, Bhattacharya A (2017) Stopword removal: Why bother? A case study on verbose queries. In: Proceedings of the 10th annual ACM India compute conference on ZZZ, pp 99–102. ACM
3.
Zurück zum Zitat Joshi H, Pareek J, Patel R, Chauhan K (2012) To stop or not to stop—Experiments on stopword elimination for information retrieval of Gujarati text documents. In: 2012 Nirma university international conference on engineering (NUiCONE), pp 1–4. IEEE Joshi H, Pareek J, Patel R, Chauhan K (2012) To stop or not to stop—Experiments on stopword elimination for information retrieval of Gujarati text documents. In: 2012 Nirma university international conference on engineering (NUiCONE), pp 1–4. IEEE
4.
Zurück zum Zitat Jha V, Manjunath N, Shenoy PD, Venugopal KR (2016) Hsra: Hindi stopword removal algorithm. In: 2016 international conference on microelectronics, computing and communications (MicroCom), pp 1–5. IEEE Jha V, Manjunath N, Shenoy PD, Venugopal KR (2016) Hsra: Hindi stopword removal algorithm. In: 2016 international conference on microelectronics, computing and communications (MicroCom), pp 1–5. IEEE
5.
Zurück zum Zitat Ghag KV, Shah K (2015) Comparative analysis of effect of stopwords removal on sentiment classification. In: 2015 international conference on computer, communication and control (IC4), pp 1–6. IEEE Ghag KV, Shah K (2015) Comparative analysis of effect of stopwords removal on sentiment classification. In: 2015 international conference on computer, communication and control (IC4), pp 1–6. IEEE
6.
Zurück zum Zitat Lazarinis F (2007) Lemmatization and stopword elimination in Greek Web searching. In: Proceedings of the 2007 Euro American conference on Telematics and information systems, p 61. ACM Lazarinis F (2007) Lemmatization and stopword elimination in Greek Web searching. In: Proceedings of the 2007 Euro American conference on Telematics and information systems, p 61. ACM
7.
Zurück zum Zitat Govilkar S, Bakal JW, Kulkarni SR (2016) Extraction of root words using morphological analyzer for devanagari script. Int J Inf Technol Comput Sci (IJITCS) 8(1): 33 Govilkar S, Bakal JW, Kulkarni SR (2016) Extraction of root words using morphological analyzer for devanagari script. Int J Inf Technol Comput Sci (IJITCS) 8(1): 33
8.
Zurück zum Zitat Raulji J, Saini J (2019) Sanskrit lemmatizer for improvisation of Morpholological analyzer. In: Presented at international conference on emerging technologies in computer engineering: microservices in big-data analytics, SKIT, Jaipur, India, Feb 2019. ESCI Taylor and Francis Raulji J, Saini J (2019) Sanskrit lemmatizer for improvisation of Morpholological analyzer. In: Presented at international conference on emerging technologies in computer engineering: microservices in big-data analytics, SKIT, Jaipur, India, Feb 2019. ESCI Taylor and Francis
9.
Zurück zum Zitat Jha GN, Agrawal M, Subhash C, Mishra S, Mani D, Mishra D, Bhadra M, Singh S (2009) Inflectional morphology analyzer for Sanskrit. In: Kulkarni A, Huet G (eds) Sanskrit computational linguistics 1 and 2, LNAI, vol 5402, pp 219–238. Springer Jha GN, Agrawal M, Subhash C, Mishra S, Mani D, Mishra D, Bhadra M, Singh S (2009) Inflectional morphology analyzer for Sanskrit. In: Kulkarni A, Huet G (eds) Sanskrit computational linguistics 1 and 2, LNAI, vol 5402, pp 219–238. Springer
10.
Zurück zum Zitat Tapaswi N, Jain S (2012) Treebank based deep grammar acquisition and part of speech tagging for Sanskrit sentences. In: CSI 6th international conference on software engineering (CONSEG), Sept 2012. IEEE Tapaswi N, Jain S (2012) Treebank based deep grammar acquisition and part of speech tagging for Sanskrit sentences. In: CSI 6th international conference on software engineering (CONSEG), Sept 2012. IEEE
11.
Zurück zum Zitat Bharati A, Kulkarni A, Sheeba V (2006) Building a wide coverage Sanskrit morphological analyzer: a practical approach. The first national symposium on modelling and shallow parsing of Indian languages, IIT Bombay Bharati A, Kulkarni A, Sheeba V (2006) Building a wide coverage Sanskrit morphological analyzer: a practical approach. The first national symposium on modelling and shallow parsing of Indian languages, IIT Bombay
Metadaten
Titel
Sanskrit Stopword Analysis Through Morphological Analyzer and Its Gujarati Equivalent for MT System
verfasst von
Jaideepsinh Raulji
Jatinderkumar R. Saini
Copyright-Jahr
2020
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-15-0630-7_42

Neuer Inhalt