Skip to main content

2020 | OriginalPaper | Buchkapitel

Urdu Natural Language Processing Issues and Challenges: A Review Study

verfasst von : Usman Khan, Maaz Bin Ahmad, Farhan Shafiq, Muhammad Sarim

Erschienen in: Intelligent Technologies and Applications

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Natural language processing is the technology used to aid computers to understand the human’s natural language. However this is not an easy task to teach a machine to understand how humans communicate. This paper provides a summary of information about some speech recognition techniques that are in the literature for new scholars to look into. It also discusses related work along with efficiency comparison for different natural languages. After that, a brief summary of Urdu language and related work done in Urdu language processing issues and challenges is presented. In the last part, future work is proposed for efficient processing of Urdu language along with some useful techniques.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Olson, H.F., Belar, H.: Phonetic typewriter. J. Acoust. Soc. Am. 28(6), 1072–1081 (1956)CrossRef Olson, H.F., Belar, H.: Phonetic typewriter. J. Acoust. Soc. Am. 28(6), 1072–1081 (1956)CrossRef
2.
Zurück zum Zitat Hussain, S.: Resources for Urdu language processing. In: Proceedings of the 6th Workshop on Asian Language Resources (2019) Hussain, S.: Resources for Urdu language processing. In: Proceedings of the 6th Workshop on Asian Language Resources (2019)
4.
Zurück zum Zitat Tran, D.T.: Fuzzy Approaches to Speech and Speaker Recognition. A thesis submitted for the degree of Doctor of Philosophy of the university of Canberra (2000) Tran, D.T.: Fuzzy Approaches to Speech and Speaker Recognition. A thesis submitted for the degree of Doctor of Philosophy of the university of Canberra (2000)
5.
Zurück zum Zitat Anusuya, M.A., Katti, S. K.: Speech recognition by machine: a review. Int. J. Comput. Sci. Inf. Secur. (2010) Anusuya, M.A., Katti, S. K.: Speech recognition by machine: a review. Int. J. Comput. Sci. Inf. Secur. (2010)
6.
Zurück zum Zitat Katagiri, S., et al.: A New hybrid algorithm for speech recognition based on HMM segmentation and learning Vector quantization. IEEE Transactions on Audio Speech and Language processing 1(4), 421–430 (1993)CrossRef Katagiri, S., et al.: A New hybrid algorithm for speech recognition based on HMM segmentation and learning Vector quantization. IEEE Transactions on Audio Speech and Language processing 1(4), 421–430 (1993)CrossRef
7.
Zurück zum Zitat Shaikh, M.K., Khowaja, H.A., Khan, M.A.: Urdu text translation with natural language processing. In: Student Conference On Engineering, Sciences and Technology, Karachi, Pakistan, pp. 81–85 (2004) Shaikh, M.K., Khowaja, H.A., Khan, M.A.: Urdu text translation with natural language processing. In: Student Conference On Engineering, Sciences and Technology, Karachi, Pakistan, pp. 81–85 (2004)
8.
Zurück zum Zitat Karim, R., Rahman, M.S., Iqbal, M.Z.: Recognition of spoken letters in bangla. In: Proceedings 5th International Conference on Computer and Information Technology (ICCIT02), Dhaka, Bangladesh (2002) Karim, R., Rahman, M.S., Iqbal, M.Z.: Recognition of spoken letters in bangla. In: Proceedings 5th International Conference on Computer and Information Technology (ICCIT02), Dhaka, Bangladesh (2002)
9.
Zurück zum Zitat Oney, B., Durgunoglu, A.Y.: Learning to read in Turkish: a phonologically transparent orthography. Appl. Psycholinguist. 18, 1–15 (1997)CrossRef Oney, B., Durgunoglu, A.Y.: Learning to read in Turkish: a phonologically transparent orthography. Appl. Psycholinguist. 18, 1–15 (1997)CrossRef
10.
Zurück zum Zitat Tamzida, A., Siddiqui, S.: A synchronic comparison between the vowel phonemes of Bengali & English phonology and its classroom applicability. Stamford J. English 6, 285–314 (2013)CrossRef Tamzida, A., Siddiqui, S.: A synchronic comparison between the vowel phonemes of Bengali & English phonology and its classroom applicability. Stamford J. English 6, 285–314 (2013)CrossRef
11.
Zurück zum Zitat Barman, B.: A contrastive analysis of english and bangla phonemics. Dhaka University J. Linguist. 2(4), 19–42 (2011)CrossRef Barman, B.: A contrastive analysis of english and bangla phonemics. Dhaka University J. Linguist. 2(4), 19–42 (2011)CrossRef
12.
Zurück zum Zitat Hossain, S.A., Rahman, M.L., Ahmed, F.: A review on bangla phoneme production and perception for computational approaches. In: 7th WSEAS International Conference on Mathematical Methods and Computational Techniques in Electrical Engineering, pp. 69–89 (2005) Hossain, S.A., Rahman, M.L., Ahmed, F.: A review on bangla phoneme production and perception for computational approaches. In: 7th WSEAS International Conference on Mathematical Methods and Computational Techniques in Electrical Engineering, pp. 69–89 (2005)
13.
Zurück zum Zitat Hassan, F., Alam Kotwal, M.R., Rahman, M.M., Nasiruddin, M., Latif, M.A., Nurul Huda, M.: Local feature or mel frequency cepstral coefficients - which one is better for mln-based bangla speech recognition? In: Abraham, A., Lloret Mauri, J., Buford, John F., Suzuki, J., Thampi, Sabu M. (eds.) ACC 2011. CCIS, vol. 191, pp. 154–161. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-22714-1_17CrossRef Hassan, F., Alam Kotwal, M.R., Rahman, M.M., Nasiruddin, M., Latif, M.A., Nurul Huda, M.: Local feature or mel frequency cepstral coefficients - which one is better for mln-based bangla speech recognition? In: Abraham, A., Lloret Mauri, J., Buford, John F., Suzuki, J., Thampi, Sabu M. (eds.) ACC 2011. CCIS, vol. 191, pp. 154–161. Springer, Heidelberg (2011). https://​doi.​org/​10.​1007/​978-3-642-22714-1_​17CrossRef
14.
Zurück zum Zitat Ali, M., Hossain, M., Bhuiyan, M.N., et al.: Automatic speech recognition technique for bangla words. Int. J. Adv. Sci. Technol. 50, 51–60 (2013) Ali, M., Hossain, M., Bhuiyan, M.N., et al.: Automatic speech recognition technique for bangla words. Int. J. Adv. Sci. Technol. 50, 51–60 (2013)
15.
Zurück zum Zitat Rahman, M.M., Khatun, F.: Development of isolated speech recognition system for bangla words. Daffodil Int. Univ. J. Sci. Technol. 6(1), 30–35 (2011)CrossRef Rahman, M.M., Khatun, F.: Development of isolated speech recognition system for bangla words. Daffodil Int. Univ. J. Sci. Technol. 6(1), 30–35 (2011)CrossRef
16.
Zurück zum Zitat Hasnat, M.A., Mowla, J., Khan, M.: Isolated and continuous bangla speech recognition: implementation, performance and application perspective. In: Center for research on Bangla language processing (CRBLP) (2007) Hasnat, M.A., Mowla, J., Khan, M.: Isolated and continuous bangla speech recognition: implementation, performance and application perspective. In: Center for research on Bangla language processing (CRBLP) (2007)
17.
Zurück zum Zitat Rahman, M.M., Bhuiyan, M.A.-A.: On segmentation and extraction of features from continuous bangla speech including windowing. Int. J. Appl. Res. Inf. Technol. Comput. 2(2), 31–40 (2011)CrossRef Rahman, M.M., Bhuiyan, M.A.-A.: On segmentation and extraction of features from continuous bangla speech including windowing. Int. J. Appl. Res. Inf. Technol. Comput. 2(2), 31–40 (2011)CrossRef
18.
Zurück zum Zitat Ettaouil, M., Lazaar, M., En-Naimani, Z.: A hybrid ANN/HMM models for arabic speech recognition using optimal codebook. In:2013 8th International Conference on Intelligent Systems: Theories and Applications (SITA), Rabat, pp. 1–5 (2013) Ettaouil, M., Lazaar, M., En-Naimani, Z.: A hybrid ANN/HMM models for arabic speech recognition using optimal codebook. In:2013 8th International Conference on Intelligent Systems: Theories and Applications (SITA), Rabat, pp. 1–5 (2013)
19.
Zurück zum Zitat Can, B., Artuner, H.: A syllable-based Turkish speech recognition system by using time delay neural networks (TDNNs). Department of Computer Engineering Hacettepe University Ankara, Turkey. IEEE (2013) Can, B., Artuner, H.: A syllable-based Turkish speech recognition system by using time delay neural networks (TDNNs). Department of Computer Engineering Hacettepe University Ankara, Turkey. IEEE (2013)
20.
Zurück zum Zitat Palaz, H., Kanak, A., Bicil, Y., Dog̃an, M.U., İslam, T.: TREN - Turkish speech recognition platform. In: 2005 13th European Signal Processing Conference, Antalya, pp. 1–4 (2005) Palaz, H., Kanak, A., Bicil, Y., Dog̃an, M.U., İslam, T.: TREN - Turkish speech recognition platform. In: 2005 13th European Signal Processing Conference, Antalya, pp. 1–4 (2005)
21.
Zurück zum Zitat Salor, O.L., Pellom, B., Çiloglu, T., Hacioglu, K., Demirekler, M.: On developing new text and audio corpora and speech recognition tools for the Turkish language. In: Seventh International Conference on Spoken Language Processing (2002) Salor, O.L., Pellom, B., Çiloglu, T., Hacioglu, K., Demirekler, M.: On developing new text and audio corpora and speech recognition tools for the Turkish language. In: Seventh International Conference on Spoken Language Processing (2002)
22.
Zurück zum Zitat Kuo, H.J., Arisoy, E., Mangu, L., Saon, G.: Minimum Bayes risk discriminative language models for Arabic speech recognition. In: 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, Waikoloa, HI, pp. 208–213 (2011) Kuo, H.J., Arisoy, E., Mangu, L., Saon, G.: Minimum Bayes risk discriminative language models for Arabic speech recognition. In: 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, Waikoloa, HI, pp. 208–213 (2011)
23.
Zurück zum Zitat Alotaibi, Y., Selouani, S.A., Alghamdi, M., Meftah, A.: Arabic and English speech recognition using cross-language acoustic models. In: 2012 11th International Conference on Information Science, Signal Processing and their Applications, ISSPA, pp. 40–44 (2012). https://doi.org/10.1109/isspa.2012.6310585 Alotaibi, Y., Selouani, S.A., Alghamdi, M., Meftah, A.: Arabic and English speech recognition using cross-language acoustic models. In: 2012 11th International Conference on Information Science, Signal Processing and their Applications, ISSPA, pp. 40–44 (2012). https://​doi.​org/​10.​1109/​isspa.​2012.​6310585
24.
Zurück zum Zitat Bayeh, R., Mokbel, C., Chollet, G.: Broadcast news transcription baseline system using the Nemlar database. In: Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC): the workshop in HLT & NLP within the Arabic world, Marrakech, Morocco (2008) Bayeh, R., Mokbel, C., Chollet, G.: Broadcast news transcription baseline system using the Nemlar database. In: Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC): the workshop in HLT & NLP within the Arabic world, Marrakech, Morocco (2008)
25.
Zurück zum Zitat Hammami, N., Bedda, M., Farah, N.: Probabilistic classification based on Gaussian copula for speech recognition: Application to Spoken Arabic digits. In: 2013 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA), Poznan, pp. 312–317 (2013) Hammami, N., Bedda, M., Farah, N.: Probabilistic classification based on Gaussian copula for speech recognition: Application to Spoken Arabic digits. In: 2013 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA), Poznan, pp. 312–317 (2013)
27.
Zurück zum Zitat Shaikh, S., Strzalkowski, T., Webb, N.: Classification of dialogue acts in Urdu multi- party discourse (2011) Shaikh, S., Strzalkowski, T., Webb, N.: Classification of dialogue acts in Urdu multi- party discourse (2011)
31.
Zurück zum Zitat Shaukat, A.A., Ali, H., Akram, U.: Automatic Urdu speech recognition using hidden Markov model. In: 2016 International Conference on Image, Vision and Computing (ICIVC), Portsmouth, pp. 135–139 (2016) Shaukat, A.A., Ali, H., Akram, U.: Automatic Urdu speech recognition using hidden Markov model. In: 2016 International Conference on Image, Vision and Computing (ICIVC), Portsmouth, pp. 135–139 (2016)
32.
Zurück zum Zitat Anwar, W., Wang, X., Wang, X.: A survey of automatic Urdu language Processing. In: 2006 International Conference on Machine Learning and Cybernetics, Dalian, China, pp. 4489–4494 (2006) Anwar, W., Wang, X., Wang, X.: A survey of automatic Urdu language Processing. In: 2006 International Conference on Machine Learning and Cybernetics, Dalian, China, pp. 4489–4494 (2006)
33.
Zurück zum Zitat Qasim, M., Nawaz, S., Hussain, S., Habib, T.: Urdu speech recognition system for district names of Pakistan: Development, challenges and solutions. In: 2016 Conference of the Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques (O-COCOSDA), Bali, pp. 28–32 (2016) Qasim, M., Nawaz, S., Hussain, S., Habib, T.: Urdu speech recognition system for district names of Pakistan: Development, challenges and solutions. In: 2016 Conference of the Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques (O-COCOSDA), Bali, pp. 28–32 (2016)
36.
Zurück zum Zitat Oprea, M., Şchiopu, D.: An artificial neural network-based isolated word speech recognition system for the Romanian language. In: 2012 16th International Conference on System Theory, Control and Computing (ICSTCC), Sinaia, pp. 1–6 (2012) Oprea, M., Şchiopu, D.: An artificial neural network-based isolated word speech recognition system for the Romanian language. In: 2012 16th International Conference on System Theory, Control and Computing (ICSTCC), Sinaia, pp. 1–6 (2012)
37.
Zurück zum Zitat Revathi, B., Humera Khanam, B.: Hindi To English part of speech tagger by using Crf method. North Asian Int. Res. J. Sci. Eng. I.T. 2(1), 2–10 (2016). ISSN: 2454-7514 Revathi, B., Humera Khanam, B.: Hindi To English part of speech tagger by using Crf method. North Asian Int. Res. J. Sci. Eng. I.T. 2(1), 2–10 (2016). ISSN: 2454-7514
38.
Zurück zum Zitat Hussain, S.: Letter-To-sound conversion For Urdu text-to-speech system. Coling (2004) Hussain, S.: Letter-To-sound conversion For Urdu text-to-speech system. Coling (2004)
39.
Zurück zum Zitat Medhi, B., Talukdar, P.H.: Isolated Assamese speech recognition using artificial neural network. In: 2015 International Symposium on Advanced Computing and Communication (ISACC), Silchar, pp. 141–148 (2015) Medhi, B., Talukdar, P.H.: Isolated Assamese speech recognition using artificial neural network. In: 2015 International Symposium on Advanced Computing and Communication (ISACC), Silchar, pp. 141–148 (2015)
40.
Zurück zum Zitat Krishnan, V.R.V., Jayakumar, A., Babu, A.P.: Speech recognition of isolated Malayalam words using wavelet features and artificial neural network. In:4th IEEE International Symposium on Electronic Design, Test and Applications (delta 2008), Hong Kong, pp. 240-243 (2008) Krishnan, V.R.V., Jayakumar, A., Babu, A.P.: Speech recognition of isolated Malayalam words using wavelet features and artificial neural network. In:4th IEEE International Symposium on Electronic Design, Test and Applications (delta 2008), Hong Kong, pp. 240-243 (2008)
41.
Zurück zum Zitat Polur, P.D., Zhou, R., Yang, J., Adnani, F., Hobson, R.S.: Isolated speech recognition using artificial neural networks. In: 2001 Conference Proceedings of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Istanbul, Turkey, vol. 2, pp. 1731–1734 2001 Polur, P.D., Zhou, R., Yang, J., Adnani, F., Hobson, R.S.: Isolated speech recognition using artificial neural networks. In: 2001 Conference Proceedings of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Istanbul, Turkey, vol. 2, pp. 1731–1734 2001
42.
Zurück zum Zitat Sukumar, R.A., Sukumar, S.A., Shah, F.A., Anto, B.P.: Key-word based query recognition in a speech corpus by using artificial neural networks. In:2010 2nd International Conference on Computational Intelligence, Communication Systems and Networks, Liverpool, pp. 33–36 (2010) Sukumar, R.A., Sukumar, S.A., Shah, F.A., Anto, B.P.: Key-word based query recognition in a speech corpus by using artificial neural networks. In:2010 2nd International Conference on Computational Intelligence, Communication Systems and Networks, Liverpool, pp. 33–36 (2010)
43.
Zurück zum Zitat Sukumar, A.R., Shah, A.F., Anto, P.B.: Isolated question words recognition from speech queries by using Artificial Neural Networks. In: 2010 Second International conference on Computing, Communication and Networking Technologies, Karur, pp. 1–4 (2010) Sukumar, A.R., Shah, A.F., Anto, P.B.: Isolated question words recognition from speech queries by using Artificial Neural Networks. In: 2010 Second International conference on Computing, Communication and Networking Technologies, Karur, pp. 1–4 (2010)
44.
Zurück zum Zitat Dey, N.S., Mohanty, R., Chugh, K.L.: Speech and speaker recognition system using artificial neural networks and hidden markov model. In: 2012 International Conference on Communication Systems and Network Technologies, Rajkot, pp. 311–315 (2012) Dey, N.S., Mohanty, R., Chugh, K.L.: Speech and speaker recognition system using artificial neural networks and hidden markov model. In: 2012 International Conference on Communication Systems and Network Technologies, Rajkot, pp. 311–315 (2012)
47.
Zurück zum Zitat Benvenuto, N., Marchesi, M., Piazza, F., Uncini, A.: A comparison between real and complex valued neural networks in communication applications. In: Teuvo, K., Kai, M., Olli, S., Jari, K. (eds.) Artificial Neural Networks, North-Holland, pp. 1177–1180 (1991) Benvenuto, N., Marchesi, M., Piazza, F., Uncini, A.: A comparison between real and complex valued neural networks in communication applications. In: Teuvo, K., Kai, M., Olli, S., Jari, K. (eds.) Artificial Neural Networks, North-Holland, pp. 1177–1180 (1991)
Metadaten
Titel
Urdu Natural Language Processing Issues and Challenges: A Review Study
verfasst von
Usman Khan
Maaz Bin Ahmad
Farhan Shafiq
Muhammad Sarim
Copyright-Jahr
2020
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-15-5232-8_39