2020 | OriginalPaper | Chapter

Urdu Natural Language Processing Issues and Challenges: A Review Study

Authors: Usman Khan, Maaz Bin Ahmad, Farhan Shafiq, Muhammad Sarim

Published in: Intelligent Technologies and Applications

Publisher: Springer Singapore

Abstract

Natural language processing is the technology that helps computers understand human natural language. However, teaching a machine to understand how humans communicate is not an easy task. This paper summarizes some speech recognition techniques from the literature for new scholars to explore. It also discusses related work, along with an efficiency comparison across different natural languages. After that, it presents a brief summary of the Urdu language and of related work on Urdu language processing issues and challenges. Finally, it proposes future work for efficient processing of the Urdu language, along with some useful techniques.

DOI: https://doi.org/10.1007/978-981-15-5232-8_39