Skip to main content
Erschienen in: Artificial Intelligence Review 3/2022

13.09.2021

Attention-based neural joint source-channel coding of text for point to point and broadcast channel

verfasst von: Ting Liu, Xuechen Chen

Erschienen in: Artificial Intelligence Review | Ausgabe 3/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this work, we consider the transmissions of structured data such as text over a noisy channel and correlated texts over a broadcast channel. As the separate source-channel coding principle no longer holds in such scenarios, we propose a joint source-channel coding scheme which is based on deep learning architecture. In order to enhance the convergence speed, we adopt the bidirectional gated recurrent unit at the encoder. For the decoder, to improve the recovery quality, we propose the following two types of strategies: (1) After a unidirectional neural network based decoder is used, a generative adversarial network is applied to train the whole joint source-channel coding framework and pointwise mutual information is added to the objective function of beam search process; (2) Rather than using a unidirectional neural network-based decoder, we develop a bidirectional neural network based and bidirectional attention mechanism integrated decoder to utilize past and future information. Experiments under different types of channels show that our schemes are superior to the existing deep learning joint source-channel coding method and in the case of low bit budget, long sentence length and small channel signal to noise ratio, our models are significantly superior to those of separate source-channel coding. In addition, we extend the proposed unidirectional and bidirectional decoders to the broadcast channel. Additionally, to improve the performance of unidirectional decoding, we utilize not only the correlation between adjacent words in the same text but also the correlation between words in different languages with the same meaning in the beam search process.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Balsa J, Fresnedo Ó, Domínguez-Bolaño T, García-Naya JA, Castedo L (2019) Experimental evaluation of analog encoding for the wireless transmission of still images. In: 2019 IEEE 20th international workshop on signal processing advances in wireless communications (SPAWC), pp 1–5 Balsa J, Fresnedo Ó, Domínguez-Bolaño T, García-Naya JA, Castedo L (2019) Experimental evaluation of analog encoding for the wireless transmission of still images. In: 2019 IEEE 20th international workshop on signal processing advances in wireless communications (SPAWC), pp 1–5
Zurück zum Zitat Bennatan A, Burshtein D, Caire G, Shamai S (2006) Superposition coding for side-information channels. IEEE Trans Inf Theory 52(5):1872–1889MathSciNetCrossRef Bennatan A, Burshtein D, Caire G, Shamai S (2006) Superposition coding for side-information channels. IEEE Trans Inf Theory 52(5):1872–1889MathSciNetCrossRef
Zurück zum Zitat Bourtsoulatze E, Kurka D, Gündüz D (2019) Deep joint source-channel coding for wireless image transmission. In: ICASSP 2019—2019 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 4774–4778 Bourtsoulatze E, Kurka D, Gündüz D (2019) Deep joint source-channel coding for wireless image transmission. In: ICASSP 2019—2019 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 4774–4778
Zurück zum Zitat Cheok MJ, Omar Z, Jaward MH (2019) A review of hand gesture and sign language recognition techniques. Int J Mach Learn Cybern 10:131–153CrossRef Cheok MJ, Omar Z, Jaward MH (2019) A review of hand gesture and sign language recognition techniques. Int J Mach Learn Cybern 10:131–153CrossRef
Zurück zum Zitat Choi K, Tatwawadi K, Grover A, Weissman T, Ermon S (2019) Neural joint source-channel coding. In: ICML Choi K, Tatwawadi K, Grover A, Weissman T, Ermon S (2019) Neural joint source-channel coding. In: ICML
Zurück zum Zitat Farsad N, Rao M, Goldsmith A (2018) Deep learning for joint source-channel coding of text. In: 2018 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 2326–2330 Farsad N, Rao M, Goldsmith A (2018) Deep learning for joint source-channel coding of text. In: 2018 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 2326–2330
Zurück zum Zitat Gao Y, Tuncel E (2011) Wyner-ziv coding over broadcast channels: hybrid digital/analog schemes. IEEE Trans Inf Theory 57(9):5660–5672MathSciNetCrossRef Gao Y, Tuncel E (2011) Wyner-ziv coding over broadcast channels: hybrid digital/analog schemes. IEEE Trans Inf Theory 57(9):5660–5672MathSciNetCrossRef
Zurück zum Zitat Goodfellow IJ, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville AC, Bengio Y (2014) Generative adversarial nets. In: NIPS Goodfellow IJ, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville AC, Bengio Y (2014) Generative adversarial nets. In: NIPS
Zurück zum Zitat Guzmán F, Chen P, Ott M, Pino J, Lample G, Koehn P, Chaudhary V, Ranzato M (2019) Two new evaluation datasets for low-resource machine translation: Nepali-English and Sinhala-English. In: EMNLP/IJCNLP Guzmán F, Chen P, Ott M, Pino J, Lample G, Koehn P, Chaudhary V, Ranzato M (2019) Two new evaluation datasets for low-resource machine translation: Nepali-English and Sinhala-English. In: EMNLP/IJCNLP
Zurück zum Zitat Huffman D (1952) A method for the construction of minimum-redundancy codes. Resonance 11:91–99CrossRef Huffman D (1952) A method for the construction of minimum-redundancy codes. Resonance 11:91–99CrossRef
Zurück zum Zitat Jain A, Jain A, Chauhan N, Singh V, Thakur N (2017) Information retrieval using cosine and Jaccard similarity measures in vector space model. Int J Comput Appl 164:28–30 Jain A, Jain A, Chauhan N, Singh V, Thakur N (2017) Information retrieval using cosine and Jaccard similarity measures in vector space model. Int J Comput Appl 164:28–30
Zurück zum Zitat Li J, Monroe W, Shi T, Jean S, Ritter A, Jurafsky D (2017) Adversarial learning for neural dialogue generation. In: The 2017 conference on empirical methods in natural language processing. arXiv:abs/1701.06547 Li J, Monroe W, Shi T, Jean S, Ritter A, Jurafsky D (2017) Adversarial learning for neural dialogue generation. In: The 2017 conference on empirical methods in natural language processing. arXiv:​abs/​1701.​06547
Zurück zum Zitat Navarro G (2001) A guided tour to approximate string matching. ACM Comput Surv 33:31–88CrossRef Navarro G (2001) A guided tour to approximate string matching. ACM Comput Surv 33:31–88CrossRef
Zurück zum Zitat Nor MIAM, Izhar M, Norulhusna Ahmad, Kaidi H (2018) Joint source channel decoding exploiting 2 d source correlation with parameter estimation for image transmission over Rayleigh fading channels. vol 8, pp 2633–2642 Nor MIAM, Izhar M, Norulhusna Ahmad, Kaidi H (2018) Joint source channel decoding exploiting 2 d source correlation with parameter estimation for image transmission over Rayleigh fading channels. vol 8, pp 2633–2642
Zurück zum Zitat Pejoski S, Kafedziski V (2013) Joint source channel coding framework for real time h.264/avc video transmission over wireless mimo channels using outage probability. In: 2013 11th international conference on telecommunications in modern satellite, cable and broadcasting services (TELSIKS) 01, pp 221–224 Pejoski S, Kafedziski V (2013) Joint source channel coding framework for real time h.264/avc video transmission over wireless mimo channels using outage probability. In: 2013 11th international conference on telecommunications in modern satellite, cable and broadcasting services (TELSIKS) 01, pp 221–224
Zurück zum Zitat Rao M, Farsad N, Goldsmith A (2018) Variable length joint source-channel coding of text using deep neural networks. In: 2018 IEEE 19th international workshop on signal processing advances in wireless communications (SPAWC), pp 1–5 Rao M, Farsad N, Goldsmith A (2018) Variable length joint source-channel coding of text using deep neural networks. In: 2018 IEEE 19th international workshop on signal processing advances in wireless communications (SPAWC), pp 1–5
Zurück zum Zitat Shemona JS, Chellappan AK (2020) Segmentation techniques for early cancer detection in red blood cells with deep learning-based classifier—a comparative approach. IET Image Process 14:1726–1732CrossRef Shemona JS, Chellappan AK (2020) Segmentation techniques for early cancer detection in red blood cells with deep learning-based classifier—a comparative approach. IET Image Process 14:1726–1732CrossRef
Zurück zum Zitat Shewalkar A, Nyavanandi D, Ludwig SA (2019) Performance evaluation of deep neural networks applied to speech recognition: RNN, LSTM and GRU. J Artif Intell Soft Comput Res 9:235–245CrossRef Shewalkar A, Nyavanandi D, Ludwig SA (2019) Performance evaluation of deep neural networks applied to speech recognition: RNN, LSTM and GRU. J Artif Intell Soft Comput Res 9:235–245CrossRef
Zurück zum Zitat Srivastava N, Hinton GE, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 15:1929–1958MathSciNetMATH Srivastava N, Hinton GE, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 15:1929–1958MathSciNetMATH
Zurück zum Zitat Sutskever I, Vinyals O, Le QV (2014) Sequence to sequence learning with neural networks. In: NIPS Sutskever I, Vinyals O, Le QV (2014) Sequence to sequence learning with neural networks. In: NIPS
Zurück zum Zitat Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. In: NIPS Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. In: NIPS
Zurück zum Zitat Wu C, Wang B (2017) Extracting topics based on word2vec and improved jaccard similarity coefficient. In: 2017 IEEE second international conference on data science in Cyberspace (DSC), pp 389–397 Wu C, Wang B (2017) Extracting topics based on word2vec and improved jaccard similarity coefficient. In: 2017 IEEE second international conference on data science in Cyberspace (DSC), pp 389–397
Zurück zum Zitat Wu Y, Schuster M, Chen Z, Le QV, Norouzi M, Macherey W, Krikun M, Cao Y, Gao Q, Macherey K, Klingner J, Shah A, Johnson M, Liu X, Kaiser L, Gouws S, Kato Y, Kudo T, Kazawa H, Stevens K, Kurian G, Patil N, Wang W, Young C, Smith J, Riesa J, Rudnick A, Vinyals O, Corrado GS, Hughes M, Dean J (2016) Google’s neural machine translation system: bridging the gap between human and machine translation. arXiv:abs/1609.08144 Wu Y, Schuster M, Chen Z, Le QV, Norouzi M, Macherey W, Krikun M, Cao Y, Gao Q, Macherey K, Klingner J, Shah A, Johnson M, Liu X, Kaiser L, Gouws S, Kato Y, Kudo T, Kazawa H, Stevens K, Kurian G, Patil N, Wang W, Young C, Smith J, Riesa J, Rudnick A, Vinyals O, Corrado GS, Hughes M, Dean J (2016) Google’s neural machine translation system: bridging the gap between human and machine translation. arXiv:​abs/​1609.​08144
Zurück zum Zitat Zhang J, Cao Y, Han G, Fu X (2019) Deep neural network-based underwater OFDM receiver. IET Commun 13:1998–2002CrossRef Zhang J, Cao Y, Han G, Fu X (2019) Deep neural network-based underwater OFDM receiver. IET Commun 13:1998–2002CrossRef
Zurück zum Zitat Zhang X, Su J, Qin Y, Liu Y, Ji R, Wang H (2018) Asynchronous bidirectional decoding for neural machine translation. In: The thirty-second AAAI conference on artificial intelligence. arXiv:abs/1801.05122 Zhang X, Su J, Qin Y, Liu Y, Ji R, Wang H (2018) Asynchronous bidirectional decoding for neural machine translation. In: The thirty-second AAAI conference on artificial intelligence. arXiv:​abs/​1801.​05122
Zurück zum Zitat Zhou L, Zhang J, Zong C (2019) Synchronous bidirectional neural machine translation. Trans Assoc Comput Linguist 7:91–105CrossRef Zhou L, Zhang J, Zong C (2019) Synchronous bidirectional neural machine translation. Trans Assoc Comput Linguist 7:91–105CrossRef
Zurück zum Zitat Ziv J, Lempel A (1977) A universal algorithm for sequential data compression. IEEE Trans Inf Theory 23:337–343MathSciNetCrossRef Ziv J, Lempel A (1977) A universal algorithm for sequential data compression. IEEE Trans Inf Theory 23:337–343MathSciNetCrossRef
Metadaten
Titel
Attention-based neural joint source-channel coding of text for point to point and broadcast channel
verfasst von
Ting Liu
Xuechen Chen
Publikationsdatum
13.09.2021
Verlag
Springer Netherlands
Erschienen in
Artificial Intelligence Review / Ausgabe 3/2022
Print ISSN: 0269-2821
Elektronische ISSN: 1573-7462
DOI
https://doi.org/10.1007/s10462-021-10067-3

Weitere Artikel der Ausgabe 3/2022

Artificial Intelligence Review 3/2022 Zur Ausgabe

Premium Partner