Published in: Optical and Quantum Electronics 3/2024

1 March 2024

Design and research of multimedia information publishing system based on speech recognition technology

Authors: Zhuoran Li, Yafei Wang, Cong Wang

Abstract

The Internet is a vast network formed by interconnecting local area networks (LANs) through a common set of protocols. Today, accurate, real-time multimedia information publishing technology is widely used in many fields. To publish (disclose) information, a multimedia information publishing system treats multimedia resources as its objects and displays them according to how users edit the resources held by the managing party. As the workload of managing and maintaining multimedia information grows, current multimedia information management and control software cannot meet market needs. To keep multimedia information publishing timely and accurate, it is necessary to check whether speech recognition technology can capture voice information accurately. Technique is particularly important in speech recognition, also called automatic speech recognition, whose purpose is to convert the words contained in human speech into content a computer can process. It differs from speaker identification, which tries to identify or confirm who is speaking rather than which words were spoken. However, speech recognition technology requires a well-developed system design. Such a design can draw on the ideas and methods of systems science, and a new system can also be designed from the results of a systematic analysis. Therefore, this paper uses an Internet-based voice system to address inaccuracy in speech recognition technology and applies speech recognition techniques to publish multimedia information.

Metadata
Title
Design and research of multimedia information publishing system based on speech recognition technology
Authors
Zhuoran Li
Yafei Wang
Cong Wang
Publication date
1 March 2024
Publisher
Springer US
Published in
Optical and Quantum Electronics / Issue 3/2024
Print ISSN: 0306-8919
Electronic ISSN: 1572-817X
DOI
https://doi.org/10.1007/s11082-023-05926-y
