Skip to main content
Erschienen in: Neural Computing and Applications 8/2019

06.01.2018 | Original Article

Building a smart lecture-recording system using MK-CPN network for heterogeneous data sources

verfasst von: Chiung-Yao Fang, An-Chun Luo, Yu-Shan Deng, Chia-Ju Lu, Sei-Wang Chen

Erschienen in: Neural Computing and Applications | Ausgabe 8/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Nowadays, lecture-recording systems play a vital role in collecting spoken discourse for e-learning. However, in view of the growing development of e-learning, the lack of content is becoming a problem. This research presents a smart lecture-recording (SLR) system that can record orations at the same level of quality as a human team, but with a reduced degree of human involvement. The proposed SLR system is composed of two subsystems, referred to as virtual cameraman (VC), and virtual director (VD), respectively. All camera man components of VC subsystem are automatic and can take actions that include target and event detection, tracking, and view searching. The videos taken by these three components are forwarded to the VD subsystem, in which the representative shot is chosen for recording or direct broadcasting. We refer to this function of the VD subsystem as shot selection that is based on the content analysis. The capability of shot selection is pre-trained through a machine-learning process characterized by the counter-propagation neural (CPN) network. However, the CPN network yielded poor results when the input data were heterogeneous data. To increases the accuracy of shot selection, we applied multiple kernel learning (MKL) techniques into CPN network, called MK-CPN, to transform all the heterogeneous data from different content analysis methods into unified space. A series of experiments for real lecture has been conducted. The results showed that the proposed SLR system can provide oration records close to some extend to those taken by real human teams. We believe that the proposed system may not be limited to live speeches, if it can be configured with appropriate training materials.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Rowe LA, Harley D, Pletcher P, Lawrence S (2001) BIBS: a lecture webcasting system. Berkeley Multimedia Research Center Report, pp 1–23 Rowe LA, Harley D, Pletcher P, Lawrence S (2001) BIBS: a lecture webcasting system. Berkeley Multimedia Research Center Report, pp 1–23
2.
Zurück zum Zitat Rui Y, He L, Gupta A, Liu Q (2001) Building an intelligent camera management system. ACM Multimed 9:2–11 Rui Y, He L, Gupta A, Liu Q (2001) Building an intelligent camera management system. ACM Multimed 9:2–11
3.
Zurück zum Zitat Bianchi M (1998) AutoAuditorium: a fully automatic, multi-camera system to televise auditorium presentations. In: Joint DARPA/NIST smart spaces technology workshop Bianchi M (1998) AutoAuditorium: a fully automatic, multi-camera system to televise auditorium presentations. In: Joint DARPA/NIST smart spaces technology workshop
4.
Zurück zum Zitat Bianchi M (2004) Automatic video production of lectures using an intelligent and aware environment. In: The 3rd international conference on mobile and ubiquitous multimedia, pp 117–123 Bianchi M (2004) Automatic video production of lectures using an intelligent and aware environment. In: The 3rd international conference on mobile and ubiquitous multimedia, pp 117–123
5.
Zurück zum Zitat Abowd GD (1999) Classroom 2000: an experiment with the instrumentation of a living educational environment. IBM Syst J 38:508–530CrossRef Abowd GD (1999) Classroom 2000: an experiment with the instrumentation of a living educational environment. IBM Syst J 38:508–530CrossRef
6.
Zurück zum Zitat Cruz G, Hill R (1994) Capturing and playing multimedia events with STREAMS. In: ACM international conference on multimedia, pp 193–200 Cruz G, Hill R (1994) Capturing and playing multimedia events with STREAMS. In: ACM international conference on multimedia, pp 193–200
7.
Zurück zum Zitat Zhang C, Rui Y, Crawford J, He LW (2008) An automated end-to-end lecture capture and broadcasting system. ACM Trans Multimed Comput Commun Appl 4:2–11CrossRef Zhang C, Rui Y, Crawford J, He LW (2008) An automated end-to-end lecture capture and broadcasting system. ACM Trans Multimed Comput Commun Appl 4:2–11CrossRef
8.
Zurück zum Zitat Yong R, Anoop G, Jonathan G, He LW (2004) Automating lecture capture and broadcast: technology and videography. Multimed Syst 10:3–15CrossRef Yong R, Anoop G, Jonathan G, He LW (2004) Automating lecture capture and broadcast: technology and videography. Multimed Syst 10:3–15CrossRef
9.
Zurück zum Zitat Onishi M, Fukunaga K (2004) Shooting the lecture scene using computer-controlled cameras based on situation understanding and evaluation of video images. In: The 17th international conference on pattern recognition, pp 781–784 Onishi M, Fukunaga K (2004) Shooting the lecture scene using computer-controlled cameras based on situation understanding and evaluation of video images. In: The 17th international conference on pattern recognition, pp 781–784
10.
Zurück zum Zitat Lu CT, Chen SW (2011) Automatic lecture recording system. In: The 24th IPPR conference on computer vision, graphics, and image processing Lu CT, Chen SW (2011) Automatic lecture recording system. In: The 24th IPPR conference on computer vision, graphics, and image processing
11.
Zurück zum Zitat Cheng Y (1995) Mean shift, mode seeking, and clustering. IEEE Trans Pattern Anal Mach Intell 17(8):790–799CrossRef Cheng Y (1995) Mean shift, mode seeking, and clustering. IEEE Trans Pattern Anal Mach Intell 17(8):790–799CrossRef
12.
Zurück zum Zitat Gleicher M, Masanz J (2000) Towards virtual videography. In: ACM Multimedia, pp 375–378 Gleicher M, Masanz J (2000) Towards virtual videography. In: ACM Multimedia, pp 375–378
13.
Zurück zum Zitat Okuni S, Tsuruoka S, Rayat GP, Kawanaka H, Shinogi T (2007) Video scene segmentation using the state recognition of blackboard for blended learning. In: International conference on convergence information technology, pp 2437–2442 Okuni S, Tsuruoka S, Rayat GP, Kawanaka H, Shinogi T (2007) Video scene segmentation using the state recognition of blackboard for blended learning. In: International conference on convergence information technology, pp 2437–2442
14.
Zurück zum Zitat Kumano M, Ariki Y, Amano M, Uehara K (2002) Video editing support system based on video grammar and content analysis. In: International conference on pattern recognition, pp 1031–1036 Kumano M, Ariki Y, Amano M, Uehara K (2002) Video editing support system based on video grammar and content analysis. In: International conference on pattern recognition, pp 1031–1036
15.
Zurück zum Zitat Wang T, Mansfield A, Hu R, Collomosse J (2009) An evolutionary approach to automatic video editing. In: International conference on visual media production (CVMP), pp 127–134 Wang T, Mansfield A, Hu R, Collomosse J (2009) An evolutionary approach to automatic video editing. In: International conference on visual media production (CVMP), pp 127–134
16.
Zurück zum Zitat Machnicki E, Rowe LA (2002) Virtual director: automating a webcast. In: SPIE international conference on multimedia computer network. San Jose, California, pp 208–225 Machnicki E, Rowe LA (2002) Virtual director: automating a webcast. In: SPIE international conference on multimedia computer network. San Jose, California, pp 208–225
17.
Zurück zum Zitat Liu Q, Rui Y, Gupta A, Cadiz JJ (2001) Automating camera management for lecture room environments. In: The SIGCHI conference on human factors in computing systems, pp 442–449 Liu Q, Rui Y, Gupta A, Cadiz JJ (2001) Automating camera management for lecture room environments. In: The SIGCHI conference on human factors in computing systems, pp 442–449
18.
Zurück zum Zitat Ugalde HMR, Carmona JC, Reyes-Reyes J, Alvarado VM, Corbier C (2015) Balanced simplicity–accuracy neural network model families for system identification. Neural Comput Appl 26(1):171–186CrossRef Ugalde HMR, Carmona JC, Reyes-Reyes J, Alvarado VM, Corbier C (2015) Balanced simplicity–accuracy neural network model families for system identification. Neural Comput Appl 26(1):171–186CrossRef
19.
Zurück zum Zitat Xu Z, Song Q, Wang D (2014) A robust recurrent simultaneous perturbation stochastic approximation training algorithm for recurrent neural networks. Neural Comput Appl 24(7):1851–1866CrossRef Xu Z, Song Q, Wang D (2014) A robust recurrent simultaneous perturbation stochastic approximation training algorithm for recurrent neural networks. Neural Comput Appl 24(7):1851–1866CrossRef
20.
Zurück zum Zitat LeCun Y, Boser B, Denker JS, Henderson D, Howard RE, Hubbard W, Jackel LD (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput 1(4):541–551CrossRef LeCun Y, Boser B, Denker JS, Henderson D, Howard RE, Hubbard W, Jackel LD (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput 1(4):541–551CrossRef
21.
Zurück zum Zitat Zhang H, Cao X, Ho J, Chow T (2017) Object-level video advertising: an optimization framework. IEEE Trans Ind Inform 13(2):520–531CrossRef Zhang H, Cao X, Ho J, Chow T (2017) Object-level video advertising: an optimization framework. IEEE Trans Ind Inform 13(2):520–531CrossRef
22.
Zurück zum Zitat Hecht-Nielsen R (1987) Counter-propagation networks. Appl Opt 26(23):4979–4983CrossRef Hecht-Nielsen R (1987) Counter-propagation networks. Appl Opt 26(23):4979–4983CrossRef
23.
Zurück zum Zitat G¨onen M, Alpaydın E (2011) Multiple kernel learning algorithms. J Mach Learn Res 12:2211–2268MathSciNetMATH G¨onen M, Alpaydın E (2011) Multiple kernel learning algorithms. J Mach Learn Res 12:2211–2268MathSciNetMATH
24.
Zurück zum Zitat Lin YY, Liu TL, Fuh CS (2011) Multiple kernel learning for dimensionality reduction. IEEE Trans Pattern Anal Mach Intell 33(6):1147–1160CrossRef Lin YY, Liu TL, Fuh CS (2011) Multiple kernel learning for dimensionality reduction. IEEE Trans Pattern Anal Mach Intell 33(6):1147–1160CrossRef
25.
Zurück zum Zitat Cheng KH, Hsieh CH, Wang CC (2011) Human action recognition using 3D body joints. In: The 24th IPPR conference on computer vision, graphics, and image processing Cheng KH, Hsieh CH, Wang CC (2011) Human action recognition using 3D body joints. In: The 24th IPPR conference on computer vision, graphics, and image processing
26.
Zurück zum Zitat Lin SY, You ZH, Hung YP (2011) A real-time action recognition approach with 3D tracked body joints and its application. In: The 24th IPPR conference on computer vision, graphics, and image processing Lin SY, You ZH, Hung YP (2011) A real-time action recognition approach with 3D tracked body joints and its application. In: The 24th IPPR conference on computer vision, graphics, and image processing
27.
Zurück zum Zitat Johann P, Hamböker R (1994) Parametric statistical theory. Walter de Gruyter, Berlin, pp 207–208. ISBN 3-11-013863-8 Johann P, Hamböker R (1994) Parametric statistical theory. Walter de Gruyter, Berlin, pp 207–208. ISBN 3-11-013863-8
28.
Zurück zum Zitat Rosten E, Drummond T (2005) Fusing points and lines for high performance tracking. In: IEEE international conference on computer vision (ICCV’05), vol 2, pp 1508–1511 Rosten E, Drummond T (2005) Fusing points and lines for high performance tracking. In: IEEE international conference on computer vision (ICCV’05), vol 2, pp 1508–1511
29.
Zurück zum Zitat Lucas BD, Kanade T (1981) An iterative image registration technique with an application to stereo vision. In: Imaging understanding workshop, pp 121–130 Lucas BD, Kanade T (1981) An iterative image registration technique with an application to stereo vision. In: Imaging understanding workshop, pp 121–130
30.
Zurück zum Zitat Liu T, Yuan Z, Sun J, Wang J, Zheng N, Tang X, Shum HY (2011) Learning to detect a salient object. IEEE Trans Pattern Anal Mach Intell 33(2):353–367CrossRef Liu T, Yuan Z, Sun J, Wang J, Zheng N, Tang X, Shum HY (2011) Learning to detect a salient object. IEEE Trans Pattern Anal Mach Intell 33(2):353–367CrossRef
31.
Zurück zum Zitat Fang CJ, Chen SW, Fu CS (2003) Automatic change detection of driving environments in a vision-based driver assistance system. IEEE Trans Neural Netw 14(3):646–657CrossRef Fang CJ, Chen SW, Fu CS (2003) Automatic change detection of driving environments in a vision-based driver assistance system. IEEE Trans Neural Netw 14(3):646–657CrossRef
32.
Zurück zum Zitat Abdollahian G, Taskiran CM, Pizlo Z, Delp EJ (2010) Camera motion-based analysis of user generated video. IEEE Trans Multimed 12(1):28–41CrossRef Abdollahian G, Taskiran CM, Pizlo Z, Delp EJ (2010) Camera motion-based analysis of user generated video. IEEE Trans Multimed 12(1):28–41CrossRef
Metadaten
Titel
Building a smart lecture-recording system using MK-CPN network for heterogeneous data sources
verfasst von
Chiung-Yao Fang
An-Chun Luo
Yu-Shan Deng
Chia-Ju Lu
Sei-Wang Chen
Publikationsdatum
06.01.2018
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 8/2019
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-017-3328-6

Weitere Artikel der Ausgabe 8/2019

Neural Computing and Applications 8/2019 Zur Ausgabe