Skip to main content
Erschienen in: Neural Computing and Applications 8/2020

25.02.2019 | Original Article

Remote detection of idling cars using infrared imaging and deep networks

verfasst von: Muhammet Bastan, Kim-Hui Yap, Lap-Pui Chau

Erschienen in: Neural Computing and Applications | Ausgabe 8/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Idling vehicles waste energy and pollute the environment through exhaust emission. In some countries, idling a vehicle for more than a predefined duration is prohibited and automatic idling vehicle detection is desirable for law enforcement. We propose the first automatic system to detect idling cars, using infrared (IR) imaging and deep networks. We rely on the differences in spatio-temporal heat signatures of idling and stopped cars and monitor the car temperature with a long-wavelength IR camera. We formulate the idling car detection problem as spatio-temporal event detection in IR image sequences and employ deep networks for spatio-temporal modeling. We collected the first IR image sequence dataset for idling car detection. First, we detect the cars in each IR image using a convolutional neural network, which is pre-trained on regular RGB images and fine-tuned on IR images for higher accuracy. Then, we track the detected cars over time to identify the cars that are parked. Finally, we use the 3D spatio-temporal IR image volume of each parked car as input to convolutional and recurrent networks to classify them as idling or not. We carried out an extensive empirical evaluation of temporal and spatio-temporal modeling approaches with various convolutional and recurrent architectures. We present promising experimental results on our IR image sequence dataset.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Akhloufi M, Bendada A (2008) Thermal faceprint: a new thermal face signature extraction for infrared face recognition. In: Canadian conference on computer and robot vision, pp 269–272. IEEE Akhloufi M, Bendada A (2008) Thermal faceprint: a new thermal face signature extraction for infrared face recognition. In: Canadian conference on computer and robot vision, pp 269–272. IEEE
2.
3.
Zurück zum Zitat Azizpour H, Sharif Razavian A, Sullivan J, Maki A, Carlsson S (2015) From generic to specific deep representations for visual recognition. In: IEEE conference on computer vision and pattern recognition workshops, pp 36–45 Azizpour H, Sharif Razavian A, Sullivan J, Maki A, Carlsson S (2015) From generic to specific deep representations for visual recognition. In: IEEE conference on computer vision and pattern recognition workshops, pp 36–45
4.
Zurück zum Zitat Bastan M, Yap KH, Chau LP (2018) Idling car detection with ConvNets in infrared image sequences. In: International symposium on circuits and systems. IEEE Bastan M, Yap KH, Chau LP (2018) Idling car detection with ConvNets in infrared image sequences. In: International symposium on circuits and systems. IEEE
5.
Zurück zum Zitat Bebis G, Gyaourova A, Singh S, Pavlidis I (2006) Face recognition by fusing thermal infrared and visible imagery. Image Vis Comput 24(7):727–742CrossRef Bebis G, Gyaourova A, Singh S, Pavlidis I (2006) Face recognition by fusing thermal infrared and visible imagery. Image Vis Comput 24(7):727–742CrossRef
6.
Zurück zum Zitat Bertozzi M, Broggi A, Caraffi C, Del Rose M, Felisa M, Vezzoni G (2007) Pedestrian detection by means of far-infrared stereo vision. Comput Vis Image Underst 106(2):194–204CrossRef Bertozzi M, Broggi A, Caraffi C, Del Rose M, Felisa M, Vezzoni G (2007) Pedestrian detection by means of far-infrared stereo vision. Comput Vis Image Underst 106(2):194–204CrossRef
7.
Zurück zum Zitat Bodansky D (2016) The Paris climate change agreement: a new hope? Am J Int Law 110(2):288–319CrossRef Bodansky D (2016) The Paris climate change agreement: a new hope? Am J Int Law 110(2):288–319CrossRef
8.
Zurück zum Zitat Chen Y, Zhang X, Zhang Y, Maybank SJ, Fu Z (2018) Visible and infrared image registration based on region features and edginess. Mach Vis Appl 29(1):113–123CrossRef Chen Y, Zhang X, Zhang Y, Maybank SJ, Fu Z (2018) Visible and infrared image registration based on region features and edginess. Mach Vis Appl 29(1):113–123CrossRef
10.
Zurück zum Zitat Chung JS, Senior A, Vinyals O, Zisserman A (2017) Lip reading sentences in the wild. In: IEEE conference on computer vision and pattern recognition Chung JS, Senior A, Vinyals O, Zisserman A (2017) Lip reading sentences in the wild. In: IEEE conference on computer vision and pattern recognition
12.
Zurück zum Zitat Donahue J, Anne Hendricks L, Guadarrama S, Rohrbach M, Venugopalan S, Saenko K, Darrell T (2015) Long-term recurrent convolutional networks for visual recognition and description. In: IEEE conference on computer vision and pattern recognition, pp 2625–2634 Donahue J, Anne Hendricks L, Guadarrama S, Rohrbach M, Venugopalan S, Saenko K, Darrell T (2015) Long-term recurrent convolutional networks for visual recognition and description. In: IEEE conference on computer vision and pattern recognition, pp 2625–2634
13.
Zurück zum Zitat Everingham M, Van Gool L, Williams CK, Winn J, Zisserman A (2010) The Pascal visual object classes (VOC) challenge. Int J Comput Vis 88(2):303–338CrossRef Everingham M, Van Gool L, Williams CK, Winn J, Zisserman A (2010) The Pascal visual object classes (VOC) challenge. Int J Comput Vis 88(2):303–338CrossRef
14.
Zurück zum Zitat Fendri E, Boukhriss RR, Hammami M (2017) Fusion of thermal infrared and visible spectra for robust moving object detection. Pattern Anal Appl 20:907–926MathSciNetCrossRef Fendri E, Boukhriss RR, Hammami M (2017) Fusion of thermal infrared and visible spectra for robust moving object detection. Pattern Anal Appl 20:907–926MathSciNetCrossRef
15.
Zurück zum Zitat Filipe S, Alexandre LA (2014) Algorithms for invariant long-wave infrared face segmentation: evaluation and comparison. Pattern Anal Appl 17(4):823–837MathSciNetCrossRef Filipe S, Alexandre LA (2014) Algorithms for invariant long-wave infrared face segmentation: evaluation and comparison. Pattern Anal Appl 17(4):823–837MathSciNetCrossRef
16.
Zurück zum Zitat Gade R, Moeslund TB (2014) Thermal cameras and applications: a survey. Mach Vis Appl 25(1):245–262CrossRef Gade R, Moeslund TB (2014) Thermal cameras and applications: a survey. Mach Vis Appl 25(1):245–262CrossRef
17.
Zurück zum Zitat Gaines L, Rask E, Keller G (2012) Which is greener: idle, or stop and restart. Argonne National Laboratory, US Department of Energy Gaines L, Rask E, Keller G (2012) Which is greener: idle, or stop and restart. Argonne National Laboratory, US Department of Energy
18.
Zurück zum Zitat Gault T, Farag A (2013) A fully automatic method to extract the heart rate from thermal video. In: IEEE conference on computer vision and pattern recognition workshops, pp 336–341 Gault T, Farag A (2013) A fully automatic method to extract the heart rate from thermal video. In: IEEE conference on computer vision and pattern recognition workshops, pp 336–341
19.
Zurück zum Zitat Glorot X, Bengio Y (2010) Understanding the difficulty of training deep feedforward neural networks. In: International conference on artificial intelligence and statistics, pp 249–256 Glorot X, Bengio Y (2010) Understanding the difficulty of training deep feedforward neural networks. In: International conference on artificial intelligence and statistics, pp 249–256
21.
Zurück zum Zitat Hinz S, Stilla U (2006) Car detection in aerial thermal images by local and global evidence accumulation. Pattern Recognit Lett 27(4):308–315CrossRef Hinz S, Stilla U (2006) Car detection in aerial thermal images by local and global evidence accumulation. Pattern Recognit Lett 27(4):308–315CrossRef
22.
Zurück zum Zitat Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780CrossRef Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780CrossRef
23.
Zurück zum Zitat Hou R, Chen C, Shah M (2017) Tube convolutional neural network (T-CNN) for action detection in videos. In: International conference on computer vision Hou R, Chen C, Shah M (2017) Tube convolutional neural network (T-CNN) for action detection in videos. In: International conference on computer vision
24.
Zurück zum Zitat Huang J, Rathod V, Sun C, Zhu M, Korattikara A, Fathi A, Fischer I, Wojna Z, Song Y, Guadarrama S, Murphy K (2017) Speed/accuracy trade-offs for modern convolutional object detectors. In: IEEE conference on computer vision and pattern recognition, vol 4 Huang J, Rathod V, Sun C, Zhu M, Korattikara A, Fathi A, Fischer I, Wojna Z, Song Y, Guadarrama S, Murphy K (2017) Speed/accuracy trade-offs for modern convolutional object detectors. In: IEEE conference on computer vision and pattern recognition, vol 4
25.
26.
Zurück zum Zitat Kim S (2014) Analysis of small infrared target features and learning-based false detection removal for infrared search and track. Pattern Anal Appl 17(4):883–900MathSciNetCrossRef Kim S (2014) Analysis of small infrared target features and learning-based false detection removal for infrared search and track. Pattern Anal Appl 17(4):883–900MathSciNetCrossRef
27.
Zurück zum Zitat Kingma D, Ba J (2014) Adam: a method for stochastic optimization. In: International conference on learning representations Kingma D, Ba J (2014) Adam: a method for stochastic optimization. In: International conference on learning representations
28.
Zurück zum Zitat Lin TY, Goyal P, Girshick R, He K, Dollár P (2017) Focal loss for dense object detection. In: International conference on computer vision Lin TY, Goyal P, Girshick R, He K, Dollár P (2017) Focal loss for dense object detection. In: International conference on computer vision
29.
Zurück zum Zitat Lin TY, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft COCO: common objects in context. In: European conference on computer vision, pp 740–755. Springer Lin TY, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft COCO: common objects in context. In: European conference on computer vision, pp 740–755. Springer
30.
Zurück zum Zitat Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) SSD: single shot multibox detector. In: European conference on computer vision, pp 21–37. Springer Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) SSD: single shot multibox detector. In: European conference on computer vision, pp 21–37. Springer
31.
Zurück zum Zitat Ma CY, Chen MH, Kira Z, AlRegib G (2017) TS-LSTM and temporal-inception: exploiting spatiotemporal dynamics for activity recognition. arXiv preprint arXiv:1703.10667 Ma CY, Chen MH, Kira Z, AlRegib G (2017) TS-LSTM and temporal-inception: exploiting spatiotemporal dynamics for activity recognition. arXiv preprint arXiv:​1703.​10667
32.
Zurück zum Zitat Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V et al (2011) Scikit-learn: Machine Learning in Python. J Mach Learn Res 12(Oct):2825–2830MathSciNetMATH Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V et al (2011) Scikit-learn: Machine Learning in Python. J Mach Learn Res 12(Oct):2825–2830MathSciNetMATH
33.
Zurück zum Zitat Redmon J, Farhadi A (2017) YOLO9000: better, faster, stronger. In: IEEE conference on computer vision and pattern recognition Redmon J, Farhadi A (2017) YOLO9000: better, faster, stronger. In: IEEE conference on computer vision and pattern recognition
34.
Zurück zum Zitat Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, pp 91–99 Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in neural information processing systems, pp 91–99
35.
Zurück zum Zitat Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M et al (2015) ImageNet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252MathSciNetCrossRef Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M et al (2015) ImageNet large scale visual recognition challenge. Int J Comput Vis 115(3):211–252MathSciNetCrossRef
36.
Zurück zum Zitat Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: International conference on learning representations Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: International conference on learning representations
37.
Zurück zum Zitat Sutskever I, Martens J, Dahl G, Hinton G (2013) On the importance of initialization and momentum in deep learning. In: International conference on machine learning, pp 1139–1147 Sutskever I, Martens J, Dahl G, Hinton G (2013) On the importance of initialization and momentum in deep learning. In: International conference on machine learning, pp 1139–1147
38.
Zurück zum Zitat Vollmer M, Möllmann KP (2017) Infrared thermal imaging: fundamentals, research and applications. Wiley, New YorkCrossRef Vollmer M, Möllmann KP (2017) Infrared thermal imaging: fundamentals, research and applications. Wiley, New YorkCrossRef
39.
Zurück zum Zitat Wu B, Iandola F, Jin PH, Keutzer K (2017) SqueezeDet: unified, small, low power fully convolutional neural networks for real-time object detection for autonomous driving. In: IEEE conference on computer vision and pattern recognition workshops Wu B, Iandola F, Jin PH, Keutzer K (2017) SqueezeDet: unified, small, low power fully convolutional neural networks for real-time object detection for autonomous driving. In: IEEE conference on computer vision and pattern recognition workshops
41.
Zurück zum Zitat Xu H, Das A, Saenko K (2017) R-C3D: region convolutional 3D network for temporal activity detection. In: International conference on computer vision Xu H, Das A, Saenko K (2017) R-C3D: region convolutional 3D network for temporal activity detection. In: International conference on computer vision
42.
Zurück zum Zitat Zhuang J, Liu Q (2016) Transferred IR pedestrian detector toward distinct scenarios adaptation. Neural Comput Appl 27(3):557–569CrossRef Zhuang J, Liu Q (2016) Transferred IR pedestrian detector toward distinct scenarios adaptation. Neural Comput Appl 27(3):557–569CrossRef
Metadaten
Titel
Remote detection of idling cars using infrared imaging and deep networks
verfasst von
Muhammet Bastan
Kim-Hui Yap
Lap-Pui Chau
Publikationsdatum
25.02.2019
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 8/2020
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-019-04077-0

Weitere Artikel der Ausgabe 8/2020

Neural Computing and Applications 8/2020 Zur Ausgabe