Skip to main content

2020 | OriginalPaper | Buchkapitel

Face Detection in MWIR Spectrum

verfasst von : Suha Reddy Mokalla, Thirimachos Bourlai

Erschienen in: Securing Social Identity in Mobile Platforms

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The capability to perform face recognition in the visible and thermal spectra is of prime interest in many law enforcement and military organizations. Face detection is an important pre-processing step for face recognition. Though many algorithms are available for face detection in the visible spectrum, an assessment of how these algorithms can be retrained for the thermal spectrum is an important study. Current available visible-based face detection algorithms are very effective in daytime conditions, however, when there are extreme changes in illumination conditions like very low-light to no light (night-time), these become challenging. Due to limited amount of data available for researchers from sensors in the thermal band (due to the increased cost of having and operating state of the art thermal sensors), there are only a few proposed algorithms. In this work, we conducted a study to determine the impact of factors such as indoor/outdoor environment, distance from the camera, application of sunscreen, training set size, etc. on training deep-learning models for a face detection system in the thermal spectrum that simultaneously performs face detection and frontal/non-frontal classification. Existing deep learning models such as SSD (Single Shot Multi-box Detector), R-FCN (Region Based Fully Convolutional Network) and R-CNN (Region Based Convolutional Neural Network), are re-trained using thermal images for face detection and pose estimation tasks. Results from each model are compared, and the model with the best performance is further trained and tested on different datasets, including indoor, outdoor at different stand-off distances. The highest accuracy is achieved using a Faster R-CNN model with ResNet-101 and the accuracy is 99.4% after a 10-fold cross-validation. More experiments are performed to further study the efficiency and limitations of this model. The data set we use was collected under constrained indoor and unconstrained outdoor conditions.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
1
Release of MWIR Face Dataset: This is currently a private database with availability determined on case-by-case basis. If interested in working with this database, please contact the corresponding author.
 
Literatur
1.
Zurück zum Zitat Biswas SK, Milanfar P (2017) Linear support tensor machine with LSK channels: pedestrian detection in thermal infrared images. IEEE Trans Image Process 26(9):4229–4242MathSciNetCrossRef Biswas SK, Milanfar P (2017) Linear support tensor machine with LSK channels: pedestrian detection in thermal infrared images. IEEE Trans Image Process 26(9):4229–4242MathSciNetCrossRef
2.
Zurück zum Zitat Herrmann C, Ruf M, Beyerer J (2018, April) CNN-based thermal infrared person detection by domain adaptation. In Proceedings of the Autonomous Systems: Sensors, Vehicles, Security and the Internet of Everything, Orlando, FL, USA, pp. 15–19 Herrmann C, Ruf M, Beyerer J (2018, April) CNN-based thermal infrared person detection by domain adaptation. In Proceedings of the Autonomous Systems: Sensors, Vehicles, Security and the Internet of Everything, Orlando, FL, USA, pp. 15–19
3.
Zurück zum Zitat Cross SE, Innes B, Roberts MS, Tsuzuki T, Robertson TA, McCormick P (2007) Human skin penetration of sunscreen nanoparticles: in-vitro assessment of a novel micronized zinc oxide formulation. Skin Pharmacol Physiol 148–154 Cross SE, Innes B, Roberts MS, Tsuzuki T, Robertson TA, McCormick P (2007) Human skin penetration of sunscreen nanoparticles: in-vitro assessment of a novel micronized zinc oxide formulation. Skin Pharmacol Physiol 148–154
4.
Zurück zum Zitat Dai J, Li Y, He K, Sun J (2016) R-FCN: object detection via region-based fully convolutional networks. Adv Neural Inf Proces, vol. abs/1605.06409, 379–387 Dai J, Li Y, He K, Sun J (2016) R-FCN: object detection via region-based fully convolutional networks. Adv Neural Inf Proces, vol. abs/1605.06409, 379–387
5.
Zurück zum Zitat Dowdall J, Pavlidis I, Bebis G (2003) Face detection in the Near-IR spectrum. Image Vis Comput 565–578 Dowdall J, Pavlidis I, Bebis G (2003) Face detection in the Near-IR spectrum. Image Vis Comput 565–578
6.
Zurück zum Zitat Eveland CK, Socolinsky DA, Wolff LB (2003) Tracking human faces in infrared video. Image Vis Comput 579–590 Eveland CK, Socolinsky DA, Wolff LB (2003) Tracking human faces in infrared video. Image Vis Comput 579–590
7.
Zurück zum Zitat Farfade SS, Saberian MJ, Li LJ (2015) Multi-view face detection using deep convolutional neural networks. In: Proceedings of the 5th ACM on international conference on multimedia retrieval, pp 643–650 Farfade SS, Saberian MJ, Li LJ (2015) Multi-view face detection using deep convolutional neural networks. In: Proceedings of the 5th ACM on international conference on multimedia retrieval, pp 643–650
8.
Zurück zum Zitat Girshick RB (2015) Fast R-CNN. CoRR, pp 91–99 Girshick RB (2015) Fast R-CNN. CoRR, pp 91–99
9.
Zurück zum Zitat He K, Zhang X, Ren S, Sun J (2015) Deep residual learning for image recognition. CoRR, pp 770–778 He K, Zhang X, Ren S, Sun J (2015) Deep residual learning for image recognition. CoRR, pp 770–778
10.
Zurück zum Zitat Hochreiter S (1998) The vanishing gradient problem during learning recurrent neural nets and problem solutions. Int J Uncertainty Fuzziness Knowledge Based Syst, 6(2):107–116MathSciNetCrossRef Hochreiter S (1998) The vanishing gradient problem during learning recurrent neural nets and problem solutions. Int J Uncertainty Fuzziness Knowledge Based Syst, 6(2):107–116MathSciNetCrossRef
11.
Zurück zum Zitat Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreett M, Adam H (2017) MobileNets: efficient convolutional neural networks for mobile vision applications. CoRR Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreett M, Adam H (2017) MobileNets: efficient convolutional neural networks for mobile vision applications. CoRR
12.
Zurück zum Zitat Huang J, Rathod V, Sun C, Zhu M, Korattikara A, Fathi A, Fischer I, Wojna Z, Song Y, Guadarrama S, Murphy K (2016) Speed/accuracy trade-offs for modern convolutional object detectors. CoRR, pp 7310–7311 Huang J, Rathod V, Sun C, Zhu M, Korattikara A, Fathi A, Fischer I, Wojna Z, Song Y, Guadarrama S, Murphy K (2016) Speed/accuracy trade-offs for modern convolutional object detectors. CoRR, pp 7310–7311
13.
Zurück zum Zitat Jiang H, Learned-Miller E (2017) Face detection with the faster R-CNN. In: 2017 12th IEEE international conference on automatic face gesture recognition (FG 2017), pp 650–657 Jiang H, Learned-Miller E (2017) Face detection with the faster R-CNN. In: 2017 12th IEEE international conference on automatic face gesture recognition (FG 2017), pp 650–657
14.
Zurück zum Zitat Komatsu S, Markman A, Mahalanobis A, Chen K, Javidi B (2017) Three-dimensional integral imaging and object detection using long-wave infrared imaging. Appl Opt D120–D126 Komatsu S, Markman A, Mahalanobis A, Chen K, Javidi B (2017) Three-dimensional integral imaging and object detection using long-wave infrared imaging. Appl Opt D120–D126
15.
Zurück zum Zitat Kwaśniewska A, Rumiński J, Rad P (2017) Deep features class activation map for thermal face detection and tracking. In: 2017 10th international conference on human system interactions (HSI). IEEE, pp 41–47 Kwaśniewska A, Rumiński J, Rad P (2017) Deep features class activation map for thermal face detection and tracking. In: 2017 10th international conference on human system interactions (HSI). IEEE, pp 41–47
16.
Zurück zum Zitat Li H, Lin Z, Shen X, Brandt J, Hua G (2015) A convolutional neural network cascade for face detection. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR), pp 5325–5334 Li H, Lin Z, Shen X, Brandt J, Hua G (2015) A convolutional neural network cascade for face detection. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR), pp 5325–5334
17.
Zurück zum Zitat Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) SSD: single shot multibox detector, pp 21–37 Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) SSD: single shot multibox detector, pp 21–37
18.
Zurück zum Zitat Ma C, Trung N, Uchiyama H, Nagahara H, Shimada A, Taniguchi R (2017) Adapting local features for face detection in thermal image. Sensors 2741 Ma C, Trung N, Uchiyama H, Nagahara H, Shimada A, Taniguchi R (2017) Adapting local features for face detection in thermal image. Sensors 2741
19.
Zurück zum Zitat Ma C, Trung NT, Uchiyama H, Nagahara H, Shimada A, Taniguchi RI (2017) Mixed features for face detection in thermal image. Proc SPIE Int Soc Opt Eng 103380E Ma C, Trung NT, Uchiyama H, Nagahara H, Shimada A, Taniguchi RI (2017) Mixed features for face detection in thermal image. Proc SPIE Int Soc Opt Eng 103380E
20.
Zurück zum Zitat Murata T, Matsuno S, Mito K, Itakura N, Mizuno T (2017) Investigation of facial region extraction algorithm focusing on temperature distribution characteristics of facial thermal images. In: HCI international 2017 – posters’ extended abstracts, pp 347–352 Murata T, Matsuno S, Mito K, Itakura N, Mizuno T (2017) Investigation of facial region extraction algorithm focusing on temperature distribution characteristics of facial thermal images. In: HCI international 2017 – posters’ extended abstracts, pp 347–352
21.
Zurück zum Zitat Ranjan R, Patel VM, Chellappa R (2017) HyperFace: a deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition. IEEE Trans Pattern Anal Mach Intell, Vol. abs/1603.01249, 121–135 Ranjan R, Patel VM, Chellappa R (2017) HyperFace: a deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition. IEEE Trans Pattern Anal Mach Intell, Vol. abs/1603.01249, 121–135
22.
Zurück zum Zitat Reese K, Zheng Y, Elmaghraby AS (2012) A comparison of face detection algorithms in visible and thermal spectrums. In: Object recognition supported by user interaction for service robots Reese K, Zheng Y, Elmaghraby AS (2012) A comparison of face detection algorithms in visible and thermal spectrums. In: Object recognition supported by user interaction for service robots
23.
Zurück zum Zitat Ren S, He K, Girshick RB, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. CoRR, pp 91–99 Ren S, He K, Girshick RB, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. CoRR, pp 91–99
24.
Zurück zum Zitat Sun X, Wu P, Hoi SCH (2017) Face detection using deep learning: an improved faster R-CNN approach. Neurocomputing 42–50 Sun X, Wu P, Hoi SCH (2017) Face detection using deep learning: an improved faster R-CNN approach. Neurocomputing 42–50
25.
Zurück zum Zitat Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2014) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9 Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2014) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–9
26.
Zurück zum Zitat Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2015) Rethinking the inception architecture for computer vision. CoRR, pp 2818–2826 Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2015) Rethinking the inception architecture for computer vision. CoRR, pp 2818–2826
27.
Zurück zum Zitat Yang S, Luo P, Loy CC, Tang X (2015) From facial parts responses to face detection: a deep learning approach. In: 2015 IEEE international conference on computer vision (ICCV), pp 3676–3684 Yang S, Luo P, Loy CC, Tang X (2015) From facial parts responses to face detection: a deep learning approach. In: 2015 IEEE international conference on computer vision (ICCV), pp 3676–3684
28.
Zurück zum Zitat Yang S, Luo P, Loy CC, Tang X (2018) Faceness-Net: face detection through deep facial part responses. IEEE Trans Pattern Anal Mach Intell, Vol. abs/1701.08393, 1845–1859 Yang S, Luo P, Loy CC, Tang X (2018) Faceness-Net: face detection through deep facial part responses. IEEE Trans Pattern Anal Mach Intell, Vol. abs/1701.08393, 1845–1859
29.
Zurück zum Zitat Yang S, Xiong Y, Loy CC, Tang X (2017) Face detection through scale-friendly deep convolutional networks. CoRR Yang S, Xiong Y, Loy CC, Tang X (2017) Face detection through scale-friendly deep convolutional networks. CoRR
30.
Zurück zum Zitat Zhang K, Zhang Z, Li Z, Qiao Y (2016) Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Process Lett, vol. abs/1604.02878, 1499–1503 Zhang K, Zhang Z, Li Z, Qiao Y (2016) Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Signal Process Lett, vol. abs/1604.02878, 1499–1503
31.
Zurück zum Zitat Zheng Y (2012) Face detection and eyeglasses detection for thermal face recognition. 83000C Zheng Y (2012) Face detection and eyeglasses detection for thermal face recognition. 83000C
32.
Zurück zum Zitat Zhu C, Zheng Y, Luu K, Savvides M (2017) CMS-RCNN: contextual multi-scale region-based CNN for unconstrained face detection, pp 57–79 Zhu C, Zheng Y, Luu K, Savvides M (2017) CMS-RCNN: contextual multi-scale region-based CNN for unconstrained face detection, pp 57–79
Metadaten
Titel
Face Detection in MWIR Spectrum
verfasst von
Suha Reddy Mokalla
Thirimachos Bourlai
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-39489-9_8