nach oben

Machine Vision and Applications

Erschienen in:

01.10.2014 | Special Issue Paper

Context-based person identification framework for smart video surveillance

verfasst von: Liyan Zhang, Dmitri V. Kalashnikov, Sharad Mehrotra, Ronen Vaisenberg

Erschienen in: Machine Vision and Applications | Ausgabe 7/2014

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Smart video surveillance (SVS) applications enhance situational awareness by allowing domain analysts to focus on the events of higher priority. SVS approaches operate by trying to extract and interpret higher “semantic” level events that occur in video. One of the key challenges of SVS is that of person identification where the task is for each subject that occurs in a video shot to identify the person it corresponds to. The problem of person identification is especially challenging in resource-constrained environments where transmission delay, bandwidth restriction, and packet loss may prevent the capture of high-quality data. Conventional person identification approaches which primarily are based on analyzing facial features are often not sufficient to deal with poor-quality data. To address this challenge, we propose a framework that leverages heterogeneous contextual information together with facial features to handle the problem of person identification for low-quality data. We first investigate the appropriate methods to utilize heterogeneous context features including clothing, activity, human attributes, gait, people co-occurrence, and so on. We then propose a unified approach for person identification that builds on top of our generic entity resolution framework called RelDC, which can integrate all these context features to improve the quality of person identification. This work thus links one well-known problem of person identification from the computer vision research area (that deals with video/images) with another well-recognized challenge known as entity resolution from the database and AI/ML areas (that deals with textual data). We apply the proposed solution to a real-world dataset consisting of several weeks of surveillance videos. The results demonstrate the effectiveness and efficiency of our approach even on low-quality video data.

Vorheriger Artikel Pedestrian detection based on sparse coding and transfer learning

Nächster Artikel A refined particle filter based on determined level set model for robust contour tracking

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

\(704 \times 480\) resolution per frame.

We use the standard definition of relational datasets as used in the database literature.

Project sherlock @ uci. http://sherlock.ics.uci.edu

Ahonen, T., Hadid, A., Pietik, M.: Face description with local binary patterns: Application to face recognition. IEEE Trans. Pattern Anal. (2006)

An, L., Kafai, M., Bhanu, B.: Dynamic bayesian network for unconstrained face recognition in surveillance camera networks. IEEE J. Emerg. Sel. Top. Circuits Syst. 3(2), 155–164 (2013)CrossRef

Balcan, M., Blum, A., Choi, P.P., Lafferty, J., Pantano, B., Rwebangira, M.R., Zhu, X.: Person identification in webcam images: An application of semi-supervised learning. In: ICML Workshop on Learning from Partially Classified Training Data (2005)

Chaudhuri, S., Ganjam, K., Ganti, V., Kapoor, R., Narasayya, V., Vassilakis, T.: Data cleaning in Microsoft SQL Server 2005. In: ACM SIGMOD Conference (2005)

Chen, S., Kalashnikov, D.V., Mehrotra, S.: Adaptive graphical approach to entity resolution. In: Proceedings of ACM IEEE Joint Conference on Digital Libraries (JCDL 2007), Vancouver, British Columbia, Canada, June 17–23 (2007)

Chen, Z., Kalashnikov, D.V., Mehrotra, S.: Exploiting relationships for object consolidation. In: Proceedings of International ACM SIGMOD Workshop on Information Quality in Information Systems (ACM IQIS 2005), Baltimore, MD, USA, June 17 (2005)

Chen, Z.S., Kalashnikov, D.V., Mehrotra, S.: Exploiting context analysis for combining multiple entity resolution systems. In: Proceedings of ACM SIGMOD International Conference on Management of Data (ACM SIGMOD 2009), Providence, RI, USA, June 29–July 2 (2009)

Cristani, M., Bicego, M., Murino, V.: Audio-visual event recognition in surveillance video sequences. IEEE Trans. Multimed. (2007)

10.

Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR (2005)

11.

Gallagher, A., Chen, T.: Clothing cosegmentation for recognizing people. In: IEEE CVPR (2008)

12.

Gao, Y., Tang, J., Hong, R., Yan, S., Dai, Q., Zhang, N., Chua, T.: Camera constraint-free view-based 3D object retrieval. IEEE Trans. Image Process. 21(4), 2269–2281 (2012)MathSciNetCrossRef

13.

Gao, Y., Wang, M., Tao, D., Ji, R., Dai, Q.: 3D object retrieval and recognition with hypergraph analysis. IEEE Trans. Image Process. 21(9), 4290–4303 (2012)MathSciNetCrossRef

14.

Gao, Y., Wang, M., Zha, Z., Shen, J., Li, X., Wu, X.: Visual-textual joint relevance learning for tag-based social image search. IEEE Trans. Image Process. 22(1), 363–376 (2013)MathSciNetCrossRef

15.

Grossman, R.L., Kamath, C., Kegelmeyer, P., Kumar, V., Namburu, R.R.: Data Mining for Scientific and Engineering Applications. Kluwer, Dordrecht (2001)CrossRefMATH

16.

Han, J., Altman, R.B., Kumar, V., Mannila, H., Pregibon, D.: Emerging scientific applications in data mining. Commun. ACM 45(8), 54–58 (2002)CrossRef

17.

Hong, R., Tang, J., Tan, H.-K., Ngo, C.-W., Yan, S., Chua, T.-S.: Beyond search: event-driven summarization for web videos. TOMCCAP 7(4), 35 (2011)CrossRef

18.

Hong, R., Wang, M., Li, G., Nie, L., Zha, Z.-J., Chua, T.-S.: Multimedia question answering. IEEE MultiMed. 19(4), 72–78 (2012)CrossRef

19.

Kalashnikov, D.V.: Super-EGO: fast multi-dimensional similarity join. Int. J. Very Large Data Bases 4(2), 561–585 (2013)MathSciNetCrossRef

20.

Kalashnikov, D.V., Chen, Z., Mehrotra, S., Nuray, R.: Web people search via connection analysis. IEEE Trans. Knowl. Data Eng. 20(11) (2008)

21.

Kalashnikov, D.V., Mehrotra, S.: Domain-independent data cleaning via analysis of entity-relationship graph. ACM Trans Database Syst. 31(2), 716–767 (2006)CrossRef

22.

Kalashnikov, D.V., Mehrotra, S., Chen, S., Nuray, R., Ashish, N.: Disambiguation algorithm for people search on the web. In: Proceedings of the IEEE 23rd International Conference on Data Engineering (IEEE ICDE 2007), Istanbul, Turkey, April 16–20 2007 (short publication)

23.

Kalashnikov, D.V., Mehrotra, S., Chen, Z.: Exploiting relationships for domain-independent data cleaning. In: SIAM International Conference on Data Mining (SDM 2005), Newport Beach, CA, USA, April 21–23 (2005)

24.

Kantardzic, M., Zurada, J.: Next Generation of Data-Mining Applications. Wiley, London (2005)MATH

25.

Kumar, N., Berg, A.C., Belhumeur, P.N., Nayar, S.K.: Describable visual attributes for face verification and image search. In: IEEE TPAMI (2011)

26.

McCallum, A., Wellner, B.: Object consolidation by graph partitioning with a conditionally-trained distance metric. In: KDD Workshop on Data Cleaning, Record Linkage and Object Consolidation (2003)

27.

Nuray-Turan, R., Kalashnikov, D.V., Mehrotra, S.: Self-tuning in graph-based reference disambiguation. In: Proceedings of the 12th International Conference on Database Systems for Advanced Applications (DASFAA 2007), Springer LNCS, Bangkok, Thailand, April 9–12 (2007)

28.

Nuray-Turan, R., Kalashnikov, D.V., Mehrotra, S.: Exploiting web querying for web people search. ACM Trans. Database Syst. 37(1) (2012)

29.

Nuray-Turan, R., Kalashnikov, D.V., Mehrotra, S.: Adaptive connection strength models for relationship-based entity resolution. ACM J. Data Inf. Qual. 4(2) (2013)

30.

Nuray-Turan, R., Kalashnikov, D.V., Mehrotra, S., Yu, Y.: Attribute and object selection queries on objects with probabilistic attributes. ACM Trans. Database Syst. 37(1) (2012)

31.

Tang, J., Hong, R., Yan, S., Chua, T.-S., Qi, G.-J., Jain, R.: Image annotation by knn-sparse graph-based label propagation over noisily tagged web images. ACM TIST 2(2), 14 (2011)

32.

Tang, J., Yan, S., Hong, R., Qi, G.-J., Chua, T.-S.: Inferring semantic concepts from community-contributed images and noisy tags. ACM Multimed., pp. 223–232 (2009)

33.

Tian, Y., Brown, L.M.G., Hampapur, A., Senior, M.L., Shu, C.: Ibm smart surveillance system (s3): event based video surveillance system with an open and extensible framework. Mach. Vis. Appl. (2008)

34.

Vaisenberg, R., Mehrotra, S., Ramanan, D.: Semartcam scheduler: emantics driven eal- ime data collection from indoor amera networks to maximize event detection. J. Real Time Image Process. 5(4) (2010)

35.

Wang, M., Hua, X.-S., Hong, R., Tang, J., Qi, G.-J., Song, Y.: Unified video annotation via multigraph learning. IEEE Trans. Circuits Syst. Video Technol. 19(5), 733–746 (2009)CrossRef

36.

Wang, M., Li, H., Tao, D., Lu, K., Wu, X.: Multimodal graph-based reranking for web image search. IEEE Trans. Image Process. 21(11), 4649–4661 (2012)MathSciNetCrossRef

37.

Yang, M., Yu, K.: Real-time clothing recognition in surveillance videos. In: ICIP (2011)

38.

Zhang, L., Kalashnikov, D.V., Mehrotra, S.: A unified framework for context assisted face clustering. In: ACM International Conference on Multimedia Retrieval (ACM ICMR 2013), Dallas, Texas, USA, April 16–19 (2013)

39.

Zhang, L., Vaisenberg, R., Mehrotra, S., Kalashnikov, D.V.: Video entity resolution: applying er techniques for smart video surveillance. In: IQ2S Workshop in Conjunction with IEEE PERCOM 2011 (2011)

40.

Zhong, H., Shi, J., Visontai, M.: Detecting unusual activity in video. In: CVPR (2004)

Titel: Context-based person identification framework for smart video surveillance
verfasst von: Liyan Zhang
Dmitri V. Kalashnikov
Sharad Mehrotra
Ronen Vaisenberg
Publikationsdatum: 01.10.2014
Verlag: Springer Berlin Heidelberg
Erschienen in: Machine Vision and Applications / Ausgabe 7/2014
Print ISSN: 0932-8092
Elektronische ISSN: 1432-1769
DOI: https://doi.org/10.1007/s00138-013-0535-8

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Weitere Artikel der Ausgabe 7/2014

Semi-supervised Unified Latent Factor learning with multi-view data

Free-viewpoint video relighting from multi-view sequence under general illumination

Realistic human action recognition by Fast HOG3D and self-organization feature map

Multi-scale patch-based sparse appearance model for robust object tracking

Special issue on contextual vision computing

Image visual attention computation and application via the learning of object attributes

Premium Partner