skip to main content
10.1145/1631272.1631463acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Brain state decoding for rapid image retrieval

Published:19 October 2009Publication History

ABSTRACT

Human visual perception is able to recognize a wide range of targets under challenging conditions, but has limited throughput. Machine vision and automatic content analytics can process images at a high speed, but suffers from inadequate recognition accuracy for general target classes. In this paper, we propose a new paradigm to explore and combine the strengths of both systems. A single trial EEG-based brain machine interface (BCI) subsystem is used to detect objects of interest of arbitrary classes from an initial subset of images. The EEG detection outcomes are used as input to a graph-based pattern mining subsystem to identify, refine, and propagate the labels to retrieve relevant images from a much larger pool. The combined strategy is unique in its generality, robustness, and high throughput. It has great potential for advancing the state of the art in media retrieval applications. We have evaluated and demonstrated significant performance gains of the proposed system with multiple and diverse image classes over several data sets, including those from Internet (Caltech 101) and remote sensing images. In this paper, we will also present insights learned from the experiments and discuss future research directions.

References

  1. N. Bigdely-Shamlo, A. Vankov, R. Ramirez, and S. Makeig. Brain Activity-Based Image Classification From Rapid Serial Visual Presentation. IEEE Trans. on NSRE, 16(5):432--441, 2008.Google ScholarGoogle Scholar
  2. N. Bigdely-Shamlo, A. Vankov, R. R. Ramirez, and S. Makeig. Brain activity-based image classification from rapid serial visual presentation. IEEE Trans. on NSRE, 16(5):432--441, Oct. 2008.Google ScholarGoogle Scholar
  3. J. Donoghue. Bridging the brain to the world: a perspective on neural interface systems. Neuron, 60(3):511--521, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  4. M. Dyrholm, C. Christoforou, and L. Parra. Bilinear discriminant component analysis. JMLR, 8:1097--1111, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. The PASCAL Visual Object Classes Challenge 2008 (VOC2008) Results. http://www.pascalnetwork.org/challenges/VOC/voc2008/workshop/index.html.Google ScholarGoogle Scholar
  6. L. Fei-Fei, R. Fergus, and P. Perona. Learning generative visual models from few training examples: An incremental bayesian approach tested on 101 object categories. CVIU, 106(1):59--70, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Fergus and P. Perona. A Visual Category Filter for Google Images. In Proc. ECCV, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  8. A. Gerson, L. Parra, and P. Sajda. Cortically coupled computer vision for rapid image search. IEEE Trans. on NSRE, 14(2):174--179, 2006.Google ScholarGoogle Scholar
  9. M. Gladwell. Blink: The power of thinking without thinking. Little, brown and company: Time warner book group, New York, 2005.Google ScholarGoogle Scholar
  10. M. Hein and M. Maier. Manifold denoising. Proc. NIPS, 19, 2006.Google ScholarGoogle Scholar
  11. T. Huang, C. Dagli, S. Rajaram, E. Chang, M. Mandel, G. Poliner, and D. Ellis. Active Learning for Interactive Multimedia Retrieval. Proc. of the IEEE, 96(4):648--667, 2008.Google ScholarGoogle Scholar
  12. T. Jebara, J. Wang, and S.-F. Chang. Graph construction and b-matching for semi-supervised learning. In Proc. ICML, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Y. Jiang, C. Ngo, and J. Yang. Towards optimal bag-of-features for object categorization and semantic video retrieval. In Proc. of CIVR, pages 494--501, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Y. Jing and S. Baluja. VisualRank: Applying PageRank to Large-Scale Image Search. IEEE Trans. on PAMI, 12, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. A. Kapoor, P. Shenoy, and D. Tan. Combining brain computer interfaces with vision for object categorization. In Proc. CVPR, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  16. K. Kay, T. Naselaris, R. Prenger, and J. Gallant. Identifying natural images from human brain activity. Nature, 452(7185):352--355, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  17. C. Keysers, D. Xiao, P. Foldiak, and D. Perrett. The speed of sight. Journal of Cognitive Neuroscience, 13(1):90--101, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. Langford, L. Li, and T. Zhang. Sparse Online Learning via Truncated Gradient. JMLR, 10:777--801, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. D. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 60(2):91--110, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. C. Micchelli and M. Pontil. Learning the kernel function via regularization. JMLR, 6(2):1099--1125, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. K. Mikolajczyk and C. Schmid. Scale&affine invariant interest point detectors. IJCV, 60(1):63--86, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. T. Mitchell, R. Hutchinson, R. Niculescu, F. Pereira, X. Wang, M. Just, and S. Newman. Learning to decode cognitive states from brain images. Machine Learning, 57(1):145--175, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Y. Miyawaki, H. Uchida, O. Yamashita, M. Sato, Y. Morito, H. Tanabe, N. Sadato, and Y. Kamitani. Visual Image Reconstruction from Human Brain Activity using a Combination of Multiscale Local Image Decoders. Neuron, 60(5):915--929, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  24. A. Oliva. Gist of the scene. In Encyclopedia of Neurobiology of Attention, pages 251--256, San Diego, CA, 2005. Elsevier.Google ScholarGoogle Scholar
  25. L. Parra, C. Christoforou, A. Gerson, M. Dyrholm, A. Luo, M. Wagner, M. Philiastides, and P. Sajda. Spatiotemporal linear decoding of brain state: Application to performance augmentation in high-throughput tasks. IEEE Signal Processing Magazine, 25(1):95--115, January 2008.Google ScholarGoogle ScholarCross RefCross Ref
  26. L. Parra, C. Christoforou, A. Gerson, M. Dyrholm, A. Luo, M. Wagner, M. Philiastides, and P. Sajda. Spatiotemporal linear decoding of brain state: Application to performance augmentation in high-throughput tasks. IEEE Signal Processing Magazine, 25(1):95--115, January 2008.Google ScholarGoogle ScholarCross RefCross Ref
  27. P. Poolman, R. Frank, P. Luu, S. Pederson, and D. Tucker. A single-trial analytic framework for EEG analysis and its application to target detection and classification. NeuroImage, 42(2):787--798, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  28. M. Potter and E. Levy. Recognition memory for a rapid sequence of pictures. Journal of Experimental Psychology, 81(1):10, 1969.Google ScholarGoogle ScholarCross RefCross Ref
  29. Y. Rui, T. Huang, M. Ortega, and S. Mehrotra. Relevance feedback: a power tool for interactive content-based image retrieval. IEEE Trans. on CSVT, 8(5):644--655, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. M. Sanderson and P. Clough. cross-language image retrieval track. http://imageclef.org/.Google ScholarGoogle Scholar
  31. P. Shenoy and D. Tan. Human-Aided Computing: Utilizing Implicit Human Processing to Classify Images. In Proc. CHI. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. A. F. Smeaton, P. Over, and W. Kraaij. Evaluation campaigns and trecvid. In MIR '06: Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval, pages 321--330, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. S. Thorpe, D. Fize, and C. Marlot. Speed of processing in the human visual system. Nature, 381(6582):520--522, 1996.Google ScholarGoogle ScholarCross RefCross Ref
  34. J. Wang, S. F. Chang, X. Zhou, and S. T. C. Wong. Active microscopic cellular image annotation by superposable graph transduction with imbalanced labels. In Proc. CVPR, 2008.Google ScholarGoogle Scholar
  35. J. Wang, T. Jebara, and S.-F. Chang. Graph transduction via alternating minimization. In Proc. ICML, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. J. Wang, Y.-G. Jiang, and S.-F. Chang. Label diagnosis through self tuning for web image search. In Proc. CVPR, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  37. D. Zhou, J. Weston, A. Gretton, O. Bousquet, and B. Scholkopf. Ranking on data manifolds. In Proc. NIPS, 2004.Google ScholarGoogle Scholar

Index Terms

  1. Brain state decoding for rapid image retrieval

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          MM '09: Proceedings of the 17th ACM international conference on Multimedia
          October 2009
          1202 pages
          ISBN:9781605586083
          DOI:10.1145/1631272

          Copyright © 2009 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 19 October 2009

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate995of4,171submissions,24%

          Upcoming Conference

          MM '24
          MM '24: The 32nd ACM International Conference on Multimedia
          October 28 - November 1, 2024
          Melbourne , VIC , Australia

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader