Skip to main content
Erschienen in: International Journal of Computer Vision 10/2023

13.06.2023 | Manuscript

Full-Spectrum Out-of-Distribution Detection

verfasst von: Jingkang Yang, Kaiyang Zhou, Ziwei Liu

Erschienen in: International Journal of Computer Vision | Ausgabe 10/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Existing out-of-distribution (OOD) detection literature clearly defines semantic shift as a sign of OOD but does not have a consensus over covariate shift. Samples experiencing covariate shift but not semantic shift from the in-distribution (ID) are either excluded from the test set or treated as OOD, which contradicts the primary goal in machine learning—being able to generalize beyond the training distribution. In this paper, we take into account both shift types and introduce full-spectrum OOD (F-OOD) detection, a more realistic problem setting that considers both detecting semantic shift and being tolerant to covariate shift; and design three benchmarks. These new benchmarks have a more fine-grained categorization of distributions (i.elet@tokeneonedot, training ID, covariate-shifted ID, near-OOD, and far-OOD) for the purpose of more comprehensively evaluating the pros and cons of algorithms. To address the F-OOD detection problem, we propose SEM, a simple feature-based semantics score function. SEM is mainly composed of two probability measures: one is based on high-level features containing both semantic and non-semantic information, while the other is based on low-level feature statistics only capturing non-semantic image styles. With a simple combination, the non-semantic part is canceled out, which leaves only semantic information in SEM that can better handle F-OOD detection. Extensive experiments on the three new benchmarks show that SEM significantly outperforms current state-of-the-art methods. Our code and benchmarks are released in https://​github.​com/​Jingkang50/​OpenOOD.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
With a slight abuse of notation, we use \(\varvec{x}\) here to denote an image.
 
Literatur
Zurück zum Zitat Bengio, Y., Courville, A., & Vincent, P. (2013). Representation learning: A review and new perspectives. In IEEE transactions on pattern analysis and machine intelligence (TPAMI). Bengio, Y., Courville, A., & Vincent, P. (2013). Representation learning: A review and new perspectives. In IEEE transactions on pattern analysis and machine intelligence (TPAMI).
Zurück zum Zitat Choi, H., Jang, E., & Alemi, A.A. (2018). WAIC, but why? Generative ensembles for robust anomaly detection. arXiv preprint arXiv:1810.01392. Choi, H., Jang, E., & Alemi, A.A. (2018). WAIC, but why? Generative ensembles for robust anomaly detection. arXiv preprint arXiv:​1810.​01392.
Zurück zum Zitat Cimpoi, M., Maji, S., Kokkinos, I., Mohamed, S. & Vedaldi, A. (2014). Describing textures in the wild. In Proceedings of the ieee conference on computer vision and pattern recognition (CVPR). Cimpoi, M., Maji, S., Kokkinos, I., Mohamed, S. & Vedaldi, A. (2014). Describing textures in the wild. In Proceedings of the ieee conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR). He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Hendrycks, D. & Dietterich, T. (2019). Benchmarking neural network robustness to common corruptions and perturbations. In Proceedings of international conference on learning representations (ICLR). Hendrycks, D. & Dietterich, T. (2019). Benchmarking neural network robustness to common corruptions and perturbations. In Proceedings of international conference on learning representations (ICLR).
Zurück zum Zitat Hendrycks, D. & Gimpel, K. (2017). A baseline for detecting misclassified and out-of-distribution examples in neural networks. In Proceedings of international conference on learning representations (ICLR). Hendrycks, D. & Gimpel, K. (2017). A baseline for detecting misclassified and out-of-distribution examples in neural networks. In Proceedings of international conference on learning representations (ICLR).
Zurück zum Zitat Hendrycks, D., Mazeika, M., & Dietterich, T. (2019). Deep anomaly detection with outlier exposure. In Proceedings of international conference on learning representations (ICLR). Hendrycks, D., Mazeika, M., & Dietterich, T. (2019). Deep anomaly detection with outlier exposure. In Proceedings of international conference on learning representations (ICLR).
Zurück zum Zitat Huang, Gao., Liu, Zhuang., Van Der Maaten, L., & Weinberger, K.Q. (2017). Densely connected convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR). Huang, Gao., Liu, Zhuang., Van Der Maaten, L., & Weinberger, K.Q. (2017). Densely connected convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Huang, X. & Belongie, S. (2017). Arbitrary style transfer in real-time with adaptive instance normalization. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR). Huang, X. & Belongie, S. (2017). Arbitrary style transfer in real-time with adaptive instance normalization. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Hull, Jonathan J. (1994). A database for handwritten text recognition research. IEEE Transactions on Pattern Analysis and Machine Intelligence, 16(5), 550–554.CrossRef Hull, Jonathan J. (1994). A database for handwritten text recognition research. IEEE Transactions on Pattern Analysis and Machine Intelligence, 16(5), 550–554.CrossRef
Zurück zum Zitat Krizhevsky, A., Hinton, G., et al. (2009). Learning multiple layers of features from tiny images. Citeseer. Krizhevsky, A., Hinton, G., et al. (2009). Learning multiple layers of features from tiny images. Citeseer.
Zurück zum Zitat Lee, K., Lee, K., Lee, H., & Shin, J. (2018). A simple unified framework for detecting out-of-distribution samples and adversarial attacks. In: Proceedings of advances in neural information processing systems (NeurIPS). Lee, K., Lee, K., Lee, H., & Shin, J. (2018). A simple unified framework for detecting out-of-distribution samples and adversarial attacks. In: Proceedings of advances in neural information processing systems (NeurIPS).
Zurück zum Zitat Li, Y., & Vasconcelos, N. (2020). Background data resampling for outlier-aware classification. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR). Li, Y., & Vasconcelos, N. (2020). Background data resampling for outlier-aware classification. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Liang, S., Li, Y., & Srikant, R. (2017). Enhancing the reliability of out-of-distribution image detection in neural networks. In Proceedings of international conference on learning representations (ICLR). Liang, S., Li, Y., & Srikant, R. (2017). Enhancing the reliability of out-of-distribution image detection in neural networks. In Proceedings of international conference on learning representations (ICLR).
Zurück zum Zitat Lin, C., Yuan, Z., Zhao, S., Sun, P., Wang, C., & Cai, J. (2021). Domain-invariant disentangled network for generalizable object detection. In Proceedings of the IEEE international conference on computer vision (ICCV). Lin, C., Yuan, Z., Zhao, S., Sun, P., Wang, C., & Cai, J. (2021). Domain-invariant disentangled network for generalizable object detection. In Proceedings of the IEEE international conference on computer vision (ICCV).
Zurück zum Zitat Liu, W., Wang, X., Owens, J., & Li Y. (2020). Energy-based out-of-distribution detection. In: Proceedings of advances in neural information processing systems (NeurIPS). Liu, W., Wang, X., Owens, J., & Li Y. (2020). Energy-based out-of-distribution detection. In: Proceedings of advances in neural information processing systems (NeurIPS).
Zurück zum Zitat Ming, Y., Yin, H., & Li, Y.(2021). On the impact of spurious correlation for out-of-distribution detection. arXiv preprint arXiv:2109.05642. Ming, Y., Yin, H., & Li, Y.(2021). On the impact of spurious correlation for out-of-distribution detection. arXiv preprint arXiv:​2109.​05642.
Zurück zum Zitat Netzer, Y., Wang, T., Coates, A., Bissacco, A., Wu, B., Ng, A.Y. (2011). Reading digits in natural images with unsupervised feature learning. In Proceedings of NIPS workshop on deep learning and unsupervised feature learning. Netzer, Y., Wang, T., Coates, A., Bissacco, A., Wu, B., Ng, A.Y. (2011). Reading digits in natural images with unsupervised feature learning. In Proceedings of NIPS workshop on deep learning and unsupervised feature learning.
Zurück zum Zitat Peng, X., Huang, Z., Sun, X., Saenko, K. (2019). Domain agnostic learning with disentangled representations. In International conference on machine learning, pp. 5102–5112. PMLR. Peng, X., Huang, Z., Sun, X., Saenko, K. (2019). Domain agnostic learning with disentangled representations. In International conference on machine learning, pp. 5102–5112. PMLR.
Zurück zum Zitat Ren, J., Liu, P.J., Fertig, E., Snoek, J., Poplin, R., Depristo, M., Dillon, J., & Lakshminarayanan, B. (2019). Likelihood ratios for out-of-distribution detection. In Proceedings of advances in neural information processing systems (NeurIPS). Ren, J., Liu, P.J., Fertig, E., Snoek, J., Poplin, R., Depristo, M., Dillon, J., & Lakshminarayanan, B. (2019). Likelihood ratios for out-of-distribution detection. In Proceedings of advances in neural information processing systems (NeurIPS).
Zurück zum Zitat RSNA (2017). RSNA Pediatric Bone Age Challenge, 2017. RSNA (2017). RSNA Pediatric Bone Age Challenge, 2017.
Zurück zum Zitat Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., et al. (2015). Imagenet large-scale visual recognition challenge. International Journal of Computer Vision (IJCV), 115, 211–252.MathSciNetCrossRef Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., et al. (2015). Imagenet large-scale visual recognition challenge. International Journal of Computer Vision (IJCV), 115, 211–252.MathSciNetCrossRef
Zurück zum Zitat Santa Cruz, B. G., Bossa, M. N., Sölter, J., & Husch, A. D. (2021). Public covid-19 x-ray datasets and their impact on model bias-a systematic review of a significant problem. Medical Image Analysis, 74, 102225.CrossRef Santa Cruz, B. G., Bossa, M. N., Sölter, J., & Husch, A. D. (2021). Public covid-19 x-ray datasets and their impact on model bias-a systematic review of a significant problem. Medical Image Analysis, 74, 102225.CrossRef
Zurück zum Zitat Sastry, C.S., Oore, S. (2020). Detecting out-of-distribution examples with gram matrices. In: ICML. Sastry, C.S., Oore, S. (2020). Detecting out-of-distribution examples with gram matrices. In: ICML.
Zurück zum Zitat Sedlmeier, A., Gabor, T., Phan, T., Belzner, L. & Claudia L.-P. (2019). Uncertainty-based out-of-distribution detection in deep reinforcement learning. arXiv preprint arXiv:1901.02219. Sedlmeier, A., Gabor, T., Phan, T., Belzner, L. & Claudia L.-P. (2019). Uncertainty-based out-of-distribution detection in deep reinforcement learning. arXiv preprint arXiv:​1901.​02219.
Zurück zum Zitat Serrà, J., Álvarez, D., Gómez, V., Slizovskaia, O., Núñez, J.F., Luque, J. (2020). Input complexity and out-of-distribution detection with likelihood-based generative models. In Proceedings of international conference on learning representations (ICLR). Serrà, J., Álvarez, D., Gómez, V., Slizovskaia, O., Núñez, J.F., Luque, J. (2020). Input complexity and out-of-distribution detection with likelihood-based generative models. In Proceedings of international conference on learning representations (ICLR).
Zurück zum Zitat Sinha, A., Ayush, K., Song, K., Uzkent, B., Jin, H. & Ermon, S. (2021). Negative data augmentation. In Proceedings of international conference on learning representations (ICLR). Sinha, A., Ayush, K., Song, K., Uzkent, B., Jin, H. & Ermon, S. (2021). Negative data augmentation. In Proceedings of international conference on learning representations (ICLR).
Zurück zum Zitat Van Amersfoort, J., Smith, L., Teh, Y.W. & Gal, Y. (2020). In ICML: Uncertainty estimation using a single deep deterministic neural network. Van Amersfoort, J., Smith, L., Teh, Y.W. & Gal, Y. (2020). In ICML: Uncertainty estimation using a single deep deterministic neural network.
Zurück zum Zitat Vayá, M. de la I., Saborit, J.M., Montell, J.A., Pertusa, A., Bustos, A., Cazorla, M., Galant, J., Barber, X., Orozco-Beltrán, D., García-García, F. et al. (2020). Bimcv covid-19+: A large annotated dataset of RX and CT images from covid-19 patients. arXiv preprint arXiv:2006.01174. Vayá, M. de la I., Saborit, J.M., Montell, J.A., Pertusa, A., Bustos, A., Cazorla, M., Galant, J., Barber, X., Orozco-Beltrán, D., García-García, F. et al. (2020). Bimcv covid-19+: A large annotated dataset of RX and CT images from covid-19 patients. arXiv preprint arXiv:​2006.​01174.
Zurück zum Zitat Vinyals, O., Ewalds, T., Bartunov, S., Georgiev, P., Vezhnevets, A.S., Yeo, M., Makhzani, A., Küttler, H., Agapiou, J., Schrittwieser, J. et al. (2017). Starcraft ii: A new challenge for reinforcement learning. arXiv preprint arXiv:1708.04782. Vinyals, O., Ewalds, T., Bartunov, S., Georgiev, P., Vezhnevets, A.S., Yeo, M., Makhzani, A., Küttler, H., Agapiou, J., Schrittwieser, J. et al. (2017). Starcraft ii: A new challenge for reinforcement learning. arXiv preprint arXiv:​1708.​04782.
Zurück zum Zitat Vyas, A., Jammalamadaka, N., Zhu, X., Das, D., Kaul, B. & Willke, T.L. (2018). Out-of-distribution detection using an ensemble of self supervised leave-out classifiers. In Proceedings of the European conference on computer vision (ECCV). Vyas, A., Jammalamadaka, N., Zhu, X., Das, D., Kaul, B. & Willke, T.L. (2018). Out-of-distribution detection using an ensemble of self supervised leave-out classifiers. In Proceedings of the European conference on computer vision (ECCV).
Zurück zum Zitat Wang, H., Li, Z., Feng, L., & Zhang, W. (2022). Vim: Out-of-distribution with virtual-logit matching. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. Wang, H., Li, Z., Feng, L., & Zhang, W. (2022). Vim: Out-of-distribution with virtual-logit matching. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition.
Zurück zum Zitat Wang, L., Lin, Z. Q., & Wong, A. (2020). Covid-net: A tailored deep convolutional neural network design for detection of covid-19 cases from chest x-ray images. Scientific Reports, 10(1), 1–12. Wang, L., Lin, Z. Q., & Wong, A. (2020). Covid-net: A tailored deep convolutional neural network design for detection of covid-19 cases from chest x-ray images. Scientific Reports, 10(1), 1–12.
Zurück zum Zitat Xiao, H., Rasul, K. & Vollgraf, R. (2017). Fashion-mnist: A novel image dataset for benchmarking machine learning algorithms. arXiv preprint arXiv:1708.07747. Xiao, H., Rasul, K. & Vollgraf, R. (2017). Fashion-mnist: A novel image dataset for benchmarking machine learning algorithms. arXiv preprint arXiv:​1708.​07747.
Zurück zum Zitat Yang, X., He, X., Zhao, J., Zhang, Y., Zhang, S. & Xie, P. (2020). Covid-ct-dataset: A ct scan dataset about covid-19. arXiv preprint arXiv:2003.13865. Yang, X., He, X., Zhao, J., Zhang, Y., Zhang, S. & Xie, P. (2020). Covid-ct-dataset: A ct scan dataset about covid-19. arXiv preprint arXiv:​2003.​13865.
Zurück zum Zitat Yang, J., Wang, H., Feng, L., Yan, X., Zheng, H., Zhang, W., & Liu, Z. (2021). Semantically coherent out-of-distribution detection. In Proceedings of the IEEE international conference on computer vision (ICCV). Yang, J., Wang, H., Feng, L., Yan, X., Zheng, H., Zhang, W., & Liu, Z. (2021). Semantically coherent out-of-distribution detection. In Proceedings of the IEEE international conference on computer vision (ICCV).
Zurück zum Zitat Yang, J., Zhou, K., Li, Y. & Liu, Z.(2021). Generalized out-of-distribution detection: A survey. arXiv preprint arXiv:2110.11334. Yang, J., Zhou, K., Li, Y. & Liu, Z.(2021). Generalized out-of-distribution detection: A survey. arXiv preprint arXiv:​2110.​11334.
Zurück zum Zitat Zhang, H., Cisse, M., Dauphin, Y.N., Lopez-Paz, D. (2018). Mixup: Beyond empirical risk minimization. In Proceedings of international conference on learning representations (ICLR). Zhang, H., Cisse, M., Dauphin, Y.N., Lopez-Paz, D. (2018). Mixup: Beyond empirical risk minimization. In Proceedings of international conference on learning representations (ICLR).
Zurück zum Zitat Zhou, K., Liu, Z., Qiao, Y., Xiang, T., Loy, C.C. (2022). Domain generalization: A survey. IEEE transactions on pattern analysis and machine intelligence (TPAMI). Zhou, K., Liu, Z., Qiao, Y., Xiang, T., Loy, C.C. (2022). Domain generalization: A survey. IEEE transactions on pattern analysis and machine intelligence (TPAMI).
Zurück zum Zitat Zhou, K., Yang, Y., Qiao, Y., & Xiang, T. (2021). Domain generalization with mixstyle. In Proceedings of international conference on learning representations (ICLR). Zhou, K., Yang, Y., Qiao, Y., & Xiang, T. (2021). Domain generalization with mixstyle. In Proceedings of international conference on learning representations (ICLR).
Zurück zum Zitat Zhou, B., Lapedriza, A., Khosla, A., Oliva, A., & Torralba, A. (2017). Places: A 10 million image database for scene recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)., 40(6), 1452–1464.CrossRef Zhou, B., Lapedriza, A., Khosla, A., Oliva, A., & Torralba, A. (2017). Places: A 10 million image database for scene recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)., 40(6), 1452–1464.CrossRef
Metadaten
Titel
Full-Spectrum Out-of-Distribution Detection
verfasst von
Jingkang Yang
Kaiyang Zhou
Ziwei Liu
Publikationsdatum
13.06.2023
Verlag
Springer US
Erschienen in
International Journal of Computer Vision / Ausgabe 10/2023
Print ISSN: 0920-5691
Elektronische ISSN: 1573-1405
DOI
https://doi.org/10.1007/s11263-023-01811-z

Weitere Artikel der Ausgabe 10/2023

International Journal of Computer Vision 10/2023 Zur Ausgabe

Premium Partner