nach oben

International Journal of Computer Vision

Erschienen in:

13.06.2023 | Manuscript

Full-Spectrum Out-of-Distribution Detection

verfasst von: Jingkang Yang, Kaiyang Zhou, Ziwei Liu

Erschienen in: International Journal of Computer Vision | Ausgabe 10/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Existing out-of-distribution (OOD) detection literature clearly defines semantic shift as a sign of OOD but does not have a consensus over covariate shift. Samples experiencing covariate shift but not semantic shift from the in-distribution (ID) are either excluded from the test set or treated as OOD, which contradicts the primary goal in machine learning—being able to generalize beyond the training distribution. In this paper, we take into account both shift types and introduce full-spectrum OOD (F-OOD) detection, a more realistic problem setting that considers both detecting semantic shift and being tolerant to covariate shift; and design three benchmarks. These new benchmarks have a more fine-grained categorization of distributions (i.elet@tokeneonedot, training ID, covariate-shifted ID, near-OOD, and far-OOD) for the purpose of more comprehensively evaluating the pros and cons of algorithms. To address the F-OOD detection problem, we propose SEM, a simple feature-based semantics score function. SEM is mainly composed of two probability measures: one is based on high-level features containing both semantic and non-semantic information, while the other is based on low-level feature statistics only capturing non-semantic image styles. With a simple combination, the non-semantic part is canceled out, which leaves only semantic information in SEM that can better handle F-OOD detection. Extensive experiments on the three new benchmarks show that SEM significantly outperforms current state-of-the-art methods. Our code and benchmarks are released in https://github.com/Jingkang50/OpenOOD.

Vorheriger Artikel A2B: Anchor to Barycentric Coordinate for Robust Correspondence

Nächster Artikel DOVE: Learning Deformable 3D Objects by Watching Videos

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

With a slight abuse of notation, we use \(\varvec{x}\) here to denote an image.

Bengio, Y., Courville, A., & Vincent, P. (2013). Representation learning: A review and new perspectives. In IEEE transactions on pattern analysis and machine intelligence (TPAMI).

Bulatov, Y. (2011). NotMNIST dataset. http://yaroslavvb.blogspot.com/2011/09/notmnist-dataset.html.

Choi, H., Jang, E., & Alemi, A.A. (2018). WAIC, but why? Generative ensembles for robust anomaly detection. arXiv preprint arXiv:1810.01392.

Cimpoi, M., Maji, S., Kokkinos, I., Mohamed, S. & Vedaldi, A. (2014). Describing textures in the wild. In Proceedings of the ieee conference on computer vision and pattern recognition (CVPR).

He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR).

Hendrycks, D. & Dietterich, T. (2019). Benchmarking neural network robustness to common corruptions and perturbations. In Proceedings of international conference on learning representations (ICLR).

Hendrycks, D. & Gimpel, K. (2017). A baseline for detecting misclassified and out-of-distribution examples in neural networks. In Proceedings of international conference on learning representations (ICLR).

Hendrycks, D., Mazeika, M., & Dietterich, T. (2019). Deep anomaly detection with outlier exposure. In Proceedings of international conference on learning representations (ICLR).

Huang, Gao., Liu, Zhuang., Van Der Maaten, L., & Weinberger, K.Q. (2017). Densely connected convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR).

Huang, X. & Belongie, S. (2017). Arbitrary style transfer in real-time with adaptive instance normalization. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR).

Hull, Jonathan J. (1994). A database for handwritten text recognition research. IEEE Transactions on Pattern Analysis and Machine Intelligence, 16(5), 550–554.CrossRef

Krizhevsky, A., Hinton, G., et al. (2009). Learning multiple layers of features from tiny images. Citeseer.

LeCun, Y., Cortes, C. & Burges, C.JC. (1998). The mnist database of handwritten digits, 1998. http://yann.lecun.com/exdb/mnist.

LeCun, Y. et al. (2015). Lenet-5, convolutional neural networks. http://yann.lecun.com/exdb/lenet.

Lee, K., Lee, K., Lee, H., & Shin, J. (2018). A simple unified framework for detecting out-of-distribution samples and adversarial attacks. In: Proceedings of advances in neural information processing systems (NeurIPS).

Li, Y., & Vasconcelos, N. (2020). Background data resampling for outlier-aware classification. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR).

Liang, S., Li, Y., & Srikant, R. (2017). Enhancing the reliability of out-of-distribution image detection in neural networks. In Proceedings of international conference on learning representations (ICLR).

Lin, C., Yuan, Z., Zhao, S., Sun, P., Wang, C., & Cai, J. (2021). Domain-invariant disentangled network for generalizable object detection. In Proceedings of the IEEE international conference on computer vision (ICCV).

Liu, W., Wang, X., Owens, J., & Li Y. (2020). Energy-based out-of-distribution detection. In: Proceedings of advances in neural information processing systems (NeurIPS).

Ming, Y., Yin, H., & Li, Y.(2021). On the impact of spurious correlation for out-of-distribution detection. arXiv preprint arXiv:2109.05642.

Netzer, Y., Wang, T., Coates, A., Bissacco, A., Wu, B., Ng, A.Y. (2011). Reading digits in natural images with unsupervised feature learning. In Proceedings of NIPS workshop on deep learning and unsupervised feature learning.

Peng, X., Huang, Z., Sun, X., Saenko, K. (2019). Domain agnostic learning with disentangled representations. In International conference on machine learning, pp. 5102–5112. PMLR.

Ren, J., Liu, P.J., Fertig, E., Snoek, J., Poplin, R., Depristo, M., Dillon, J., & Lakshminarayanan, B. (2019). Likelihood ratios for out-of-distribution detection. In Proceedings of advances in neural information processing systems (NeurIPS).

RSNA (2017). RSNA Pediatric Bone Age Challenge, 2017.

Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., et al. (2015). Imagenet large-scale visual recognition challenge. International Journal of Computer Vision (IJCV), 115, 211–252.MathSciNetCrossRef

Santa Cruz, B. G., Bossa, M. N., Sölter, J., & Husch, A. D. (2021). Public covid-19 x-ray datasets and their impact on model bias-a systematic review of a significant problem. Medical Image Analysis, 74, 102225.CrossRef

Sastry, C.S., Oore, S. (2020). Detecting out-of-distribution examples with gram matrices. In: ICML.

Sedlmeier, A., Gabor, T., Phan, T., Belzner, L. & Claudia L.-P. (2019). Uncertainty-based out-of-distribution detection in deep reinforcement learning. arXiv preprint arXiv:1901.02219.

Serrà, J., Álvarez, D., Gómez, V., Slizovskaia, O., Núñez, J.F., Luque, J. (2020). Input complexity and out-of-distribution detection with likelihood-based generative models. In Proceedings of international conference on learning representations (ICLR).

Sinha, A., Ayush, K., Song, K., Uzkent, B., Jin, H. & Ermon, S. (2021). Negative data augmentation. In Proceedings of international conference on learning representations (ICLR).

Van Amersfoort, J., Smith, L., Teh, Y.W. & Gal, Y. (2020). In ICML: Uncertainty estimation using a single deep deterministic neural network.

Vayá, M. de la I., Saborit, J.M., Montell, J.A., Pertusa, A., Bustos, A., Cazorla, M., Galant, J., Barber, X., Orozco-Beltrán, D., García-García, F. et al. (2020). Bimcv covid-19+: A large annotated dataset of RX and CT images from covid-19 patients. arXiv preprint arXiv:2006.01174.

Vinyals, O., Ewalds, T., Bartunov, S., Georgiev, P., Vezhnevets, A.S., Yeo, M., Makhzani, A., Küttler, H., Agapiou, J., Schrittwieser, J. et al. (2017). Starcraft ii: A new challenge for reinforcement learning. arXiv preprint arXiv:1708.04782.

Vyas, A., Jammalamadaka, N., Zhu, X., Das, D., Kaul, B. & Willke, T.L. (2018). Out-of-distribution detection using an ensemble of self supervised leave-out classifiers. In Proceedings of the European conference on computer vision (ECCV).

Wang, H., Li, Z., Feng, L., & Zhang, W. (2022). Vim: Out-of-distribution with virtual-logit matching. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition.

Wang, L., Lin, Z. Q., & Wong, A. (2020). Covid-net: A tailored deep convolutional neural network design for detection of covid-19 cases from chest x-ray images. Scientific Reports, 10(1), 1–12.

Winther, H.B., Laser, H., Gerbel, S., Maschke, S.K., Hinrichs, J.B., Vogel-Claussen, J., Wacker, F.K., Höper, M.M., & Meyer, B.C. (2020). Covid-19 image repository. URL https://figshare.com/articles/COVID-19_Image_Repository/12275009.

Xiao, H., Rasul, K. & Vollgraf, R. (2017). Fashion-mnist: A novel image dataset for benchmarking machine learning algorithms. arXiv preprint arXiv:1708.07747.

Yang, X., He, X., Zhao, J., Zhang, Y., Zhang, S. & Xie, P. (2020). Covid-ct-dataset: A ct scan dataset about covid-19. arXiv preprint arXiv:2003.13865.

Yang, J., Wang, H., Feng, L., Yan, X., Zheng, H., Zhang, W., & Liu, Z. (2021). Semantically coherent out-of-distribution detection. In Proceedings of the IEEE international conference on computer vision (ICCV).

Yang, J., Zhou, K., Li, Y. & Liu, Z.(2021). Generalized out-of-distribution detection: A survey. arXiv preprint arXiv:2110.11334.

Zhang, H., Cisse, M., Dauphin, Y.N., Lopez-Paz, D. (2018). Mixup: Beyond empirical risk minimization. In Proceedings of international conference on learning representations (ICLR).

Zhou, K., Liu, Z., Qiao, Y., Xiang, T., Loy, C.C. (2022). Domain generalization: A survey. IEEE transactions on pattern analysis and machine intelligence (TPAMI).

Zhou, K., Yang, Y., Qiao, Y., & Xiang, T. (2021). Domain generalization with mixstyle. In Proceedings of international conference on learning representations (ICLR).

Zhou, B., Lapedriza, A., Khosla, A., Oliva, A., & Torralba, A. (2017). Places: A 10 million image database for scene recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)., 40(6), 1452–1464.CrossRef

Titel: Full-Spectrum Out-of-Distribution Detection
verfasst von: Jingkang Yang
Kaiyang Zhou
Ziwei Liu
Publikationsdatum: 13.06.2023
Verlag: Springer US
Erschienen in: International Journal of Computer Vision / Ausgabe 10/2023
Print ISSN: 0920-5691
Elektronische ISSN: 1573-1405
DOI: https://doi.org/10.1007/s11263-023-01811-z

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Weitere Artikel der Ausgabe 10/2023

Transformer-Based Context Condensation for Boosting Feature Pyramids in Object Detection

Importance First: Generating Scene Graph of Human Interest

A2B: Anchor to Barycentric Coordinate for Robust Correspondence

Pyramid NeRF: Frequency Guided Fast Radiance Field Optimization

When Multi-Focus Image Fusion Networks Meet Traditional Edge-Preservation Technology

What Limits the Performance of Local Self-attention?

Premium Partner