Skip to main content
Top

IQ-VQA: Intelligent Visual Question Answering

  • 2021
  • OriginalPaper
  • Chapter
Published in:

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The chapter 'IQ-VQA: Intelligent Visual Question Answering' delves into the intricacies of Visual Question Answering (VQA) systems, where AI models must answer natural language questions based on contextual images. It highlights the challenges posed by language priors in VQA datasets, which can lead to false impressions of model performance. The author presents a cyclic training scheme and an implication generator module to improve the consistency and robustness of VQA models. The framework is model-independent and can be integrated with any VQA architecture. The implication generator introduces linguistic variations in questions to enhance the model's understanding of contextual images. The chapter also introduces a new VQA-Implication dataset for evaluating consistency and robustness, showcasing significant improvements in model performance. The qualitative and quantitative analysis of attention maps further demonstrates the enhanced multi-modal understanding achieved through the proposed framework. The chapter concludes with a discussion on future work, emphasizing the importance of diverse and human-annotated implications for better performance.

Electronic supplementary material

The online version of this chapter (https://doi.org/10.1007/978-3-030-68790-8_28) contains supplementary material, which is available to authorized users.
V. Goel and M. Chandak—Equal Contribution.

Not a customer yet? Then find out more about our access models now:

Individual Access

Start your personal individual access now. Get instant access to more than 164,000 books and 540 journals – including PDF downloads and new releases.

Starting from 54,00 € per month!    

Get access

Access for Businesses

Utilise Springer Professional in your company and provide your employees with sound specialist knowledge. Request information about corporate access now.

Find out how Springer Professional can uplift your work!

Contact us now
Title
IQ-VQA: Intelligent Visual Question Answering
Authors
Vatsal Goel
Mohit Chandak
Ashish Anand
Prithwijit Guha
Copyright Year
2021
DOI
https://doi.org/10.1007/978-3-030-68790-8_28
This content is only visible if you are logged in and have the appropriate permissions.
This content is only visible if you are logged in and have the appropriate permissions.

Premium Partner

    Image Credits
    Neuer Inhalt/© ITandMEDIA, Nagarro GmbH/© Nagarro GmbH, AvePoint Deutschland GmbH/© AvePoint Deutschland GmbH, AFB Gemeinnützige GmbH/© AFB Gemeinnützige GmbH, USU GmbH/© USU GmbH, Ferrari electronic AG/© Ferrari electronic AG