Skip to main content
Top

Hybrid CNN-LSTM with Attention Mechanism for Medical Visual Question Answering

  • 2026
  • OriginalPaper
  • Chapter
Published in:

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This chapter explores the development and evaluation of a hybrid CNN-LSTM model with an attention mechanism for medical visual question answering (MedVQA). The model integrates convolutional neural networks (CNNs) for image feature extraction and long short-term memory (LSTM) networks for question encoding, enhanced by an attention mechanism to focus on relevant image regions. The study evaluates the model on two datasets: ImageCLEF VQA-MED 2019 and ImageCLEF VQA-RAD 2019, which include a variety of medical images such as X-rays, MRI, and CT scans, along with corresponding questions and answers. The results demonstrate the model's ability to achieve high training accuracy and reasonable validation accuracy, indicating its potential to improve clinical decision-making and healthcare outcomes. The chapter also discusses the challenges of class imbalances and overfitting, as well as recommendations for future improvements, such as using regularization techniques and collecting more balanced data. The findings highlight the model's effectiveness in handling complex medical question structures and its potential to transform healthcare by providing accurate and timely answers to medical queries.

Not a customer yet? Then find out more about our access models now:

Individual Access

Start your personal individual access now. Get instant access to more than 164,000 books and 540 journals – including PDF downloads and new releases.

Starting from 54,00 € per month!    

Get access

Access for Businesses

Utilise Springer Professional in your company and provide your employees with sound specialist knowledge. Request information about corporate access now.

Find out how Springer Professional can uplift your work!

Contact us now
Title
Hybrid CNN-LSTM with Attention Mechanism for Medical Visual Question Answering
Authors
Vandana Ratwani
Jitendra Bhatia
Jitali Patel
Copyright Year
2026
DOI
https://doi.org/10.1007/978-3-032-06253-6_24
This content is only visible if you are logged in and have the appropriate permissions.

Premium Partner

    Image Credits
    Neuer Inhalt/© ITandMEDIA, Nagarro GmbH/© Nagarro GmbH, AvePoint Deutschland GmbH/© AvePoint Deutschland GmbH, AFB Gemeinnützige GmbH/© AFB Gemeinnützige GmbH, USU GmbH/© USU GmbH, Ferrari electronic AG/© Ferrari electronic AG