Skip to main content
Top

Hardware-Efficient Neural Network for Voice Disorder Classification from Multi-Source Datasets

  • 2026
  • OriginalPaper
  • Chapter
Published in:

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This chapter explores the development and implementation of a hardware-efficient convolutional neural network (CNN) for classifying voice disorders, including dysphonia and reflux laryngitis. The model leverages clinically relevant acoustic features such as jitter, shimmer, harmonics-to-noise ratio (HNR), fundamental frequency (F₀), and formants, which are critical markers for voice disorders. The CNN architecture is optimized for deployment on Field Programmable Gate Arrays (FPGAs), enabling real-time diagnostics with minimal resource consumption. The study evaluates the model's performance using the Saarbruecken Voice Database (SVD) and Voiced datasets, achieving high accuracy and demonstrating the feasibility of integrating AI-based voice diagnostics into portable medical devices. The results show that the model outperforms traditional classifiers like support vector machines (SVM) and decision trees, with an overall accuracy of 91.4%. The FPGA implementation achieves an inference time of 80 ms per sample, making it suitable for real-time applications. The chapter also discusses the trade-offs between accuracy and resource utilization, highlighting the model's efficiency and potential for deployment in clinical and mobile environments. Future research directions include expanding the model to support online voice input and incorporating multimodal data for enhanced diagnostic capabilities.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Business + Economics & Engineering + Technology"

Online-Abonnement

Springer Professional "Business + Economics & Engineering + Technology" gives you access to:

  • more than 130.000 books
  • more than 540 journals

from the following subject areas:

  • Automotive
  • Construction + Real Estate
  • Business IT + Informatics
  • Electrical Engineering + Electronics
  • Energy + Sustainability
  • Finance + Banking
  • Management + Leadership
  • Marketing + Sales
  • Mechanical Engineering + Materials
  • Surfaces + Materials Technology
  • Insurance + Risk


Secure your knowledge advantage now!

Springer Professional "Engineering + Technology"

Online-Abonnement

Springer Professional "Engineering + Technology" gives you access to:

  • more than 75.000 books
  • more than 390 journals

from the following specialised fileds:

  • Automotive
  • Business IT + Informatics
  • Construction + Real Estate
  • Electrical Engineering + Electronics
  • Energy + Sustainability
  • Mechanical Engineering + Materials
  • Surfaces + Materials Technology





 

Secure your knowledge advantage now!

Springer Professional "Business + Economics"

Online-Abonnement

Springer Professional "Business + Economics" gives you access to:

  • more than 100.000 books
  • more than 340 journals

from the following specialised fileds:

  • Construction + Real Estate
  • Business IT + Informatics
  • Finance + Banking
  • Management + Leadership
  • Marketing + Sales
  • Insurance + Risk



Secure your knowledge advantage now!

Title
Hardware-Efficient Neural Network for Voice Disorder Classification from Multi-Source Datasets
Authors
Jyoti Mishra
R. K. Sharma
Copyright Year
2026
DOI
https://doi.org/10.1007/978-3-032-07735-6_4
This content is only visible if you are logged in and have the appropriate permissions.

Premium Partner

    Image Credits
    Neuer Inhalt/© ITandMEDIA, Nagarro GmbH/© Nagarro GmbH, AvePoint Deutschland GmbH/© AvePoint Deutschland GmbH, AFB Gemeinnützige GmbH/© AFB Gemeinnützige GmbH, USU GmbH/© USU GmbH, Ferrari electronic AG/© Ferrari electronic AG