Skip to main content
Top

Two-Stage Cascaded Speech Enhancement by Exploiting Magnitude and Phase Optimization

  • 16-05-2025
Published in:

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This article delves into the critical area of speech enhancement, addressing the persistent challenges of improving speech quality and intelligibility in noisy environments. Traditional methods, while effective, often struggle with the mismatch problem, leading to significant performance degradation. The proposed two-stage cascaded speech enhancement method offers a novel solution by separately optimizing the magnitude and phase components of speech signals. The first stage focuses on recovering the magnitude using an estimated mask network, while the second stage optimizes the phase, leveraging the complementary information between magnitude and phase. This approach not only enhances the overall performance of speech enhancement but also provides a clearer learning objective for the network, improving its interpretability. The article also introduces an implicit phase optimization architecture to avoid the interference caused by direct phase estimation, further refining the speech enhancement process. Through extensive simulations and comparisons with baseline methods, the proposed method demonstrates superior performance in terms of perceptual evaluation of speech quality (PESQ), short-time objective intelligibility (STOI), and signal-to-distortion ratio (SDR), making it a significant contribution to the field of speech enhancement.

Not a customer yet? Then find out more about our access models now:

Individual Access

Start your personal individual access now. Get instant access to more than 164,000 books and 540 journals – including PDF downloads and new releases.

Starting from 54,00 € per month!    

Get access

Access for Businesses

Utilise Springer Professional in your company and provide your employees with sound specialist knowledge. Request information about corporate access now.

Find out how Springer Professional can uplift your work!

Contact us now
Title
Two-Stage Cascaded Speech Enhancement by Exploiting Magnitude and Phase Optimization
Authors
Chaojin Qing
Linsi He
Xiaowei Fu
Hui Lin
Publication date
16-05-2025
Publisher
Springer US
Published in
Circuits, Systems, and Signal Processing / Issue 9/2025
Print ISSN: 0278-081X
Electronic ISSN: 1531-5878
DOI
https://doi.org/10.1007/s00034-025-03154-1
This content is only visible if you are logged in and have the appropriate permissions.
This content is only visible if you are logged in and have the appropriate permissions.