Abstract
The human auditory cortex contextually integrates audio-visual (AV) cues to enhance the comprehension of speech in noisy environments. Numerous studies have investigated the effectiveness of AV integration for speech enhancement (SE). This paper evaluates the effectiveness of the COG-MHEAR AV SE Challenge baseline model using an out-of-domain free-flowing corpus. Experimental results indicate that the COG-MHEAR AV SE Challenge baseline model exhibits superior performance when applied to an out-of-domain corpus.
Original language | English |
---|---|
Pages | 75-78 |
Number of pages | 4 |
DOIs | |
Publication status | Published - 1 Sept 2024 |
Event | 3rd COG-MHEAR Workshop on Audio-Visual Speech Enhancement - Kos, Greece Duration: 1 Sept 2024 → 1 Sept 2024 https://www.isca-archive.org/avsec_2024/index.html |
Workshop
Workshop | 3rd COG-MHEAR Workshop on Audio-Visual Speech Enhancement |
---|---|
Country/Territory | Greece |
City | Kos |
Period | 1/09/24 → 1/09/24 |
Internet address |