Evaluating the Audio-Visual Speech Enhancement Challenge (AVSEC) Baseline Model Using an Out-of-Domain Free-Flowing Corpus

Kia K. Dashtipour (Lead / Corresponding author), Mandar Gogate, Adeel Hussain, Bryony Buck, Arif Reza Anwary, Tughrul Arslan, Amir Hussain

Research output: Contribution to conferencePaperpeer-review

10 Downloads (Pure)

Abstract

The human auditory cortex contextually integrates audio-visual (AV) cues to enhance the comprehension of speech in noisy environments. Numerous studies have investigated the effectiveness of AV integration for speech enhancement (SE). This paper evaluates the effectiveness of the COG-MHEAR AV SE Challenge baseline model using an out-of-domain free-flowing corpus. Experimental results indicate that the COG-MHEAR AV SE Challenge baseline model exhibits superior performance when applied to an out-of-domain corpus.
Original languageEnglish
Pages75-78
Number of pages4
DOIs
Publication statusPublished - 1 Sept 2024
Event3rd COG-MHEAR Workshop on Audio-Visual Speech Enhancement - Kos, Greece
Duration: 1 Sept 20241 Sept 2024
https://www.isca-archive.org/avsec_2024/index.html

Workshop

Workshop3rd COG-MHEAR Workshop on Audio-Visual Speech Enhancement
Country/TerritoryGreece
CityKos
Period1/09/241/09/24
Internet address

Fingerprint

Dive into the research topics of 'Evaluating the Audio-Visual Speech Enhancement Challenge (AVSEC) Baseline Model Using an Out-of-Domain Free-Flowing Corpus'. Together they form a unique fingerprint.

Cite this