Visual Basic Speech Recognition Program

Curriculum Learning aided Audio-Visual Speech Recognition with Arbitrary Speaker Number

Abstract: Recently, audio-visual speech recognition has attracted increasing attention. However, most existing works only focused on scenarios with two speakers. In this work, we study the effect of ...

IEEE

mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition

Abstract: Audio-Visual Speech Recognition (AVSR) combines lip-based video with audio and can improve performance in noise, but most methods are trained only on English data. One limitation is the lack ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Curriculum Learning aided Audio-Visual Speech Recognition with Arbitrary Speaker Number

mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition

Trending now