Audio-Visual Speech Recognition (AVSR) corpus of MISP2021 challenge ... The corpus and the code are released to promote the research not only in speech area but also for the computer vision area and ...
Denoise any real-world audio/video and obtain the clean speech. Works in unconstrained settings for any speaker in any language. Inputs only audio but uses the benefits of lip movements by generating ...