Deep audio-visual system for closed-set word-level speech recognition

Published in the 2019 International Conference on Multimodal Interaction, 2019

See Google Scholar for the full citation.

Recommended citation: Yougen Yuan, Wei Tang, Minhao Fan, Yue Cao, Peng Zhang, Lei Xie, "Deep audio-visual system for closed-set word-level speech recognition." 2019 International Conference on Multimodal Interaction, 2019.