Sitemap
Complete list of all pages and content on this website.
Page List
Post List
publications
A Hybrid Virtual Bass System with Improved Phase Vocoder and High Efficiency
Published in Proc. ISCSLP, 2014
Use Google Scholar for full citation
Recommended citation: Shaofei Zhang, Lei Xie, Zhong-Hua Fu, Yougen Yuan, "A Hybrid Virtual Bass System with Improved Phase Vocoder and High Efficiency." Proc. ISCSLP, 2014.
Externalization Improvement in a Real-time Binaural Sound Image Rendering System
Published in Proc. ICOT, 2015
Use Google Scholar for full citation
Recommended citation: Yougen Yuan, Zhonghua Fu, Ming Xu, Lei Xie, Qi Cong, "Externalization Improvement in a Real-time Binaural Sound Image Rendering System." Proc. ICOT, 2015.
Deep neural network derived bottleneck features for accurate audio classification
Published in Proc. ICMEW, 2016
Use Google Scholar for full citation
Recommended citation: Bihong Zhang, Lei Xie, Yougen Yuan, Huaiping Ming, Dongyan Huang, Mingli Song, "Deep neural network derived bottleneck features for accurate audio classification." Proc. ICMEW, 2016.
Learning Neural Network Representation using Cross-lingual Bottleneck Features with Word-pair Information
Published in Proc. INTERSPEECH, 2016
Use Google Scholar for full citation
Recommended citation: Yougen Yuan, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Learning Neural Network Representation using Cross-lingual Bottleneck Features with Word-pair Information." Proc. INTERSPEECH, 2016.
Extracting Bottleneck Features and Word-like Pairs from Untranscribed Speech for Feature Representation
Published in Proc. ASRU, 2017
Use Google Scholar for full citation
Recommended citation: Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li, "Extracting Bottleneck Features and Word-like Pairs from Untranscribed Speech for Feature Representation." Proc. ASRU, 2017.
Pairwise Learning using Multi-lingual Bottleneck Features for Low-resource Query-by-example Spoken Term Detection
Published in Proc. ICASSP, 2017
Use Google Scholar for full citation
Recommended citation: Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li, "Pairwise Learning using Multi-lingual Bottleneck Features for Low-resource Query-by-example Spoken Term Detection." Proc. ICASSP, 2017.
Sound Image Externalization for Headphone based Real time 3D Audio
Published in Frontiers of Computer Science, 2017
Use Google Scholar for full citation
Recommended citation: Yougen Yuan, Lei Xie, Zhonghua Fu, Ming Xu, Qi Cong, "Sound Image Externalization for Headphone based Real time 3D Audio." Frontiers of Computer Science, 2017.
Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search
Published in Proc. INTERSPEECH, 2018
Use Google Scholar for full citation
Recommended citation: Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li, "Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search." Proc. INTERSPEECH, 2018.
Many-to-many voice conversion based on bottleneck features with variational autoencoder for non-parallel training data
Published in Proc. APSIPA ASC, 2018
Use Google Scholar for full citation
Recommended citation: Yanping Li, Kong Aik Lee, Yougen Yuan, Haizhou Li, Zhen Yang, "Many-to-many voice conversion based on bottleneck features with variational autoencoder for non-parallel training data." Proc. APSIPA ASC, 2018.
Deep audio-visual system for closed-set word-level speech recognition
Published in 2019 International Conference on Multimodal Interaction, 2019
Use Google Scholar for full citation
Recommended citation: Yougen Yuan, Wei Tang, Minhao Fan, Yue Cao, Peng Zhang, Lei Xie, "Deep audio-visual system for closed-set word-level speech recognition." 2019 International Conference on Multimodal Interaction, 2019.
Query-by-example speech search using recurrent neural acoustic word embeddings with temporal context
Published in IEEE Access, 2019
Use Google Scholar for full citation
Recommended citation: Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, "Query-by-example speech search using recurrent neural acoustic word embeddings with temporal context." IEEE Access, 2019.
Verifying deep keyword spotting detection with acoustic word embeddings
Published in 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2019
Use Google Scholar for full citation
Recommended citation: Yougen Yuan, Zhiqiang Lv, Shen Huang, Lei Xie, "Verifying deep keyword spotting detection with acoustic word embeddings." 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2019.
Fast query-by-example speech search using attention-based deep binary embeddings
Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020
Use Google Scholar for full citation
Recommended citation: Yougen Yuan, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Bin Ma, "Fast query-by-example speech search using attention-based deep binary embeddings." IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020.
VRM-Phase I VKW system description of long-short video customizable keyword wakeup challenge
Published in arXiv preprint arXiv:2110.15316, 2021
Use Google Scholar for full citation
Recommended citation: Yougen Yuan, Zhiqiang Lv, Shen Huang, Pengfei Hu, "VRM-Phase I VKW system description of long-short video customizable keyword wakeup challenge." arXiv preprint arXiv:2110.15316, 2021.
A Method of Audio-Visual Person Verification by Mining Connections between Time Series
Published in Proc. INTERSPEECH, 2023
Use Google Scholar for full citation
Recommended citation: Peiwen Sun, Shanshan Zhang, Zishan Liu, Yougen Yuan, Taotao Zhang, Honggang Zhang, Pengfei Hu, "A Method of Audio-Visual Person Verification by Mining Connections between Time Series." Proc. INTERSPEECH, 2023.
