Sitemap

Complete list of all pages and content on this website.

Page List

Yougen Yuan - AI Researcher & Engineer

Post List

publications

A Hybrid Virtual Bass System with Improved Phase Vocoder and High Efficiency

Published in Proc. ISCSLP, 2014

Use Google Scholar for full citation

Recommended citation: Shaofei Zhang, Lei Xie, Zhong-Hua Fu, Yougen Yuan, "A Hybrid Virtual Bass System with Improved Phase Vocoder and High Efficiency." Proc. ISCSLP, 2014.

Externalization Improvement in a Real-time Binaural Sound Image Rendering System

Published in Proc. ICOT, 2015

Use Google Scholar for full citation

Recommended citation: Yougen Yuan, Zhonghua Fu, Ming Xu, Lei Xie, Qi Cong, "Externalization Improvement in a Real-time Binaural Sound Image Rendering System." Proc. ICOT, 2015.

Deep neural network derived bottleneck features for accurate audio classification

Published in Proc. ICMEW, 2016

Use Google Scholar for full citation

Recommended citation: Bihong Zhang, Lei Xie, Yougen Yuan, Huaiping Ming, Dongyan Huang, Mingli Song, "Deep neural network derived bottleneck features for accurate audio classification." Proc. ICMEW, 2016.

Learning Neural Network Representation using Cross-lingual Bottleneck Features with Word-pair Information

Published in Proc. INTERSPEECH, 2016

Use Google Scholar for full citation

Recommended citation: Yougen Yuan, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li, "Learning Neural Network Representation using Cross-lingual Bottleneck Features with Word-pair Information." Proc. INTERSPEECH, 2016.

Extracting Bottleneck Features and Word-like Pairs from Untranscribed Speech for Feature Representation

Published in Proc. ASRU, 2017

Use Google Scholar for full citation

Recommended citation: Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li, "Extracting Bottleneck Features and Word-like Pairs from Untranscribed Speech for Feature Representation." Proc. ASRU, 2017.

Pairwise Learning using Multi-lingual Bottleneck Features for Low-resource Query-by-example Spoken Term Detection

Published in Proc. ICASSP, 2017

Use Google Scholar for full citation

Recommended citation: Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li, "Pairwise Learning using Multi-lingual Bottleneck Features for Low-resource Query-by-example Spoken Term Detection." Proc. ICASSP, 2017.

Sound Image Externalization for Headphone based Real time 3D Audio

Published in Frontiers of Computer Science, 2017

Use Google Scholar for full citation

Recommended citation: Yougen Yuan, Lei Xie, Zhonghua Fu, Ming Xu, Qi Cong, "Sound Image Externalization for Headphone based Real time 3D Audio." Frontiers of Computer Science, 2017.

Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search

Published in Proc. INTERSPEECH, 2018

Use Google Scholar for full citation

Recommended citation: Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li, "Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search." Proc. INTERSPEECH, 2018.

Many-to-many voice conversion based on bottleneck features with variational autoencoder for non-parallel training data

Published in Proc. APSIPA ASC, 2018

Use Google Scholar for full citation

Recommended citation: Yanping Li, Kong Aik Lee, Yougen Yuan, Haizhou Li, Zhen Yang, "Many-to-many voice conversion based on bottleneck features with variational autoencoder for non-parallel training data." Proc. APSIPA ASC, 2018.

Deep audio-visual system for closed-set word-level speech recognition

Published in 2019 International Conference on Multimodal Interaction, 2019

Use Google Scholar for full citation

Recommended citation: Yougen Yuan, Wei Tang, Minhao Fan, Yue Cao, Peng Zhang, Lei Xie, "Deep audio-visual system for closed-set word-level speech recognition." 2019 International Conference on Multimodal Interaction, 2019.

Query-by-example speech search using recurrent neural acoustic word embeddings with temporal context

Published in IEEE Access, 2019

Use Google Scholar for full citation

Recommended citation: Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, "Query-by-example speech search using recurrent neural acoustic word embeddings with temporal context." IEEE Access, 2019.

Verifying deep keyword spotting detection with acoustic word embeddings

Published in 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2019

Use Google Scholar for full citation

Recommended citation: Yougen Yuan, Zhiqiang Lv, Shen Huang, Lei Xie, "Verifying deep keyword spotting detection with acoustic word embeddings." 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2019.

Fast query-by-example speech search using attention-based deep binary embeddings

Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020

Use Google Scholar for full citation

Recommended citation: Yougen Yuan, Lei Xie, Cheung-Chi Leung, Hongjie Chen, Bin Ma, "Fast query-by-example speech search using attention-based deep binary embeddings." IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020.

VRM-Phase I VKW system description of long-short video customizable keyword wakeup challenge

Published in arXiv preprint arXiv:2110.15316, 2021

Use Google Scholar for full citation

Recommended citation: Yougen Yuan, Zhiqiang Lv, Shen Huang, Pengfei Hu, "VRM-Phase I VKW system description of long-short video customizable keyword wakeup challenge." arXiv preprint arXiv:2110.15316, 2021.

A Method of Audio-Visual Person Verification by Mining Connections between Time Series

Published in Proc. INTERSPEECH, 2023

Use Google Scholar for full citation

Recommended citation: Peiwen Sun, Shanshan Zhang, Zishan Liu, Yougen Yuan, Taotao Zhang, Honggang Zhang, Pengfei Hu, "A Method of Audio-Visual Person Verification by Mining Connections between Time Series." Proc. INTERSPEECH, 2023.

Yougen Yuan

Sitemap

Page List

Page Not Found

Yougen Yuan - AI Researcher & Engineer

Page Layout Examples

Posts by Category

Posts by Collection

Curriculum Vitae

MarkdownGuide

Page Archive

Publications

Sitemap

Posts by Tags

Talk Map

Talks

Teaching Experience

Privacy Policy & Terms of Use

Blogposts

Markdown Generator

Post List

publications

A Hybrid Virtual Bass System with Improved Phase Vocoder and High Efficiency

Externalization Improvement in a Real-time Binaural Sound Image Rendering System

Deep neural network derived bottleneck features for accurate audio classification

Learning Neural Network Representation using Cross-lingual Bottleneck Features with Word-pair Information

Extracting Bottleneck Features and Word-like Pairs from Untranscribed Speech for Feature Representation

Pairwise Learning using Multi-lingual Bottleneck Features for Low-resource Query-by-example Spoken Term Detection

Sound Image Externalization for Headphone based Real time 3D Audio

Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search

Many-to-many voice conversion based on bottleneck features with variational autoencoder for non-parallel training data

Deep audio-visual system for closed-set word-level speech recognition

Query-by-example speech search using recurrent neural acoustic word embeddings with temporal context

Verifying deep keyword spotting detection with acoustic word embeddings

Fast query-by-example speech search using attention-based deep binary embeddings

VRM-Phase I VKW system description of long-short video customizable keyword wakeup challenge

A Method of Audio-Visual Person Verification by Mining Connections between Time Series

talks