Sung-Feng Huang
About Me
Sung-Feng Huang is a Research Scientist at NVIDIA Research Taiwan. His research focuses on generative AI for speech and audio, with expertise in speech recognition, synthesis, separation, and machine learning techniques such as self-supervised and meta learning.
He received his Ph.D. from National Taiwan University, where he was co-advised by Prof. Lin-shan Lee and Prof. Hung-yi Lee. During his doctoral studies, he worked on speech processing and machine learning methods for AI-driven audio applications.
Before joining NVIDIA as a full-time Research Scientist, Sung-Feng interned with the same research team. He now works on advancing generative AI for speech and audio, with applications in human-computer interaction and audio-based AI systems.
Work Experience
Research Scientist @ NVIDIA Research Taiwan Mar. 2025 - Present
AI Researcher Intern @ NVIDIA Research Taiwan Oct. 2023 - Aug. 2024
NLP Researcher Intern @ Apple Jun. 2019 - Sep. 2019
- Input and Intelligence Team
ML Researcher Intern @ HTC Jul. 2017 - Dec. 2017
- Department of Deep Learning and Algorithm
Teaching Assistant for Machine Learning, Linear Algebra, Digital Speech Processing
Reviewer for ICASSP, ACL, ICML, AAAI, COCOSDA, ISCSLP, SLT, Interspeech
Selected Publications
Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration
Pin-Jui Ku, Alexander H. Liu, Roman Korostik, Sung-Feng Huang, Szu-Wei Fu, Ante Jukić. IEEE ICASSP 2025
Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits
Sung-Feng Huang, Heng-Cheng Kuo, Zhehuai Chen, Xuesong Yang, Chao-Han Huck Yang, Yu Tsao, Yu-Chiang Frank Wang, Hung-yi Lee, Szu-Wei Fu. IEEE SLT 2024
Maximizing Data Efficiency for Cross-Lingual TTS Adaptation by Self-Supervised Representation Mixing and Embedding Initialization
Wei-Ping Huang, Sung-Feng Huang, Hung-yi Lee. IEEE ASRU 2023
Personalized Lightweight Text-to-Speech: Voice Cloning with Adaptive Structured Pruning
Sung-Feng Huang, Chia-Ping Chen, Zhi-Sheng Chen, Yu-Pao Tsai, Hung-yi Lee. IEEE ICASSP 2023
Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-Speech
Sung-Feng Huang, Chyi-Jiunn Lin, Da-Rong Liu, Yi-Chen Chen, Hung-yi Lee. IEEE/ACM TASLP 2022
Learning Phone Recognition From Unpaired Audio and Phone Sequences Based on Generative Adversarial Network
Da-rong Liu, Po-chun Hsu, Yi-chen Chen, Sung-feng Huang, Shun-po Chuang, Da-yi Wu, Hung-yi Lee. IEEE/ACM TASLP 2021
Non-Autoregressive Mandarin-English Code-Switching Speech Recognition
Shun-Po Chuang, Heng-Jui Chang, Sung-Feng Huang, Hung-yi Lee. IEEE ASRU 2021
Stabilizing Label Assignment for Speech Separation by Self-supervised Pre-training
Sung-Feng Huang, Shun-Po Chuang, Da-Rong Liu, Yi-Chen Chen, Gene-Ping Yang, Hung-yi Lee. Interspeech 2020
Pretrained Language Model Embryology: The Birth of ALBERT
Cheng-Han Chiang, Sung-Feng Huang, Hung-yi Lee. EMNLP 2020
Audio Word2vec: Sequence-to-Sequence Autoencoding for Unsupervised Learning of Audio Segmentation and Representation
Yi-Chen Chen, Sung-Feng Huang, Hung-yi Lee, Yu-Hsuan Wang, Chia-Hao Shen. IEEE/ACM TASLP 2019
Phonetic-and-Semantic Embedding of Spoken Words with Applications in Spoken Content Retrieval
Yi-Chen Chen, Sung-Feng Huang, Chia-Hao Shen, Hung-yi Lee, Lin-shan Lee. IEEE SLT 2018
For the complete list, please visit my Google Scholar profile.
Education
Honors
NTU GICE Excellent Elite Cultivation Program Scholarship NTU GICE 2019 - 2023
NTU Direct-Entry Ph.D. Program Scholarship NTU 2019 - 2022
13th Asian Physics Olympiad (APhO) - Bronze Prize & Best Researcher Award APhO 2013