Hi there 🙌
I am a first-year Ph.D. student in Computer Science at The University of Hong Kong (HKU), privileged to be jointly supervised by Prof. Reynold C.K. Cheng, Prof. Francis C.M. Lau, and Prof. Yupeng Li. Before joining HKU, I had the honor of working under Prof. Zhizheng Wu at the Chinese University of Hong Kong, Shenzhen and the Shanghai AI Laboratory as a research assistant, where I contributed as a core developer and maintainer for the open-source project Amphion, and Prof. Zhen Ming (Jack) Jiang at York University as a Mitacs Research Intern.
I am the creator of Emilia, a leading dataset in expressive and spontaneous text-to-speech (TTS) synthesis, along with its preprocessing pipeline, Emilia-Pipe. As of April 2025, Emilia has been downloaded over 440k times by more than 700 research institutions/companies, including Stanford, CMU, OpenAI, Google, and NVIDIA. It has also been recognized as the "most liked dataset" in the audio category on HuggingFace and become a foundational training dataset for state-of-the-art TTS models such as F5-TTS, MaskGCT, and SparkTTS.
My current research interests revolve around Social Computing and Large Language Models (LLMs), where I aim to leverage LLMs to address critical societal challenges such as misinformation, fake news, and deepfakes.
Links 🔗