TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching

Abstract

Singing voice synthesis has made remarkable progress in generating natural and high-quality voices. However, existing methods rarely provide precise control over vocal techniques such as intensity, mixed voice, falsetto, bubble, and breathy tones, thus limiting the expressive potential of synthetic voices. We introduce TechSinger, an advanced system for controllable singing voice synthesis that supports five languages and seven vocal techniques. TechSinger leverages a flow-matching-based generative model to produce singing voices with enhanced expressive control over various techniques. To enhance the diversity of training data, we develop a technique detection model that automatically annotates datasets with phoneme-level technique labels. Additionally, our prompt-based technique prediction model enables users to specify desired vocal attributes through natural language, offering fine-grained control over the synthesized singing. Experimental results demonstrate that TechSinger significantly enhances the expressiveness and realism of synthetic singing voices, outperforming existing methods in terms of audio quality and technique-specific control.

Cite

Text

Guo et al. "TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I22.34571

Markdown

[Guo et al. "TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/guo2025aaai-techsinger/) doi:10.1609/AAAI.V39I22.34571

BibTeX

@inproceedings{guo2025aaai-techsinger,
  title     = {{TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching}},
  author    = {Guo, Wenxiang and Zhang, Yu and Pan, Changhao and Huang, Rongjie and Tang, Li and Li, Ruiqi and Hong, Zhiqing and Wang, Yongqi and Zhao, Zhou},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {23978-23986},
  doi       = {10.1609/AAAI.V39I22.34571},
  url       = {https://mlanthology.org/aaai/2025/guo2025aaai-techsinger/}
}