[GitHub Trending] OpenBMB/VoxCPM
Tokenizer-free TTS: technically advanced, but peripheral to the reader's focus.
OpenBMB's VoxCPM2 is a 2B-parameter, tokenizer-free text-to-speech (TTS) model built on MiniCPM-4 and trained on more than 2M hours of multilingual speech. It generates 48 kHz audio via AudioVAE V2 and supports voice design from text, controllable cloning from reference clips, and real-time streaming (real-time factor, RTF, of 0.13 with vLLM-Omni). It is Apache-2.0 licensed and can be run on an RTX 4090 or served through a vLLM API.
- Integrate VoxCPM2 into your stack using vLLM-Omni for real-time, multilingual TTS with voice cloning and design capabilities.
For platform engineers building AI voice agents or multilingual applications, this is a production-ready, open-source TTS model that can be deployed on cloud infrastructure with vLLM, enabling low-latency speech synthesis without relying on proprietary APIs.
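To put the quoted RTF of 0.13 in concrete terms, here is a minimal arithmetic sketch. It assumes the usual definition of real-time factor (generation time divided by the duration of the audio produced). The function names and the 10-second clip length are illustrative and not part of VoxCPM or vLLM-Omni.

```python
def real_time_factor(synthesis_seconds: float, audio_seconds: float) -> float:
    """RTF = time spent generating / duration of the generated audio."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


def synthesis_time(rtf: float, audio_seconds: float) -> float:
    """Estimated wall-clock time to generate a clip at a given RTF."""
    return rtf * audio_seconds


QUOTED_RTF = 0.13  # figure quoted in the summary (with vLLM-Omni)
CLIP_SECONDS = 10.0  # hypothetical clip length, chosen for illustration

estimate = synthesis_time(QUOTED_RTF, CLIP_SECONDS)
print(f"{estimate:.2f} s to generate {CLIP_SECONDS:.0f} s of audio")
print(f"{1 / QUOTED_RTF:.1f}x faster than real time")
```

An RTF below 1.0 means audio is produced faster than it plays back, which is the precondition for streaming. RTF is a throughput measure, so it does not by itself say how long the first audio chunk takes to arrive, and actual figures will depend on hardware and load.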
OpenBMB