Audio & Voice
Tortoise TTS
Advanced text-to-speech with multi-voice capabilities and natural prosody for professional use.
Tortoise TTS is a cutting-edge speech synthesis model known for its realistic intonation and multi-voice support. Ideal for developers and enterprises, it powers applications requiring high-quality voice generation. Features include customizable voice styles and seamless integration with AI platforms.
Kimi-Audio: A universal open-source audio foundation model handling ASR, AQA, AAC & more. Pre-trained on 13M hours for SOTA performance. Features hybrid architecture & low-latency inference.