Audio & Voice
Whisper AI
Open-source speech recognition for multilingual transcription and translation with AI.
Whisper AI by OpenAI is a robust ASR system supporting 99 languages with accent resilience. Ideal for researchers and app developers, it enables accurate transcriptions and translations from audio/video content.
ACE-Step is an open-source music generation model combining speed, coherence, and control, generating 4-minute tracks in 20 seconds with fine-grained detail.