Free Google: Chirp 3
Chirp 3 is Google's latest multilingual speech-to-text model. It offers enhanced transcription accuracy across 24 GA languages and 77+ preview languages, with support for automatic language detection, automatic punctuation, and a built-in denoiser for cleaner audio processing.
Use via API
curl -X POST https://api.rewind.ai/v1/stt/ \
-H "Authorization: Bearer sk-rewind-..." \
-H "Content-Type: application/json" \
-d '{"model":"google/chirp-3","audio_url":"https://..."}'API Documentation Get API Key