The voice AI landscape, mapped.
Browse playable providers alongside directory-only options. vozses highlights what you can test today and what is coming next.
Search providers or filter by tags to see who is playable today.
Azure TTS
Enterprise-grade neural voices with broad language coverage.
ElevenLabs
High-fidelity voices with expressive controls and cloning.
Cartesia
Low-latency voice models built for real-time agents.
Amazon Polly
Reliable cloud TTS with SSML support and AWS integration.
Google Cloud TTS
Neural voices with broad language coverage and tooling.
Coqui TTS
Open-source TTS with self-hosted flexibility.
OpenAI (Speech)
Speech generation and transcription in one API suite.
Google Cloud TTS
Neural voices with strong language coverage and tooling.
Play.ht / PlayAI
Creator-friendly studio with a wide voice catalog.
WellSaid Labs
Studio-quality enterprise voices and collaboration tools.
Resemble AI
Secure voice cloning with real-time APIs.
Lovo
Creator-focused voiceover platform and TTS API.
Speechify Studio
Browser-based dubbing and voice production workflow.
Hume AI
Expressive voices with emotion-aware controls.
Descript
Editing-first audio workflow with Overdub voices.
Murf AI
Team voice production with a large voice library.
Coqui TTS
Open-source TTS for self-hosting and full control.
Qwen3-TTS
Open TTS models you can self-host for standardized demos.
VibeVoice (1.5B)
Lightweight open voice model aimed at fast inference.
IndexTTS2
High-quality open model with a restricted use license.
Fish Speech (OpenAudio)
Strong open TTS direction, but weights are non-commercial.
Chatterbox
Developer-friendly open TTS stack for self-hosted baselines.
PocketTTS
Small-footprint TTS for lightweight and low-resource setups.
Echo-TTS
Open model commonly used for research and quick demos.