If you want to build your own voices and fine-tune language models for automated calls, Elto is another good option. It offers photorealistic voices with customization, fine-tuned language models that adapt to new call flows, and other features like Human Handoff, Knowledge Bases, and Custom Voices. Elto also offers a highly scalable solution with low latency and supports integration with REST and GraphQL APIs.
Another good option is ElevenLabs, which offers high-quality, realistic voices in 29 languages and more than 120 voices. It offers voice cloning, fine-tuning and long-form voice generation. The service includes a free plan with 3 custom voices and speech in 29 languages, so it's good for content creators and businesses trying to improve their audio.
PlayHT is another option. It's got a library of more than 600 ultra-realistic AI voices and real-time voice cloning, custom pronunciations and voice inflections. PlayHT is flexible, with a broad range of use cases including video voiceovers, audio publishing and conversational AI. It offers several pricing tiers, including a free option.
If you're looking for something more specialized, Resemble offers hyper-realistic AI voices with features like fast voice cloning, speech-to-speech and multilingual support. It's geared for customer service, gaming and entertainment, and offers flexible integration. The platform also has security features like watermarked audio and deepfake audio detection.