If you need an API for real-time transcription of live audio, Rev AI is another top choice. It offers real-time transcription in 9 languages and supports multiple languages for asynchronous transcription. The service is geared toward media and entertainment, education and call centers, with options for sentiment analysis, topic extraction and summarization. Pricing is pay-as-you-go, with costs starting at $0.02 per minute for machine transcription.
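To get a feel for what pay-as-you-go pricing means in practice, here is a minimal sketch that estimates spend at the $0.02 per minute starting rate quoted above. It assumes a flat per-minute rate with no minimum charge, volume discount or per-request rounding, which may not match how any provider actually bills; `estimate_cost` and `RATE_PER_MINUTE` are illustrative names, not part of any vendor's API.

```python
from decimal import Decimal

# Starting machine-transcription rate quoted above, in USD per minute.
# Assumption: a flat rate with no minimums, discounts or rounding rules.
RATE_PER_MINUTE = Decimal("0.02")


def estimate_cost(minutes, rate_per_minute=RATE_PER_MINUTE):
    """Return the estimated cost in USD for a number of audio minutes."""
    if minutes < 0:
        raise ValueError("minutes must be non-negative")
    # Decimal avoids binary floating-point error in currency arithmetic.
    cost = Decimal(str(minutes)) * rate_per_minute
    return cost.quantize(Decimal("0.01"))


if __name__ == "__main__":
    # Rough monthly volumes: 1 hour, 10 hours and 100 hours of audio.
    for minutes in (60, 600, 6000):
        print(f"{minutes:>5} min -> ${estimate_cost(minutes)}")
```

Swapping in a different rate via the `rate_per_minute` argument makes it easy to compare providers once you have their published per-minute prices.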
Another top contender is Gladia, which promises high accuracy and multilingual speech-to-text translation in 99 languages. Its features include speaker diarization, code-switching support and word-level timestamps. Gladia suits content and media, virtual meetings and call centers, and its pricing tiers include a free plan and enterprise plans.
Deepgram has a range of APIs, including speech-to-text and text-to-speech. It supports multiple languages and offers detailed transcription data that's useful for speech analytics and media transcription. Its low-latency text-to-speech API is good for building voicebots and customer service apps.
If you're on a tighter budget, Trint offers AI-powered transcription with up to 99% accuracy in more than 40 languages. It has real-time collaboration tools and supports translation into 50+ languages, so it's a good fit for content creators, researchers and businesses. Trint also offers live transcription through its mobile apps, which makes it easier to fit into an existing workflow.