Another option is Gladia, which uses optimized Whisper ASR technology for high-accuracy AI transcription. It also has speaker diarization, multilingual support and near real-time language detection. Gladia's API is compatible with a variety of tech stacks and can be used for transcription, translation, summarization and topic classification, with detailed pricing options.
Deepgram has a range of APIs for speech-to-text, text-to-speech and audio intelligence. It supports many languages and offers detailed transcription data that's useful for speech analytics, media transcription and contact centers. Deepgram also has a free API playground and flexible pricing options, including a free $200 credit to get started.