If you're looking for another Gladia alternative, AssemblyAI offers a wide range of AI models for speech-to-text transcription, speaker identification, sentiment analysis and other tasks. It supports more than 99 languages and provides integration tools, with a free tier for testing and pay-as-you-go pricing for production. The service is geared toward companies building their own AI products and backs its data security with GDPR, PCI-DSS and SOC 2 Type 1 and Type 2 compliance.
Another option is Deepgram, which offers speech-to-text and text-to-speech APIs with audio intelligence features. It supports multiple languages and returns detailed transcription data that is well suited to speech analytics, media transcription and contact centers. Deepgram also provides a free API playground and flexible pricing options, including a $200 credit to get started.
For customers who need high accuracy and support for many languages, SpeechText offers advanced deep neural network models for transcription. It includes automatic punctuation and domain-specific models, supports more than 30 languages and can handle non-native speaker accents. SpeechText protects data with GDPR compliance and encryption, and offers a variety of pricing tiers depending on your needs.
Finally, Byrdhouse is a full-featured solution for real-time voice and caption translation across more than 100 languages. It includes voice-to-text transcription, auto-language detection and profanity detection, which makes it a good fit for improving communication in multicultural teams and global businesses. Byrdhouse has flexible pricing tiers, including a free tier for real-time translation, and provides A-to-Z technical support.