For high-accuracy transcription of large amounts of audio and video, Vocapia has a powerful option. Its VoxSigma software suite uses AI-based machine learning for speech recognition in many languages, including audio segmentation, speaker identification and language identification. It can transcribe in batches or in real time, and it's got a REST API for high scalability and reliability. It's geared for professional customers in broadcast monitoring, media asset management and speech analytics.
SpeechText also offers a high-accuracy transcription service, using deep neural networks to convert audio and video into text. It supports more than 30 languages and offers domain-specific models, an audio search engine and a variety of editing tools. With several pricing tiers and API integration, SpeechText could be a good option for journalism, medicine and business.
If you need something that can handle a lot of file formats and languages, TranscribeMe offers fast, accurate and relatively inexpensive transcription services. It can handle multilingual options and offers different types of transcription, including automated and human-edited transcripts. TranscribeMe is designed to handle large projects with guaranteed accuracy, so it's a good option if you need to transcribe lots of audio.