Vocapia offers a range of speech-to-text software and services for broadcast monitoring, lecture and seminar transcription, video subtitling, conference call transcription and speech analytics. The tools employ AI techniques like machine learning to achieve top performance in many languages and with a variety of audio sources.
The foundation of Vocapia's technology is the VoxSigma software suite, which offers large vocabulary speech recognition, audio segmentation, speaker identification and language recognition. It's geared for professional customers who need to transcribe lots of audio and video documents, either in batch mode or in real-time.
VoxSigma is available as a Web service using a REST API, with failover servers and geographic redundancy for high availability. The online version offers full speech transcription, audio indexing and speech-text alignment, and it can be updated daily with new language models.
The service supports Arabic, Cantonese, Czech, Dutch, English, Finnish, French, German, Greek, Hebrew, Hindi, Hungarian, Italian, Latvian, Lithuanian, Mandarin, Pashto, Persian, Polish, Portuguese, Romanian, Russian, Spanish, Swahili, Swedish, Turkish, Ukrainian and Urdu.
Vocapia charges for its SaaS service based on speech duration, excluding silences, with no minimum cost per submission. The cost is about 0.01 euro (or $0.01) per minute for generic systems and large-scale use. Free trials are available by request.
Vocapia also offers other services, including document-based adaptation, on-demand batch processing and custom models for specific needs. Customers can request these services through the website.
For customers who need high-accuracy speech recognition, Vocapia offers support through hotline services by email and phone to help customers and system integrators troubleshoot problems.
Vocapia's tools can be useful for broadcast monitoring, media asset management, speech analytics and subtitling. They can turn raw audio data into structured, searchable XML documents that make content more useful and accessible.
Published on June 12, 2024
Analyzing Vocapia...