Speech to Text Software.

Plans
  • Custom
Platforms
Social Links

Description

Broadcast monitoring & audio visual archive indexing:  The VoxSigma speech-to-text software suite offers advanced language technologies including speech recognition, language identification and speaker diarization to transform raw audio data into structured and searchable XML documents, enabling users to access content in video documents.

Transcription of business conference calls:  Vocapia's speech recognition software significantly reduces the cost of transcribing business conference calls. The audio document is converted to a fully annotated XML document including speech and non speech segments, speaker labels, words with time codes, high quality confidence scores, as well as punctuation. Vocapia offers services to adapt, tune or create specific models or systems tailored to exactly match the application needs. [MORE]

Debate and lecture transcription and indexing:  VoxSigma helps reduce the production time and cost to produce transcripts, minutes and/or summaries of public presentations and meetings. VoxSigma also aligns existing transcriptions with audio files, thus significantly enhancing usability. This same speech-text alignment technology is used for audiobooks. [MORE]

Video Subtitling:  While fully automatic processing generally does not deliver high enough quality subtitles, Vocapia's speaker diarization, speech to text transcription and speech-text alignment technologies significantly reduce the effort entailed when closely integrated in the subtitle creation process. [MORE]

Telephone Speech Analytics:  Vocapia's speech recognition software and language identification software process telephone data making the recorded calls searchable and analyzable via text-based methods. VoxSigma is used by call management companies and for defense applications. The transcripts are further analyzed and categorized, generating statistics about customer calls. Large vocabulary continuous speech recognition is a key technology for automatic, comprehensive analysis of recorded calls. [MORE]

Avionics:  In aircraft cockpits, speech recognition software can be used to improve command and control and allow analysis of radio communications to assist pilots. We provide real-time solutions for low power embedded systems.

Rating and Reviews

Add Rating and Review
0
(0 Reviews)
5  
0%
4  
0%
3  
0%
2  
0%
1  
0%

Questions & Answers

Can automatic speech recognition be used to transcribe unrestricted broadcast data?

Yes, but the speech recognition accuracy varies greatly depending upon a large number of factors, including the type of speech (from prepared to spontaneous speech and conversational speech) and the noise level. So you can expect very good results when transcribing the speech of an anchor speaker in a TV or radio news show, but much less good results for the speech of someone engaged in a very casual conversation.

Oct. 6, 2023


Can automatic transcriptions be used the same way I process text?

Yes, the output of the VoxSigma software is an XML file that can be easily converted into plain punctuated text by discarding additional information such as word time-codes and word confidence scores.

Oct. 6, 2023


How long it take to develop an ASR for a specific language?

It depends greatly on the available language resources for the specific language. It also depends on the type of speech data you want to process. We are supporting many languages, including Arabic, Cantonese, Czech, Dutch, English, Finnish, French, German, Greek, Hebrew, Hindi, Hungarian, Italian, Latvian, Lithuanian, Mandarin, Pashto, Persian, Polish, Portuguese, Romanian, Russian, Spanish, Swahili, Swedish, Turkish, Ukrainian and Urdu. Contact us to get a more precise answer for the languages you are interested in.

Oct. 6, 2023


Do I need to configure the system vocabulary or grammar?

Vocapia Research LVCSR systems come with fully trained language models, so the only information you have to provide to the system is the language being spoken. If the language is not known, the language can be identified automatically (among 20 known languages) by using the VoxSigma language recognition software. A language identification system identifies the language being spoken from the speech signal.

Oct. 6, 2023