Finding a more accurate and more affordable solution. Achieving a higher level of accuracy compared to Google speech-to-text solution and finding a new speech recognition solution, with a high degree of scalability, able to handle large volumes of audio data. Finding a more affordable solution, as the other solutions they had tried were expensive and with questionable results.
mediaTrust main objectives:
Create highly accurate transcripts with the highest possible accuracy rate and monitor brand/ product mentions as effectively as possible.
Be able to always deliver the most accurate transcription to the final customers.
mediaTrust wanted to be able to transcribe all the company names, product names, brand names, terminology, and specific words or mentions, in order for their final customers to analyze when the conversation is about a specific product, brand, person or company.
mediaTRUST tested the infrastructure’s accuracy and scalability by performing extensive load testing and simultaneously sending thousands of files for transcription. All of the tests were highly successful. With Vatis Tech, they were able to train and customize a speech model using audio from mediaTrust’s own platform, which led to a dramatic accuracy increase beyond what the all-in-one ASRs could reach. mediaTrust achieved:
A speech-to-text API for the Romanian language with over 95% accuracy rate on media type of audio data, and a transcription speed of approximately 15% of the file length (1 hour of audio was transcribed in less than 10 minutes)
Improvements on specific keywords, primarily brands or products.
An integration process that was ready in less than a day, with the remaining time spent only on performing accuracy and scalability tests.
A successful implementation process, due to our API integration, which was something challenging, if not impossible, with other providers. Due to our deployment flexibility - mediaTRUST was able to install our solution in their infrastructure.
Bogdan Apostol
CEO
The other speech to text solutions mediaTrust had tried were up to 4x more expensive, and the manual transcription solutions were up to 10x more expensive compared to Vatis Tech. mediaTRUST has been able to reduce transcription costs.
1 hour of audio content was transcribed in less than 10 minutes, while manual transcription takes on average 3 hours. Vatis Tech speech to text software allowed mediaTRUST to monitor audio and video materials for its clients faster and be more efficient.
With Vatis Tech it was possible to improve certain keywords, primarily brands or products.
A high content accuracy of over 95% could be achieved.
Ability to take on higher volumes of new business, based on media monitoring powered by speech to text AI solution, and thus fulfill even more complex projects with higher customer demands.
Move to automatic transcription of all media monitoring processes.