Powering virtual meetings with Language AI: Claap's success story with Gladia's Speech-to-Text

A case study showcasing the benefits of Gladia's AI API for Claap, an all-in-one video workspace that implemented our solution to provide its international users with advanced video transcription capabilities.
Meet Claap ✨
Claap is a leading all-in-one video workspace that helps organizations speed up decision-making by reducing back-to-back virtual meetings and endless messaging threads. Claap unifies meeting recording, screen recording, and a video wiki into a single collaborative workspace, unlocking a new level of alignment, transparency, and efficiency.
Founded in France in 2021, the company serves international customers like Revolut, Kavak, and Qonto, and helps them solve their remote collaboration challenges.
• 14 employees
• 5,000+ customer workspaces
• Global user base: French, English (US/UK), Spanish, German, Dutch, Danish, Chinese, Hindi, Portuguese, Italian and other languages in use

Claap’s team knew that in order to realize their vision, they needed to make video content a truly intuitive medium of information. This, in turn, meant that they had to make video content easy to summarize, categorize, search, and digest. A high-quality transcription tool was therefore crucial to their roadmap.
Challenge
Claap’s Co-founder and CTO, Thomas Hernandez, needed a highly accurate, fast, and easy-to-implement multilingual solution both for core transcription and for video intelligence add-ons.
While the Claap team considered building the solution in-house using a layer of add-ons on top of an open-source product, they felt that transcription was becoming enough of a commodity that a specialized API provider would be a better fit, thanks to reduced time-to-market and lower infrastructure costs.
The issue they encountered, however, was that the incumbent providers did not meet the near-perfect quality they were looking for, as they were primarily US-centric, with lower quality for non-native English speakers. They were also either too slow, or prohibitively expensive.
Objectives
To deploy a high-quality, scalable video transcription API and audio intelligence add-ons for all Claap's customers all around the globe.
Specifications
✔️ A high-quality transcription at a scalable cost so it could be deployed in every user space.
✔️ A solution adapted to speakers from multiple geographies, with different languages and regional accents.
✔️ A transcription technology that is fast enough to add a valuable layer of insight to user videos almost immediately.
Solution
Enter Gladia! 🚀 With Gladia, the Claap team was able to implement:
- A highly accurate transcription solution registering a Word Error Rate between 1% and 3%.
- Transcription at blazing speed, with one hour of video transcribed on average in under 60s.
- A truly multilingual approach to transcription, with core features and add-on all available in over 90 languages.
- A quick implementation and iteration cycle, with fixes and features released every week, as well as weekly client support.
- Audio intelligence add-ons such as diarization, word-level timestamp, translations, and other features, enabling Claap to make content more accessible to their international users.
Thanks to our API, Claap was able to unlock the following AI-powered functionalities for its users:
- Synced playback — follow along with a video transcript while you listen to the recording
- Speaker detection — automatically detect multiple speakers to jump in the right moment
- Search within video — search the full text transcript and find the exact moment you’re looking for
- Add comments while watching transcript — easily add comments related to a specific time stamp.

Impact
After deciding to add audio transcription to their roadmap, Claap started testing the API immediately. By working with the Gladia team to iterate and scale up, they saw a noticeable impact on their own customers, from users praising the quality of the transcription to prospects converting specifically after trying it out.

In a nutshell, Claap's case study illustrates the many possibilities unlocked by audio AI for businesses, aiming to improve user experience, boost retention and upgrade its core product functionalities - all with a single turnkey API.
We're thrilled to have been part of this amazing journey, and we owe a huge thanks to our client for putting their trust in us. As we move forward, we're excited to team up with more clients, tackle new challenges, and make AI more accessible to companies worldwide.
💌 Having read this case study, do you feel like Gladia can be the right fit for your business too?
Don't hesitate to contact our sales team to discuss this in more detail. Beyond virtual meetings, we cater to a range of use cases, including content & media, call centers, workspace collaboration, and more.
Contact us
Read more

Speech-To-Text
How to integrate live transcription API with Twilio to transcribe calls in real time.
Twilio, used by hundreds of thousands of businesses and more than ten million developers worldwide, can now integrate with our live transcription API. The integration makes it easier for users to natively transcribe any phone call in real time while using Twilio. With transcribed text at your disposal, you'll then be able to analyze, archive, and act upon voice data more effectively.

Speech-To-Text
Best speech-to-text APIs in 2023
Speech-to-text (STT), also known as automatic speech or voice recognition, is a type of AI technology that recognizes human speech in audio or video and transcribes it into written output. In the form of an API, it can power a variety of applications, ranging from call bots to voice assistants to AI-powered virtual meeting platforms.

Speech-To-Text
How to build a voice-to-text Discord both with Gladia real-time transcription API
Discord, the leading communication platform for gamers and communities, is designed for seamless communication with other users, be it through text channels, DMs, 1-1 calls or even collective voice channels.