Blog

Technical guides, customer stories, and product updates
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Speech-To-Text

What is speech-to-text & how does it work?

Speech-to-text (STT), also known as Automatic Speech Recognition (ASR), is an AI technology that transcribes spoken language into written text. Previously reserved for the privileged few, STT is becoming increasingly leveraged by companies worldwide to embed new audio features in existing apps and create smart assistants for a range of use cases.

Speech-To-Text

How to build a Google Meet transcription bot with Python, React and Gladia API

In today's fast-paced world, effective communication and collaboration are essential. Tools like Google Meet have revolutionized how we connect and conduct meetings remotely. However, it can be very challenging to keep track of all action items and key insights shared during long meetings.

Speech-To-Text

How much does it really cost to host Whisper AI transcription?

Open-source ASR models are often presented as the most cost-effective solution to embedding Language AI into your applications. But is that always the case? Here's our take.

Speech-To-Text

Thinking of using open-source Whisper ASR? Here are the main factors to consider

Perhaps you’re a developer looking for an Automatic Speech Recognition (ASR) solution for the first time. Or an executive looking for more affordable, faster, more accurate alternatives to the mainstream speech-to-text solutions for your business. Where do you turn to?

Speech-To-Text

Here’s how to pick the right speech-to-text provider for your Speech AI journey

Until recently, AI speech-to-text has been reserved for the happy few. But commodification is on its way. As prices dropped while the accuracy and speed of transcription increased, there has been an explosion of speech-to-text providers catering to a broader range of companies and use cases. In this article, we give you a bird's-eye view of the market and introduce you to the speed-accuracy-cost tradeoff in audio transcription to help you pick the best Automatic Speech Recognition (ASR) provider for your use case and budget. 

Case Studies

Powering virtual meetings with Speech to Text AI: Claap's success story with Gladia

A case study showcasing the benefits of Gladia's AI API for Claap, an all-in-one video workspace that implemented our solution to provide its international users with advanced video transcription capabilities.

Product News

From Speech to Knowledge: Gladia’s Audio Intelligence API

Gladia is proud to announce the general availability of its groundbreaking Speech-to-Text API, previously in alpha. The revamped enterprise-grade API supports transcription, speaker diarization, word-level timestamp, code-switching, and beta translation in 99 languages.

Product News

March 2023 Roadmap its Speech-to-Text API: Speaker Diarization, Word-Level Timestamps and more

A glimpse into Gladia's roadmap for its Speech-to-Text API, starting with speaker diarization. We’re incredibly excited to be building our Audio Intelligence product in a community-led way, delivering a holistic final product adapted to the many needs and use cases brought to our attention.

Speech-To-Text

Here’s how speech-to-text AI can benefit your business today

Speech-to-text AI is entering an exciting phase and becoming a commodity. By powering Audio intelligence, products like Gladia's Audio Transcription API create value for all businesses, from collaboration platforms to content studios to media companies to call centers.