Gladia's March 2023 Roadmap for its Speech-to-Text API: Speaker Diarization, Word-Level Timestamps and More

Published on Jun 2, 2023
A glimpse into Gladia's roadmap for its Speech-to-Text API, starting with speaker diarization. We’re incredibly excited to be building our Audio Intelligence product in a community-led way, delivering a holistic final product adapted to the many needs and use cases brought to our attention.

Since the alpha release of Gladia’s Speech-to-Text API two weeks ago, our alpha users have sent us dozens of feature requests to make our core real-time audio transcription API even more exciting and versatile.

We heard you, and we’re happy to announce that the API is growing more robust by the minute and is now available with more capabilities, on top of its blazing speed and top-tier output quality.

Here’s what we already have in store:

Speech-to-Text (STT) Transcription

Setting a new standard for the industry, our STT API is built on OpenAI’s Whisper and can transcribe one hour of audio in 10 seconds at a 3.52% word error rate (WER). It has been tested and approved by thousands of alpha users across a range of use cases (e.g. call centers, virtual meetings, YouTube videos, podcasts).
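
For readers who want to unpack the speed and accuracy figures quoted above, here is a minimal sketch in Python. It uses the standard definition of WER (word-level edit distance divided by the number of reference words) and reads "10s/h" as 10 seconds of processing per hour of audio. The function names and sample sentences are illustrative assumptions, not part of Gladia's API or its benchmark setup.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edits needed to turn the reference words seen so far
    # into the first j hypothesis words (rolling one-row Levenshtein table).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def speedup_over_real_time(audio_seconds: float, processing_seconds: float) -> float:
    """How many times faster than playback the transcription runs."""
    return audio_seconds / processing_seconds


if __name__ == "__main__":
    # Illustrative sentences only: two substituted words out of nine.
    reference = "the quick brown fox jumps over the lazy dog"
    hypothesis = "the quick brown fox jumped over a lazy dog"
    print(f"WER: {word_error_rate(reference, hypothesis):.2%}")
    # One hour of audio (3600 s) processed in 10 s.
    print(f"Speed-up: {speedup_over_real_time(3600, 10):.0f}x real time")
```

Under this reading, a 3.52% WER corresponds to roughly 3 to 4 word errors per 100 reference words.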

Speech-to-Text Translation

Upload your file, select an output language of your choice, and enjoy the final translated transcript free of errors. Currently available in 99 languages, and counting. If your language is not supported yet, drop us a message in this Twitter thread.

Transcription from YouTube URL

Drop a video URL and enjoy a highly accurate output file (.srt or JSON) that can be used as an alternative to YouTube’s auto-captions to improve the viewer’s experience on your channel. Transcription as a subtitle file (.srt) will become available shortly too.

And here are the most anticipated new features we’re planning to release in March:

Speaker Diarization

You will now be able to automatically identify and recognize all speakers mixed in a single audio or video stream, including when multiple languages are used.
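
To give a feel for how diarized output is typically consumed, here is a minimal Python sketch that merges consecutive segments from the same speaker into speaker turns. The segment shape (`speaker`, `start`, `end`, `text`) is a hypothetical example chosen for illustration. It is not Gladia's actual response format, which this post does not specify.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    speaker: str
    start: float  # seconds
    end: float    # seconds
    text: str


def merge_turns(segments: list[dict]) -> list[Turn]:
    """Merge consecutive segments spoken by the same speaker into one turn."""
    turns: list[Turn] = []
    for seg in sorted(segments, key=lambda s: s["start"]):
        if turns and turns[-1].speaker == seg["speaker"]:
            last = turns[-1]
            last.end = max(last.end, seg["end"])
            last.text = f"{last.text} {seg['text']}"
        else:
            turns.append(Turn(seg["speaker"], seg["start"], seg["end"], seg["text"]))
    return turns


if __name__ == "__main__":
    # Hypothetical diarized segments, including a language switch.
    segments = [
        {"speaker": "A", "start": 0.0, "end": 1.2, "text": "Hello,"},
        {"speaker": "A", "start": 1.3, "end": 2.4, "text": "how are you?"},
        {"speaker": "B", "start": 2.6, "end": 3.5, "text": "Très bien, merci."},
    ]
    for turn in merge_turns(segments):
        print(f"[{turn.start:.1f}s-{turn.end:.1f}s] Speaker {turn.speaker}: {turn.text}")
```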

Word-Level Timestamps

This feature lets Gladia users produce a highly accurate JSON transcript with a timestamp for every word.
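
One common use of word-level timestamps is building subtitle cues. The Python sketch below groups words into fixed-size cues and prints them in the SubRip (.srt) layout. The word shape (`word`, `start`, `end`) and the `max_words` parameter are assumptions made for illustration, not Gladia's documented JSON schema.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as the HH:MM:SS,mmm timestamp used in .srt files."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words: list[dict], max_words: int = 7) -> str:
    """Group timestamped words into numbered subtitle cues."""
    cues = []
    for index, i in enumerate(range(0, len(words), max_words), start=1):
        chunk = words[i:i + max_words]
        start = srt_timestamp(chunk[0]["start"])   # first word opens the cue
        end = srt_timestamp(chunk[-1]["end"])      # last word closes it
        text = " ".join(w["word"] for w in chunk)
        cues.append(f"{index}\n{start} --> {end}\n{text}\n")
    return "\n".join(cues)


if __name__ == "__main__":
    # Hypothetical word-level output.
    words = [
        {"word": "hello", "start": 0.0, "end": 0.4},
        {"word": "world", "start": 0.5, "end": 0.9},
        {"word": "again", "start": 1.0, "end": 1.4},
    ]
    print(words_to_srt(words, max_words=2))
```

A production subtitle tool would more likely split cues on pauses or punctuation than on a fixed word count. The fixed count simply keeps the sketch short.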

Live-Streaming Transcription

We’re adding the ability to transcribe speech in real time, using your microphone.

We’re preparing a series of deep dives on some of these new features to showcase how our tech works behind the scenes. Stay tuned!

As always, feel free to test the API and give us your feedback on Discord. We truly love iterating with the community.
