We evaluated Gladia Solaria against 8 leading providers across 7 datasets and 74 hours of audio. The full methodology is open-sourced so results can be independently reproduced.
Lower WER is better. Filter by dataset to focus on what matters to you.
Each audio file was sent to every provider's production API using default settings. No custom model tuning or prompt engineering was applied. All providers were tested on identical audio files.
Transcription outputs were normalized using the OpenAI Whisper text normalizer before WER computation. Diarization Error Rate (DER) is measured on the DIHARD III challenge datasets using standard protocols.
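As a rough illustration of the normalize-then-score step described above, here is a minimal sketch of a WER calculation. WER is the word-level edit distance (substitutions, deletions, and insertions) divided by the number of words in the reference. The `normalize` function below is a simplified, hypothetical stand-in (lowercasing and punctuation stripping only); it is not the OpenAI Whisper text normalizer used in the benchmark, and the function names are illustrative assumptions rather than part of the published framework.

```python
import re


def normalize(text: str) -> list[str]:
    """Simplified stand-in normalizer (an assumption, NOT the Whisper normalizer):
    lowercase, replace punctuation with spaces, split on whitespace."""
    text = text.lower()
    text = re.sub(r"[^\w\s']", " ", text)
    return text.split()


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference word count."""
    ref = normalize(reference)
    hyp = normalize(hypothesis)
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion (reference word missing)
                curr[j - 1] + 1,                  # insertion (extra hypothesis word)
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    print(word_error_rate("The quick brown fox.", "the quick fox"))
```

Because normalization runs before scoring, differences in casing and punctuation do not count as errors; only word-level substitutions, deletions, and insertions do. A dropped word in a four-word reference, as in the usage line above, yields a WER of 0.25.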
The full benchmarking framework is open-sourced to enable transparent, reproducible evaluation of speech recognition systems.