Blog

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Product News

What is summarization?

Summarization in speech-to-text (STT) AI is a popular feature that streamlines the extraction of essential information from spoken content. By condensing lengthy audio recordings or live conversations into concise summaries, STT summarization enhances user experience, facilitating quicker understanding and decision-making for the final users.

Case Studies

Opening up new markets for a sales meeting and CRM enrichment platform: Spoke's success story with Gladia

In the past, sales teams around the world were presented with a twofold challenge. In addition to showcasing their products in the best light to prospects, they needed to take detailed notes during the call and fill their CRM software manually afterward.

Product News

A new open-source developer app for AI translation, dubbing and lip synching to try

Text-to-speech, voice cloning, and visual dubbing are some of the hottest trends in AI at the moment. Used in tandem with AI transcription and translation, they make it possible to generate hyper-realistic voiceovers, indistinguishable from the sound of the speaker’s natural voice and speech patterns — including in entirely new languages.

Speech-To-Text

What is OpenAI Whisper?

OpenAI Whisper is a cutting-edge Automatic Speech Recognition (ASR) system designed to transcribe spoken language into written text, leveraging deep learning techniques. Released in September 2022, this neural net has by now become a legendary tool in natural language processing, offering unparalleled accuracy and versatility and giving rise to numerous open-source and commercial applications.

Speech-To-Text

Top 5 Whisper GitHub projects: A practical guide for programmers

In September 2022, OpenAI unveiled Whisper, an innovative open-source automatic speech recognition (ASR) model trained on an impressive dataset of 680,000 hours of diverse speech. Since its release, the model has received widespread recognition for its remarkable robustness and accuracy. It rivaled human capabilities in English speech recognition and set a new standard for multilingual transcription and translation.

Tutorials

How to set up a Node.js transcription WebSocket with the Gladia live audio transcription API: A step-by-step guide

Have you ever used an audio-to-text transcription application to convert audio-to-text and wondered how it worked or how to build one? Are you a developer looking to add audio transcription to your next project? This article answers these questions and more.