Blog

Product News

Introducing Whisper-Zero

Today, we're thrilled to release a new breakthrough ASR system, Whisper-Zero: a complete rework of Whisper combined with multiple state-of-the-art models, using over 1.5 million hours of diverse audio, including phone-quality and noisy data from real-life environments.

Tutorials

How to build a Google Meet Bot for recording and video transcription

Tools like Google Meet have revolutionized how we connect and conduct meetings remotely. However, it can be very challenging to keep track of all action items and key insights shared during long meetings.

Speech-To-Text

Automatic speaker recognition (ASR): identification, verification and diarization

Because of individual differences in physical attributes such as vocal tract shape, every person has a distinct voice pattern. Automatic speaker recognition (ASR) harnesses this uniqueness to identify and analyze speakers by extracting voice features such as pitch and frequency.

Tutorials

Building a Whisper YouTube transcription generator for automated captioning

With over 500 hours of video uploaded to YouTube every minute, providing accurate captions and transcripts is essential for creators to make their content engaging and accessible. However, manually transcribing long videos is tedious and time-consuming.

Tutorials

How to summarize audio using Whisper ASR and GPT 3.5

From online meetings to voice memos and media content, the amount of audio data generated by companies daily is as vast as it is valuable.

Speech-To-Text

Best network architecture for speech recognition software

Building high-quality speech recognition software for your business has never been easier. But you need the right infrastructure to make the most of AI transcription at enterprise scale.