ElevenLabs is launching Scribe, Its own text-to-speech model

ElevenLabs is an AI audio generation, one of the leading companies and tools in the space, launching Scribe, its own text-to-speech model.

The company recently $180M of funding competing with speech models like OpenAI’s Whisper, Deepgram, and Gladia like models.

ElevenLabs Scribe model supports over 99 languages with features like word-level timestamps, audio-event tagging, multi-speaker detection, etc.

You can use this for your meeting summaries, movie subtitles, real-time sports titles, and others. The model makes the lowest errors in Italian (98.7%) and in English it’s at 96.7%.

Creators and businesses can use Scribe directly with the ElevenLabs dashboard to upload video or audio to generate formatted transcripts.

It is a useful model they have released. You can check it to see how it works.

You can check out the announcement video for more information.

Related Posts