Skip to content

Whisper

Repository: openai/whisperDescription: Robust speech recognition via large-scale weak supervision, providing high-accuracy transcription across many languages. Key Features:

  • Multilingual: Supports 99 languages for transcription and translation.
  • Robust Performance: Trained on 680,000 hours of web-scale supervised data.
  • Voice Activity Detection: Handles noise and background sounds effectively.
  • Translation: Can translate speech from various languages into English text. Primary Use Cases: Transcription, subtitle generation, voice-controlled interfaces. Tags: #voice-ai #speech-to-text #multilingual #openai Added: 2026-06-18 Source: GitHub

Notes / Why Notable

Whisper is the industry standard for open-source speech-to-text, often used as the backend for countless transcription services and AI agents requiring voice input capabilities.

Maintained with Yeda — Karpathy LLM Wiki paradigm.