Whisper
Repository: openai/whisper
Description: Robust speech recognition via large-scale weak supervision, providing high-accuracy transcription across many languages.
Key Features:
- Multilingual: Supports 99 languages for transcription and translation.
- Robust Performance: Trained on 680,000 hours of web-scale supervised data.
- Voice Activity Detection: Trained to recognize segments that contain no speech, and transcription holds up well under noise and background sounds.
- Translation: Can translate speech from various languages into English text.
Primary Use Cases: Transcription, subtitle generation, voice-controlled interfaces.
Tags: #voice-ai #speech-to-text #multilingual #openai
Added: 2026-06-18
Source: GitHub
Notes / Why Notable
Whisper is one of the most widely used open-source speech-to-text models. It often serves as the backend for transcription services and for AI agents that need voice input.
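
To make the transcription, translation, and subtitle use cases listed above concrete, here is a minimal Python sketch. It assumes the openai-whisper package is installed and ffmpeg is available on the PATH. The helper names (format_timestamp, segments_to_srt, transcribe_to_srt) and the choice of the "base" model are illustrative assumptions, not part of the repository. Only whisper.load_model and model.transcribe are the library's own calls.

```python
def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def segments_to_srt(segments) -> str:
    """Turn segments (dicts with 'start', 'end', 'text') into SRT text.

    Hypothetical helper: it expects the per-segment fields found in the
    'segments' list of a Whisper transcription result.
    """
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"])
        end = format_timestamp(seg["end"])
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    return "\n".join(blocks)


def transcribe_to_srt(audio_path: str, model_name: str = "base",
                      translate: bool = False) -> str:
    """Transcribe (or translate into English) an audio file and return SRT.

    Hypothetical wrapper; the model size "base" is an arbitrary default.
    """
    # Imported here so the pure helpers above work without the package.
    import whisper  # pip install openai-whisper

    model = whisper.load_model(model_name)
    # task="translate" asks for English output; "transcribe" keeps the
    # spoken language.
    task = "translate" if translate else "transcribe"
    result = model.transcribe(audio_path, task=task)
    return segments_to_srt(result["segments"])
```

With a local file, a call such as transcribe_to_srt("talk.mp3", translate=True) would return English subtitles as a string ready to write to a .srt file. The file name here is a placeholder.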