description

Speech to Text

Transcribe audio files to text with word-level timestamps

mic

Drop audio file here

MP3, WAV, M4A, FLAC, OGG, AAC, WebM. Max 200MB.

Features

schedule
Word-level timestamps for precise alignment
translate
Multi-language detection and support
subtitles
SRT and JSON output formats
speed
Multiple Whisper model variants for speed vs accuracy