powered by
Process audio longer than 30 seconds in chunks.
transcribe_long( file, model, tokenizer, config, language, task, device, dtype, verbose )
Combined transcription result
Audio file
WhisperModel
Tokenizer
Model config
Language
Task
Device
Dtype
Verbose