- file
Path to the audio file to transcribe.
- model
Name of the transcription model. For the API backend, the value
is passed through unchanged (e.g., "whisper-1"). For the whisper backend,
it is the model size (e.g., "tiny", "base", "small", "medium", "large").
If NULL, the backend's default model is used.
- language
Optional language code (e.g., "en", "es", "fr"), used as a hint
to improve transcription accuracy.
- response_format
Response format for the API backend. One of "text",
"json", or "verbose_json". Ignored by the whisper backend.
- backend
Backend to use: "auto" (default), "whisper", or "openai".
In "auto" mode, whisper is tried first, then the OpenAI API
(if configured).
- prompt
Optional text to guide the transcription. For the API backend,
this is passed as initial_prompt to help with the spelling of names,
acronyms, or domain-specific terms. Ignored by the whisper backend.
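
The "auto" fallback order described under `backend` can be sketched roughly as follows. This is a minimal illustration, not the package's actual code: `resolve_backend`, `whisper_available`, and `openai_configured` are hypothetical names, and raising an error when neither backend is usable is an assumption.

```r
# Hypothetical helper illustrating the documented "auto" order:
# whisper first, then the OpenAI API if it is configured.
# None of these names come from the package itself.
resolve_backend <- function(backend = c("auto", "whisper", "openai"),
                            whisper_available = FALSE,
                            openai_configured = FALSE) {
  # Validate the choice; with no argument this defaults to "auto".
  backend <- match.arg(backend)

  # An explicit backend is returned unchanged.
  if (backend != "auto") {
    return(backend)
  }

  # "auto": prefer local whisper, then fall back to the API.
  if (whisper_available) {
    return("whisper")
  }
  if (openai_configured) {
    return("openai")
  }

  # Assumption: with nothing usable, fail loudly rather than guess.
  stop("No transcription backend available.")
}

resolve_backend("auto", whisper_available = TRUE, openai_configured = TRUE)
resolve_backend("auto", openai_configured = TRUE)
resolve_backend("openai")
```

The first call selects "whisper" because it is available, the second falls back to "openai", and the third returns the explicitly requested backend without checking availability.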