Obiguard’s AI gateway enables speech-to-text using models such as OpenAI Whisper. It offers both Transcription and Translation options for speech-to-text (STT) models, following the OpenAI API format. You can submit audio files in formats such as flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm as part of your API request.
Example:
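The following is a minimal sketch of a transcription or translation request, not Obiguard's documented client. It assumes the gateway can be reached through an OpenAI-compatible client object, since the text says requests follow the OpenAI API format. The helper names `is_supported_audio` and `speech_to_text`, the base URL, and the API key shown in the comments are hypothetical placeholders.

```python
from pathlib import Path

# Audio formats listed in the text above.
SUPPORTED_FORMATS = {"flac", "mp3", "mp4", "mpeg", "mpga", "m4a", "ogg", "wav", "webm"}


def is_supported_audio(path: str) -> bool:
    """Return True if the file extension is one of the listed formats."""
    return Path(path).suffix.lower().lstrip(".") in SUPPORTED_FORMATS


def speech_to_text(client, path: str, translate: bool = False, model: str = "whisper-1"):
    """Send an audio file for transcription, or for translation into English.

    `client` is assumed to be an OpenAI-compatible client exposing
    `audio.transcriptions.create` and `audio.translations.create`.
    """
    if not is_supported_audio(path):
        raise ValueError(f"Unsupported audio format: {path}")
    endpoint = client.audio.translations if translate else client.audio.transcriptions
    with open(path, "rb") as audio_file:
        return endpoint.create(model=model, file=audio_file)


# Usage (not run here; needs the `openai` package and valid credentials).
# The base URL and key are placeholders, not documented Obiguard values:
#
#   from openai import OpenAI
#   client = OpenAI(base_url="https://<your-gateway-host>/v1", api_key="<your-key>")
#   print(speech_to_text(client, "meeting.mp3").text)
#   print(speech_to_text(client, "interview.ogg", translate=True).text)
```

Checking the file extension before sending the request is an optional local guard; the gateway or provider may still reject a file whose contents do not match its extension.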
Provider | Model | Functions |
---|---|---|
OpenAI | whisper-1 | Transcription, Translation |