> ## Documentation Index
> Fetch the complete documentation index at: https://veniceai-docs-revamp.mintlify.site/llms.txt
> Use this file to discover all available pages before exploring further.

# Speech-to-Text Models

> Speech recognition models for transcribing audio to text

<div id="model-search-placeholder" data-filter="asr">Loading models...</div>

***

## Usage

Speech-to-text models transcribe spoken audio into written text. They are accessed via the [Audio Transcriptions API](/api-reference/endpoint/audio/transcriptions).

### Supported audio formats

`mp3`, `mp4`, `mpeg`, `mpga`, `m4a`, `wav`, `webm`, `flac`, `ogg`

### Response formats

| Format         | Description                                               |
| -------------- | --------------------------------------------------------- |
| `json`         | Default. Returns `{ "text": "..." }`.                     |
| `text`         | Plain transcribed text.                                   |
| `srt`          | SubRip subtitle format with timestamps.                   |
| `vtt`          | WebVTT subtitle format with timestamps.                   |
| `verbose_json` | Full response with segment-level timestamps and metadata. |

<Note>
  Pricing is billed per second of input audio. See the [Audio Transcriptions API](/api-reference/endpoint/audio/transcriptions) for request examples and parameter details.
</Note>
