whisper-1

Common Name: Whisper

Released on Feb 17, 2025 12:00 AM

General-purpose speech recognition model. Transcribes and translates audio to text in multiple languages.

Context: 128K

Input: audio

Output: text

Input Audio: $0.0066/minute
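As a rough illustration of the per-minute rate above, the following sketch estimates the cost of transcribing a clip of a given length. The function name `estimate_cost` is hypothetical, and the pro-rata per-second billing and four-decimal rounding are assumptions; the listing does not state how partial minutes are billed.

```python
from decimal import Decimal, ROUND_HALF_UP

# Listed input-audio rate for whisper-1, in USD per minute of audio.
PRICE_PER_MINUTE = Decimal("0.0066")


def estimate_cost(duration_seconds: float) -> Decimal:
    """Estimate the USD cost of transcribing `duration_seconds` of audio.

    Assumption: usage is billed pro rata by the second. The actual
    billing granularity is not stated in the listing.
    """
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    # Convert via str() so float inputs do not carry binary rounding noise.
    minutes = Decimal(str(duration_seconds)) / Decimal(60)
    cost = minutes * PRICE_PER_MINUTE
    # Round to four decimal places for display (an arbitrary choice).
    return cost.quantize(Decimal("0.0001"), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    for seconds in (90, 600, 3600):
        print(f"{seconds:>5} s -> ${estimate_cost(seconds)}")
```

Under these assumptions a one-hour recording comes to roughly $0.40; actual charges may differ if the provider rounds durations up.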

$0.165 / $0.66 per M

ctx 128K, max 16K, avail n/a, tps n/a

A cost-efficient audio-capable model that accepts text, audio, and image inputs and can generate text and audio outputs.

$2.75 / $11.00 per M

ctx 128K, max 16K, avail n/a, tps n/a

GPT-4o with native audio input and output capabilities for real-time speech-to-speech conversations.
