gemini-1.5-flash-8b

Common Name: Gemini 1.5 Flash 8B

Google
-10%On SaleReleased on Aug 27, 2024 12:00 AMKnowledge Cutoff Apr 1, 2024 12:00 AMSupportedTool Invocation
CompareTry in Chat

Google's smallest Gemini model optimized for speed and cost efficiency with multimodal support.

Specifications

Context
1048.6K
Maximum Output
8.2K
Inputtext, image, audio, video
Outputtext

Performance (7-day Average)

Collecting…
Collecting…
Collecting…

Pricing

Standard
128K Tier
Input/MTokens
$0.03
$0.07
Output/MTokens
$0.14
$0.27
Input Audio/MTokens
$0.03
$0.07

Availability Trend (24h)

Performance Metrics (24h)

Similar Models

$0.07/$0.27/M
ctx1.0Mmax8Kavailtps
InOutCap

A lightweight and fast version of Gemini 2.0 Flash optimized for cost-effective multimodal tasks with lower latency.

$0.07/$0.27/M
ctx1.0Mmax8Kavailtps
InOutCap

Google's most cost-efficient multimodal model with 1M token context, designed for high-volume applications requiring speed and affordability.

$0.07/$0.27/M
ctx1.0Mmax8Kavailtps
InOutCap

Google's fast, cost-efficient multimodal model with 1M token context for high-volume tasks.

$0.07/$0.27/M
ctx1.0Mmax8Kavailtps
InOutCap

Snapshot of Gemini 1.5 Flash with 1M token context for fast multimodal understanding.