Gemini 2.5 Pro

Google’s most capable multimodal reasoning model to date. Supports ultra-long contexts and advanced audio-visual understanding.

Core Capabilities
  • Advanced multimodal reasoning
  • Ultra-long context (up to ~1M input tokens)
  • Audio and video understanding
  • State-of-the-art coding
Ideal Use Cases
  • Complex video analysis
  • Advanced R&D projects
  • Multi-document analysis
Technical Profile

Context Window

1,048,576 input tokens / 65,536 output tokens
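
As a rough illustration of how these limits might be enforced before sending a request, here is a minimal sketch. The function name `fits_limits` is hypothetical, and it assumes the input and output limits apply independently and that token counts come from a tokenizer not shown here.

```python
# Limits taken from the Context Window figures above.
MAX_INPUT_TOKENS = 1_048_576
MAX_OUTPUT_TOKENS = 65_536


def fits_limits(input_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within both token limits.

    Assumption: the input and output limits are checked independently.
    """
    return (
        0 <= input_tokens <= MAX_INPUT_TOKENS
        and 0 <= requested_output_tokens <= MAX_OUTPUT_TOKENS
    )


print(fits_limits(900_000, 8_192))    # within both limits
print(fits_limits(1_200_000, 8_192))  # input exceeds the limit
```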

Media Support

text, code, images, audio, video

Pricing

Input

$1.25 / 1M tokens (≤200k-token context); $2.50 / 1M tokens (>200k-token context)

Output

$10.00 / 1M tokens (≤200k-token context); $15.00 / 1M tokens (>200k-token context)
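
To make the tiered rates concrete, the sketch below estimates the cost of a single request. It is illustrative only: `estimate_cost` is a hypothetical helper, and it assumes the tier is chosen by the input (prompt) size alone and then applies to both input and output tokens.

```python
# Rates in USD per 1M tokens, taken from the pricing listed above.
TIER_THRESHOLD = 200_000  # context size, in tokens, above which the higher rates apply

INPUT_RATE = {"standard": 1.25, "long": 2.50}
OUTPUT_RATE = {"standard": 10.00, "long": 15.00}


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request.

    Assumption: the tier is selected by the input size alone, and the
    selected tier's rates apply to both input and output tokens.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    tier = "standard" if input_tokens <= TIER_THRESHOLD else "long"
    total = input_tokens * INPUT_RATE[tier] + output_tokens * OUTPUT_RATE[tier]
    return total / 1_000_000


print(f"${estimate_cost(100_000, 2_000):.4f}")  # $0.1450
print(f"${estimate_cost(300_000, 2_000):.4f}")  # $0.7800
```

Under these assumptions, crossing the 200k threshold doubles the input rate and raises the output rate by half, so splitting very long prompts into smaller requests can change the total noticeably.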