Gemini 2.5 Pro
Google’s most capable multimodal reasoning model to date. Supports long contexts (about 1M input tokens) and advanced audio and video understanding.
Core Capabilities
- Advanced multimodal reasoning
- Ultra-long context (up to about 1M input tokens)
- Audio and video understanding
- State-of-the-art coding
Ideal Use Cases
- Complex video analysis
- Advanced R&D projects
- Multi-document analysis
Technical Profile
Context Window
1,048,576 input / 65,536 output tokens
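The limits above can be turned into a quick pre-flight check. The following is a minimal sketch, not part of any official SDK: the function name `fits_context` is hypothetical, and it assumes the input and output limits apply independently of each other, which this page does not state explicitly.

```python
# Limits copied from the Technical Profile on this page.
MAX_INPUT_TOKENS = 1_048_576
MAX_OUTPUT_TOKENS = 65_536


def fits_context(input_tokens: int, max_output_tokens: int) -> bool:
    """Return True if a request stays within both listed limits.

    Assumption: the two limits are checked independently; the page
    does not say whether output tokens count against the input window.
    """
    if input_tokens < 0 or max_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (
        input_tokens <= MAX_INPUT_TOKENS
        and max_output_tokens <= MAX_OUTPUT_TOKENS
    )
```

Token counts have to come from a tokenizer or a token-counting endpoint; this sketch only compares numbers that the caller already has.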
Media Support
Text, code, images, audio, video
Pricing
Input
$1.25 / 1M tokens (≤200k-token context), $2.50 / 1M tokens (>200k-token context)
Output
$10.00 / 1M tokens (≤200k-token context), $15.00 / 1M tokens (>200k-token context)
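To make the tiered rates concrete, here is a rough cost estimator. It is a sketch under one stated assumption: the pricing tier is selected by the number of input tokens and then applies to both the input and output rates. The page lists the 200k threshold but does not spell out which count selects the tier, so treat that rule as a guess. The function name `estimate_cost_usd` is illustrative only.

```python
from decimal import Decimal

# Rates in USD per 1M tokens, copied from the Pricing section.
TIER_THRESHOLD = 200_000
INPUT_RATE = {"short": Decimal("1.25"), "long": Decimal("2.50")}
OUTPUT_RATE = {"short": Decimal("10.00"), "long": Decimal("15.00")}
PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request in USD.

    Assumption: the tier is chosen by input size alone and applies
    to both the input and the output rate.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    tier = "long" if input_tokens > TIER_THRESHOLD else "short"
    total = input_tokens * INPUT_RATE[tier] + output_tokens * OUTPUT_RATE[tier]
    return total / PER_MILLION


if __name__ == "__main__":
    print(estimate_cost_usd(100_000, 2_000))  # 100k-token prompt, lower tier
    print(estimate_cost_usd(300_000, 2_000))  # 300k-token prompt, higher tier
```

`Decimal` is used instead of floats so that small per-request amounts add up without rounding drift when many estimates are summed.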