Google: Gemini 2.5 Flash
Model Overview
| Property | Value |
|---|---|
| Model ID | google/gemini-2.5-flash |
| Name | Gemini 2.5 Flash |
| Provider | |
| Released | 2025 |
Description
Lightning-fast and highly capable model that delivers a balance of intelligence and latency. Gemini 2.5 Flash offers controllable thinking budgets for versatile applications, making it ideal for a wide range of tasks requiring both speed and quality.
Specifications
| Spec | Value |
|---|---|
| Context Window | 1,048,576 tokens |
| Max Completion | 8,192 tokens |
| Modalities | Text, Image, Audio, Video |
Pricing
| Type | Price |
|---|---|
| Input | $0.30 per 1M tokens |
| Output | $2.50 per 1M tokens |
Capabilities
- Vision: Yes
- Tool Use: Yes
- JSON Mode: Yes
- Streaming: Yes
- Function Calling: Yes
- Controllable Thinking: Yes
Use Cases
Versatile applications requiring balance of speed and intelligence, multimodal tasks, production deployments.
Integration with LangMart
Gateway Support: Type 2 (Cloud), Type 3 (Self-hosted)
API Usage:
curl -X POST https://api.langmart.ai/v1/chat/completions \
-H "Authorization: Bearer sk-your-api-key" \
-H "Content-Type: application/json" \
-d '{
"model": "google/gemini-2.5-flash",
"messages": [{"role": "user", "content": "Hello"}],
"max_tokens": 4096
}'
Related Models
- google/gemini-2.5-flash-lite - Cost-optimized version
- google/gemini-2.5-pro - Higher capability variant
- google/gemini-2.0-flash - Previous generation
Last Updated: December 28, 2025