Google: Gemini 2.5 Flash

Model Overview

Property	Value
Model ID	`google/gemini-2.5-flash`
Name	Gemini 2.5 Flash
Provider	Google
Released	2025

Description

Lightning-fast and highly capable model that delivers a balance of intelligence and latency. Gemini 2.5 Flash offers controllable thinking budgets for versatile applications, making it ideal for a wide range of tasks requiring both speed and quality.

Specifications

Spec	Value
Context Window	1,048,576 tokens
Max Completion	8,192 tokens
Modalities	Text, Image, Audio, Video

Pricing

Type	Price
Input	$0.30 per 1M tokens
Output	$2.50 per 1M tokens

Capabilities

Vision: Yes
Tool Use: Yes
JSON Mode: Yes
Streaming: Yes
Function Calling: Yes
Controllable Thinking: Yes

Use Cases

Versatile applications requiring balance of speed and intelligence, multimodal tasks, production deployments.

Integration with LangMart

Gateway Support: Type 2 (Cloud), Type 3 (Self-hosted)

API Usage:

curl -X POST https://api.langmart.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "google/gemini-2.5-flash",
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 4096
  }'

google/gemini-2.5-flash-lite - Cost-optimized version
google/gemini-2.5-pro - Higher capability variant
google/gemini-2.0-flash - Previous generation

Last Updated: December 28, 2025

Google: Gemini 2.5 Flash

Google: Gemini 2.5 Flash

Model Overview

Description

Specifications

Pricing

Capabilities

Use Cases

Integration with LangMart

Related Models