Creating Alerts

This guide provides detailed instructions for creating each type of alert, with examples of when each is most useful.

Creating an Alert

Basic Steps

1. Navigate to the Alerts page.
2. Click Create Alert.
3. Fill in the alert details:
- Name - A descriptive name (required, minimum 3 characters)
- Description - Optional explanation of what this alert monitors
- Alert Type - Select from the six available types
- Threshold - The value that triggers the alert
- Period - Time window for the alert (hour, day, week, month)
- Notification Channels - Dashboard and/or email
4. Click Create Alert to save.
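The form fields above can be pictured as a small configuration object. The following is a minimal sketch, not the product's actual schema: the class name, field names, and alert type identifiers are hypothetical, and only the constraints stated in this guide (name of at least 3 characters, six alert types, four periods, two channels) are checked.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set

# Hypothetical identifiers for the six alert types covered in this guide.
ALERT_TYPES = {
    "cost_threshold", "error_rate", "usage_spike",
    "quota_limit", "model_performance", "unusual_activity",
}
PERIODS = {"hour", "day", "week", "month"}
CHANNELS = {"dashboard", "email"}


@dataclass
class AlertConfig:
    name: str
    alert_type: str
    period: str = "day"
    threshold: Optional[float] = None  # unusual_activity needs no threshold
    description: str = ""
    channels: Set[str] = field(default_factory=lambda: {"dashboard"})

    def validate(self) -> List[str]:
        """Return a list of problems; an empty list means the config is valid."""
        problems = []
        if len(self.name.strip()) < 3:
            problems.append("name must be at least 3 characters")
        if self.alert_type not in ALERT_TYPES:
            problems.append(f"unknown alert type: {self.alert_type}")
        if self.period not in PERIODS:
            problems.append(f"unknown period: {self.period}")
        if not self.channels or not self.channels <= CHANNELS:
            problems.append("channels must be dashboard and/or email")
        if self.alert_type != "unusual_activity" and self.threshold is None:
            problems.append("threshold is required for this alert type")
        return problems
```

For example, `AlertConfig(name="Daily Spend Over $50", alert_type="cost_threshold", threshold=50).validate()` returns an empty list, while a two-character name is reported as a problem.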

Cost Threshold Alerts

Monitor spending and stay within budget.

Configuration

Field	Description	Example
Threshold	Dollar amount that triggers alert	50
Period	Time window for cost calculation	day

When Triggers

Alert fires when your total spending in the period exceeds the threshold.
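The rule above amounts to summing costs inside the time window and comparing the total to the threshold. A minimal sketch follows, assuming usage is available as (timestamp, cost) pairs; the function names and the record shape are illustrative, and whether the real system uses rolling or calendar windows is not specified here.

```python
from datetime import datetime, timedelta


def total_cost_since(records, window_start):
    """Sum the cost of every record at or after window_start."""
    return sum(cost for timestamp, cost in records if timestamp >= window_start)


def cost_alert_fires(records, window_start, threshold):
    """True when spending in the window exceeds the threshold (strictly greater)."""
    return total_cost_since(records, window_start) > threshold


# Example: a $25 daily threshold checked against a rolling 24-hour window.
now = datetime(2024, 1, 2, 12, 0)
records = [
    (now - timedelta(hours=30), 40.0),  # outside the window, ignored
    (now - timedelta(hours=5), 10.0),
    (now - timedelta(hours=1), 16.5),
]
fires = cost_alert_fires(records, now - timedelta(days=1), threshold=25)
```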

Example Scenarios

Daily Budget Control

Name: Daily Cost Alert
Threshold: $25
Period: Day
Use case: Prevent runaway costs from bugs or unexpected usage

Weekly Budget Monitoring

Name: Weekly Spend Warning
Threshold: $100
Period: Week
Use case: Track weekly spending against your budget

Monthly Cost Cap

Name: Monthly Budget Alert
Threshold: $500
Period: Month
Use case: Ensure you don't exceed your monthly allocation

Recommended Settings

User Type	Suggested Threshold	Period
Hobbyist	$5-10	Day
Developer	$25-50	Day
Team	$100-500	Day
Enterprise	Custom	Day/Week

Error Rate Alerts

Catch API issues before they impact users.

Configuration

Field	Description	Example
Threshold	Error percentage that triggers alert	5
Period	Time window for calculation	hour
Min Requests	Minimum requests before evaluating	10

When Triggers

Alert fires when error rate exceeds threshold percentage within the period, only if minimum request count is met.
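The minimum request count keeps a handful of failures in a quiet period from tripping the alert. A minimal sketch of the check, assuming request and failure counts for the period are already available (the function and parameter names are illustrative):

```python
def error_rate_alert_fires(total_requests, failed_requests, threshold_pct, min_requests):
    """True when the error rate exceeds threshold_pct and enough requests were seen."""
    # Skip evaluation when traffic is below the minimum (or there is none at all).
    if total_requests == 0 or total_requests < min_requests:
        return False
    error_rate_pct = 100.0 * failed_requests / total_requests
    return error_rate_pct > threshold_pct
```

With a 5% threshold and a minimum of 10 requests, 12 failures out of 200 requests (6%) fires the alert, while 4 failures out of 8 requests does not, because the minimum request count is not met.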

Example Scenarios

API Health Monitoring

Name: High Error Rate Alert
Threshold: 5%
Period: Hour
Min Requests: 10
Use case: Detect when API reliability drops

Production Quality Gate

Name: Critical Error Alert
Threshold: 10%
Period: Hour
Min Requests: 5
Use case: Alert on severe issues affecting many requests

Low-Volume Monitoring

Name: Any Errors Alert
Threshold: 1%
Period: Day
Min Requests: 100
Use case: Catch issues in lower-volume applications

Understanding Error Rates

Errors include: timeouts, rate limits, server errors, invalid responses
Normal rate: 0-2% depending on use case
Warning level: 3-5%
Critical level: >5%

Usage Spike Alerts

Detect unusual increases in request volume.

Configuration

Field	Description	Example
Threshold	Percentage of baseline usage that triggers the alert	200
Baseline Period	Period to calculate normal usage	day
Comparison Period	Period to compare against baseline	hour

When Triggers

Alert fires when requests in the comparison period exceed the threshold percentage of baseline usage. For example, a 200% threshold fires at more than twice the normal volume.
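A minimal sketch of the spike check follows. It reads the threshold as a percentage of normal usage, matching the Threshold Guidelines table in this section (200% = 2x normal usage). It also assumes the baseline is scaled down to the length of the comparison period (for example, a day's requests divided by 24 to compare against one hour); that normalization and all names below are assumptions, not documented behavior.

```python
def usage_spike_alert_fires(current_requests, baseline_requests,
                            periods_in_baseline, threshold_pct):
    """True when current usage exceeds threshold_pct of the normalized baseline."""
    if baseline_requests <= 0 or periods_in_baseline <= 0:
        return False  # no baseline to compare against
    # Expected requests for one comparison period, e.g. daily total / 24 hours.
    expected = baseline_requests / periods_in_baseline
    usage_pct = 100.0 * current_requests / expected
    return usage_pct > threshold_pct
```

With a daily baseline of 2,400 requests (100 per hour), an hour with 250 requests is at 250% of normal and fires a 200% alert; an hour with 150 requests does not.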

Example Scenarios

Runaway Script Detection

Name: Traffic Spike Alert
Threshold: 200%
Baseline: Day
Comparison: Hour
Use case: Detect scripts or bots making excessive requests

Viral Traffic Alert

Name: Unexpected Volume
Threshold: 500%
Baseline: Week
Comparison: Day
Use case: Catch unexpected traffic surges

Gradual Growth Tracking

Name: Usage Growth Alert
Threshold: 150%
Baseline: Month
Comparison: Week
Use case: Track organic growth in usage

Threshold Guidelines

Threshold	Meaning
150%	1.5x normal usage
200%	2x normal usage
300%	3x normal usage
500%	5x normal usage

Quota Limit Alerts

Get warned before running out of credits.

Configuration

Field	Description	Example
Threshold	Percentage of quota used	80
Quota Type	What quota to monitor	credits

When Triggers

Alert fires when the specified percentage of your quota is consumed.
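In code terms, the check compares the consumed share of the quota with the threshold. A minimal sketch, assuming the total quota and the amount used are known (the function names are illustrative):

```python
def quota_used_pct(quota_total, quota_used):
    """Percentage of the quota that has been consumed."""
    return 100.0 * quota_used / quota_total


def quota_alert_fires(quota_total, quota_used, threshold_pct):
    """True once the consumed percentage reaches the threshold."""
    if quota_total <= 0:
        return False  # nothing to measure against
    return quota_used_pct(quota_total, quota_used) >= threshold_pct
```

With 1,000 credits and an 80% threshold, the alert fires once 800 credits are used.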

Example Scenarios

Credit Balance Warning

Name: Low Credits Alert
Threshold: 80%
Quota Type: Credits
Use case: Time to purchase more credits

Critical Balance Alert

Name: Credits Nearly Depleted
Threshold: 95%
Quota Type: Credits
Use case: Urgent - service may be interrupted

Comfortable Buffer

Name: Credits Below 50%
Threshold: 50%
Quota Type: Credits
Use case: Early warning for budget planning

Recommended Thresholds

Alert Level	Threshold	Action
Informational	50%	Plan next purchase
Warning	80%	Purchase soon
Critical	95%	Purchase immediately
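The three tiers in the table can be expressed as one lookup that returns the most severe level reached. This is an illustrative sketch only; the function name and the returned labels are assumptions, and in practice each tier would be created as its own alert.

```python
def quota_alert_level(used_pct):
    """Map a consumed-quota percentage to the most severe tier it has reached."""
    if used_pct >= 95:
        return "critical"       # purchase immediately
    if used_pct >= 80:
        return "warning"        # purchase soon
    if used_pct >= 50:
        return "informational"  # plan next purchase
    return None                 # below every tier
```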

Model Performance Alerts

Monitor response times and API performance.

Configuration

Field	Description	Example
Metric	What to measure	latency
Threshold	Milliseconds that trigger alert	10000
Period	Time window for average	hour

When Triggers

Alert fires when average latency in the period exceeds threshold.
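Because the check uses the average over the period, a single slow request will not fire the alert unless it pulls the mean above the threshold. A minimal sketch, assuming per-request latencies in milliseconds are available as a list (the function name is illustrative):

```python
from statistics import mean


def latency_alert_fires(latencies_ms, threshold_ms):
    """True when the average latency in the period exceeds the threshold."""
    if not latencies_ms:
        return False  # no requests in the period, nothing to average
    return mean(latencies_ms) > threshold_ms
```

For example, latencies of 8,000 ms, 12,000 ms, and 13,000 ms average 11,000 ms, which exceeds a 10,000 ms threshold.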

Example Scenarios

General Performance

Name: Slow Response Alert
Metric: Latency
Threshold: 10000ms (10 seconds)
Period: Hour
Use case: Catch general slowdowns

Real-Time Applications

Name: Latency Critical
Metric: Latency
Threshold: 5000ms (5 seconds)
Period: Hour
Use case: For applications needing fast responses

Batch Processing

Name: Batch Job Slowdown
Metric: Latency
Threshold: 30000ms (30 seconds)
Period: Day
Use case: For background processing that can tolerate delays

Latency Expectations by Model Type

Model Type	Typical Latency	Alert Threshold
Small/Fast	500ms - 2s	5,000ms
Standard	2s - 5s	10,000ms
Large/Complex	5s - 15s	30,000ms
Reasoning	10s - 60s	120,000ms

Unusual Activity Alerts

Detect anomalies and potential security issues.

Configuration

This alert type uses automatic pattern detection. No specific threshold is required.

When Triggers

Alert fires when the system detects:

Unusual request patterns
Requests from new sources
Abnormal time-of-day activity
Suspicious content patterns

Example Scenarios

Security Monitoring

Name: Suspicious Activity
Use case: Catch potential credential misuse

Anomaly Detection

Name: Unusual Patterns
Use case: Identify bot traffic or attacks

What Gets Flagged

Requests at unusual times
New IP addresses or regions
Rapid request pattern changes
Unusual model access patterns

Notification Channel Selection

Dashboard Only

Best for:

Non-critical alerts
Frequent notifications
When email would be too noisy

Email Only

Best for:

Critical alerts you can't miss
After-hours monitoring
Important threshold breaches

Both Dashboard and Email

Best for:

Critical cost and quota alerts
Production error monitoring
Anything requiring immediate action

Tips for Effective Alerts

Start Conservative

Begin with higher thresholds and lower them as you understand your usage patterns.

Name Clearly

Use descriptive names that tell you what the alert means at a glance:

Good: "Daily Spend Over $50"
Bad: "Cost Alert"

Document Purpose

Use the description field to explain why this alert exists and what action to take.

Review and Adjust

After a week of operation, review your alerts:

Are you getting too many notifications?
Did you miss any important events?
Should thresholds be adjusted?

Related Topics

Managing Alerts - History and acknowledgment
Alerts Overview - Introduction to alerting
Usage Analytics - Understand your usage patterns
