System Overview
RAM Used
—
— GB free
Running Models
—
none loaded
Requests Today
—
last 24 h
Avg TTFT
—
time-to-first-token
Avg Tok / s
—
generation speed
KV Cache Saved
—
vs 4096 default
Performance Trends
Requests per Hour
Last 24 hours
TTFT Over Time
Last 100 requests · ms · green <400 ms · blue <1200 ms · yellow <2500 ms · red ≥2500 ms
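The color legend above amounts to a simple threshold lookup. A minimal sketch, assuming the bands apply to each request's TTFT in milliseconds; the function name `ttft_band` is hypothetical and not taken from the dashboard's code:

```python
def ttft_band(ttft_ms: float) -> str:
    """Map a time-to-first-token value (ms) to the legend's color band.

    Thresholds are copied from the chart legend; the function itself is
    an illustrative assumption, not the dashboard's implementation.
    """
    if ttft_ms < 400:
        return "green"
    if ttft_ms < 1200:
        return "blue"
    if ttft_ms < 2500:
        return "yellow"
    return "red"  # 2500 ms and above


if __name__ == "__main__":
    for sample in (120, 400, 1199.9, 2500):
        print(f"{sample} ms -> {ttft_band(sample)}")
```

Each boundary value falls into the slower band, so a request at exactly 400 ms is blue rather than green, which matches the strict `<` comparisons in the legend.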
Raw vs Tuned · Context Window Optimization
KV Cache: Ollama Default (4096 tokens) vs autotune Dynamic Sizing
Ollama Default
4,096
tokens · fixed
autotune Average
—
tokens · dynamic
Context Reduction
—
—
KV Memory Saved
—
proportional
Avg TTFT (measured)
—
all-time average
—
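The "Context Reduction" and "KV Memory Saved" cards can be read as one ratio. For a given model, KV cache size grows linearly with the number of context tokens, so the fraction of context saved is also the fraction of KV memory saved. A minimal sketch under that assumption; `context_reduction` and its parameters are hypothetical names, and how the dashboard actually computes these values is not stated in the page:

```python
OLLAMA_DEFAULT_CTX = 4096  # fixed baseline shown on the "Ollama Default" card


def context_reduction(avg_ctx_tokens: float,
                      default_ctx: int = OLLAMA_DEFAULT_CTX) -> float:
    """Return the fraction by which the average context undercuts the default.

    Because KV cache memory is proportional to context length for a fixed
    model, the same fraction approximates the KV memory saved.
    A negative result means the average context exceeded the default.
    """
    if default_ctx <= 0:
        raise ValueError("default_ctx must be positive")
    return 1.0 - avg_ctx_tokens / default_ctx


if __name__ == "__main__":
    # Illustrative input only, not a measured value.
    print(f"{context_reduction(1024):.0%}")
```

The result is deliberately not clamped at zero: if dynamic sizing picks windows larger than 4,096 tokens on average, a negative value makes that visible instead of hiding it.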
Per-Model Breakdown
All Models
—
| Model | Requests | Avg TTFT | Min / Max TTFT | Avg Tok/s | Avg Context | Avg Elapsed | Total Tokens | Last Used |
|---|---|---|---|---|---|---|---|---|
| Loading… | | | | | | | | |
API Keys & Slow Requests
Active API Keys
—
| Name | Key Prefix | Req Today | Tokens Today | Last Used |
|---|---|---|---|---|
| Loading… | | | | |
Slow Requests (> 5 s elapsed)
—
| Model | Elapsed | TTFT | Context | Profile | Time |
|---|---|---|---|---|---|
| Loading… | | | | | |
Suggestions
Loading…