connecting…
System Overview
RAM Used
— GB free
Running Models
none loaded
Requests Today
last 24 h
Avg TTFT
time-to-first-token
Avg Tok / s
generation speed
KV Cache Saved
vs 4096 default
Performance Trends
Requests per Hour
Last 24 hours
TTFT Over Time
Last 100 requests · ms · green <400 ms · blue <1200 ms · yellow <2500 ms · red ≥2500 ms
Raw vs Tuned · Context Window Optimization
KV Cache: Ollama Default (4096 tokens) vs autotune Dynamic Sizing
Ollama Default
4,096
tokens · fixed
autotune Average
tokens · dynamic
Context Reduction
KV Memory Saved
proportional
Avg TTFT (measured)
all-time average
Per-Model Breakdown
All Models
Model Requests Avg TTFT Min / Max TTFT Avg Tok/s Avg Context Avg Elapsed Total Tokens Last Used
Loading…
API Keys & Slow Requests
Active API Keys
Name Key Prefix Req Today Tokens Today Last Used
Loading…
Slow Requests > 5 s elapsed
Model Elapsed TTFT Context Profile Time
Loading…
Suggestions
Loading…