vLLM Metrics Dashboard
v0.2.3
Connecting...
30m
2h
8h
24h
3d
7d
30d
Smooth
Running Requests
-
Waiting Requests
-
KV Cache
-
Requests/s
-
Output Tokens/s
-
Cache Hit Rate
-
Total Requests
-
Uptime
-
Running Requests
Waiting Requests
Requests / second
Output Tokens / second
Input Tokens / second
KV Cache Usage (%)
Cache Hit Rate (%)
Latency (TTFT / ITL / E2E)
Per-Engine Running Requests
Engine Details
Engine
Running
Waiting
KV Cache
Prompt Tokens
Gen Tokens
Loading...