dtype=float32. Using bfloat16/float16 on CPU will cause errors.
Enable MCP to use tools from configured servers
Configure MCP Servers âNo MCP servers configured
Add MCP Server âRetrieval-Augmented Generation is under development.
Upload documents and enhance responses with relevant context.
Performance BenchmarkingInstall GuideLLM to enable advanced benchmarking features:
pip install "vllm-playground[benchmark]"
or
pip install guidellm
Built-in benchmarking is still available without GuideLLM.
No benchmark data available
Start the vLLM server and click "Run Benchmark" to test performance
Configure Model Context Protocol servers to extend LLM capabilities with external tools
Install the MCP package to enable this feature:
pip install vllm-playground[mcp]
or
pip install mcp
No MCP servers configured
Add a server to get started, or choose from presets below
npx (Node.js) - Required for Filesystem serveruvx (uv) - Required for Git, Fetch, Time serversNo resources available
No prompts available
Optimized configurations from the vLLM Recipes Repository. Select a recipe to auto-configure the playground.
Are you sure?
Choose export format:
Are you sure you want to clear all messages? This action cannot be undone.