1
Prerequisites
setup
Confirm your environment has all required dependencies before installing the server.
python3 --version
✓ Validation: Output shows
Python 3.11.x or higher.pip --version
brew install gh && gh auth login
pip install mlx-lm
⚠ Note: MLX only runs on Apple Silicon (M1/M2/M3/M4). If you are on Intel Mac or Linux, install Ollama instead:
brew install ollama
2
Install mlx-mcp-server
install
Install the package from PyPI and confirm the CLI is accessible.
pip install mlx-mcp-server
mlx-mcp-server --help
✓ Validation: Prints usage information including the
install subcommand.
3
Configure Claude Desktop
config
Register the MCP server with Claude Desktop so it appears as a toolset.
mlx-mcp-server install --dry-run
✓ Validation: Prints JSON config — confirm it looks correct before writing.
mlx-mcp-server install
✓ Validation: Prints
Added mlx-mcp-server to Claude Desktop config.⚠ Note: If you are using Ollama instead of MLX, pass additional flags:
mlx-mcp-server install --base-url http://localhost:11434 --model mistral
4
Start your local LLM
runtime
Download a model and start the inference server that Claude will talk to.
Apple Silicon (MLX)
pip install huggingface_hub && \
huggingface-cli download mlx-community/Mistral-7B-Instruct-v0.3-4bit
mlx_lm.server --model mlx-community/Mistral-7B-Instruct-v0.3-4bit
✓ Validation: Logs show
Starting server on http://localhost:8080.Ollama (Intel Mac / Linux)
ollama serve & && ollama pull mistral
5
Verify in Claude Desktop
test
Confirm Claude Desktop can see and use the MLX MCP tools.
✓ Validation:
mlx is listed in the tools panel with 4 tools available.Use the health_check tool to verify my local MLX server is running.
✓ Validation: Claude reports
status: ok.Use the quick_test tool with test_type=math
✓ Validation: Returns a math response including tok/s throughput metrics.
6
Configure Claude Code (optional)
config
Also wire up mlx-mcp-server for the Claude Code CLI, if you use it.
mlx-mcp-server install --claude-code
✓ Validation: Prints
Added mlx-mcp-server to Claude Code config.Use the list_models tool
✓ Validation: Claude Code lists available MLX models.
7
Share with your team
team
Spread the word — two commands to get any teammate running.
# Apple Silicon (MLX)
pip install mlx-mcp-server && mlx-mcp-server install
# Intel Mac / Linux (Ollama)
pip install mlx-mcp-server && mlx-mcp-server install --base-url http://localhost:11434 --model mistral
https://github.com/deresolution20/mlx-mcp-server