Performance: 18.9 tokens per second
Response time: 950 ms
Availability: 98.5%
Uptime: 97.8%
Pricing available on request
Other providers, such as Anthropic and OpenAI, may be available through different endpoints.
Performance varies by provider: some exceed 30 tokens per second (TPS), while others prioritize output quality over speed.
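
To put the throughput and response-time figures above in context, here is a minimal Python sketch that estimates how long a full reply might take. It assumes "response time" means the delay before the first token arrives and that throughput stays constant afterward; the listing does not confirm either. The function name `estimate_total_seconds` and its parameters are hypothetical, chosen only for illustration.

```python
def estimate_total_seconds(output_tokens: int,
                           tokens_per_second: float = 18.9,
                           response_time_ms: float = 950.0) -> float:
    """Rough wall-clock estimate for generating `output_tokens` tokens.

    Assumption (not stated in the listing): the response time is the wait
    before the first token, and tokens then stream at a constant rate.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    # Initial latency in seconds plus streaming time at the quoted throughput.
    return response_time_ms / 1000.0 + output_tokens / tokens_per_second


if __name__ == "__main__":
    for n in (100, 189, 500):
        print(f"{n} tokens: about {estimate_total_seconds(n):.2f} s")
```

Under these assumptions, a 189-token reply would take about 10.95 seconds (0.95 s of latency plus 10 s of streaming). At 30 TPS with the same latency, the same reply would take about 7.25 seconds.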