Metadata-Version: 2.4
Name: sixtytwo-cli
Version: 0.3.2
Summary: Sixtytwo CLI: `sixtytwo rent` reserves reliability-backed GPUs; `sixtytwo` qualifies, monitors, and NCCL-benchmarks your own GPU clusters, with Slurm/SkyPilot integration.
Author: Sixtytwo, Inc.
License-Expression: LicenseRef-Sixtytwo-Commercial
Keywords: gpu,nccl,skypilot,slurm,cluster,benchmarking
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Operating System :: POSIX :: Linux
Classifier: Operating System :: MacOS
Classifier: Programming Language :: Python :: 3 :: Only
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: System :: Distributed Computing
Classifier: Topic :: System :: Monitoring
Requires-Python: >=3.11
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: rich<15,>=14.0
Requires-Dist: fastapi<1,>=0.115
Provides-Extra: server
Requires-Dist: fastapi<1,>=0.115; extra == "server"
Requires-Dist: uvicorn<1,>=0.34; extra == "server"
Requires-Dist: geoip2<6,>=4.8; extra == "server"
Provides-Extra: sentry
Requires-Dist: sentry-sdk[fastapi]<3,>=2.18; extra == "sentry"
Provides-Extra: gpu
Requires-Dist: nvidia-ml-py3>=7.352.0; extra == "gpu"
Requires-Dist: torch>=2.11; extra == "gpu"
Provides-Extra: skypilot
Requires-Dist: skypilot<0.12,>=0.9.0; extra == "skypilot"
Provides-Extra: skypilot-aws
Requires-Dist: skypilot[aws]<0.12,>=0.9.0; extra == "skypilot-aws"
Provides-Extra: skypilot-gcp
Requires-Dist: skypilot[gcp]<0.12,>=0.9.0; extra == "skypilot-gcp"
Provides-Extra: skypilot-lambda
Requires-Dist: skypilot[lambda]<0.12,>=0.9.0; extra == "skypilot-lambda"
Provides-Extra: skypilot-runpod
Requires-Dist: skypilot[runpod]<0.12,>=0.9.0; extra == "skypilot-runpod"
Dynamic: license-file

# sixtytwo

`sixtytwo-cli` is the command-line client for [sixtytwo](https://sixtytwo.ai),
built for the terminal and for coding agents. It covers two jobs:

- **Rent reliability-backed GPUs.** `sixtytwo rent` reserves GPUs that sixtytwo
  has already qualified, watches them while you run, and refunds the unused time
  if hardware goes bad.
- **Qualify and monitor your own clusters.** `sixtytwo` runs hardware checks,
  NCCL benchmarks, and live fault monitoring on GPUs you already have, and keeps
  a per-node trust score from every test, fault, and recovery.

`sixtytwo rent` stays light (no torch), so it runs anywhere. The qualification
and monitoring commands pull in the GPU stack only when you ask for it.

## Install

```bash
pip install sixtytwo-cli
```

Optional extras:

```bash
pip install 'sixtytwo-cli[gpu]'        # nvidia-ml-py + torch for local GPU checks
pip install 'sixtytwo-cli[skypilot]'   # SkyPilot cloud adapter (see below)
pip install 'sixtytwo-cli[server]'     # local dashboard + collector daemon
```

## Rent GPUs

```bash
sixtytwo rent catalog                  # list SKUs and prices
sixtytwo rent H100_SXM -n 8 -H 4       # reserve 8x H100 for 4 hours, wait for READY
sixtytwo rent ls                       # list my reservations
sixtytwo rent status <reservation-id>
sixtytwo rent ssh <reservation-id>     # open a shell on a READY reservation
```

Every node is validated before you get it and monitored while you use it. Add
`--region us|eu|asia` to pin a geography; billing follows actual usage.

## Qualify and monitor your own GPUs

```bash
sixtytwo init --cluster prod
sixtytwo doctor --json                 # ssh reachability, nvidia-smi, dcgmi, topology
sixtytwo test --quick --all            # fast per-node qualification
sixtytwo test --full gpu-01,gpu-02     # full suite, including NCCL
sixtytwo launch --pre-check python train.py   # qualify, then run with live monitoring
sixtytwo nodes                         # trust scores and fault history
```

Results land in a local trust registry, so a node's history (tests, faults,
recoveries) follows it over time.

## SkyPilot

There are two ways to combine sixtytwo with
[SkyPilot](https://skypilot.readthedocs.io):

```bash
# 1. Qualify a cluster you launched on any cloud SkyPilot supports
sixtytwo skypilot qualify my-cluster --mode quick

# 2. Provision sixtytwo's own GPUs through SkyPilot with `cloud: sixtytwo`
pip install 'sixtytwo-cli[skypilot]'
sixtytwo skypilot install              # register sixtytwo as a SkyPilot cloud
sky show-gpus --cloud sixtytwo
```

`sixtytwo skypilot install` is opt-in. It wires the adapter into the active
virtualenv so `sky launch` can reserve reliability-backed GPUs with
`cloud: sixtytwo`. Run `sixtytwo skypilot --help` for the full set of
subcommands.

## Metrics for Grafana

`sixtytwo metrics serve` exposes a Prometheus `/metrics` endpoint backed by the
local trust registry: trust scores, fault counters, recovery downtime, and
per-check status. `sixtytwo metrics export-grafana` prints a curated dashboard
you can import as is.

```bash
sixtytwo metrics serve --host 0.0.0.0 --port 9620
sixtytwo metrics export-grafana --output sixtytwo-overview.json
```

## License

Commercial. The bundled `LICENSE` governs use. Learn more at
[sixtytwo.ai](https://sixtytwo.ai).
