Metadata-Version: 2.4
Name: sophistry-bench-sprint
Version: 0.1.5
Summary: Single-agent advocacy variant of sophistry-bench for the Prime Intellect Reward Hacking Sprint. Pre-registered hypothesis: training Llama-3.2-1B on a programmatic claim-count cliff (peak at n=8) will cause cliff convergence within 100 GRPO steps; three adversarial canary rewards detect format-hacking. Self-contained (vendored from sophistry-bench v0.1.19) — no runtime dependency on the main package, to work around PI training-infra's exclude-newer index filter.
Requires-Python: >=3.10
Requires-Dist: datasets>=2.0
Requires-Dist: pydantic>=2.0
Requires-Dist: verifiers>=0.1.14
