Conformal-calibrated RL envs expose scientific reasoning gaps in frontier LLMs.
