Olympiad-level formal mathematical reasoning with reinforcement learning
Olympiad-level formal mathematical reasoning with reinforcement learning Summary This Nature preview describes AlphaProof, an AlphaZero-inspired reinforcement learning agent trained to find formal proofs in the Lean proof assistant. The system…
