SAGE: Self-play Adversarial Games Enhance Large Language Model Reasoning Capabilities
A framework for improving LLM reasoning through adversarial self-play: a Setter generates challenging problems and a Solver attempts to solve them. It achieves gains of up to +10% on MATH and +8% on MBPP, and the improvements transfer across domains.
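
The Setter/Solver interaction can be illustrated with a toy round of self-play. This is a minimal sketch and not the framework's implementation: the `Problem` class, the `setter`, `solver`, and `self_play_round` functions, the arithmetic task, and the zero-sum reward rule are all hypothetical stand-ins chosen for illustration. In the real setting both roles would be played by language models.

```python
from dataclasses import dataclass


@dataclass
class Problem:
    """Hypothetical toy problem: add two integers."""
    a: int
    b: int

    @property
    def answer(self) -> int:
        return self.a + self.b


def setter(difficulty: int) -> Problem:
    """Hypothetical Setter: a higher difficulty yields larger operands."""
    return Problem(a=10 ** difficulty - 1, b=1)


def solver(problem: Problem, max_digits: int):
    """Hypothetical Solver: gives up when the result exceeds max_digits digits."""
    result = problem.a + problem.b
    return result if len(str(result)) <= max_digits else None


def self_play_round(difficulty: int, max_digits: int):
    """Play one round and return (solver_reward, setter_reward).

    Assumption: rewards are zero-sum, so the Setter is rewarded
    exactly when the Solver fails.
    """
    problem = setter(difficulty)
    solved = solver(problem, max_digits) == problem.answer
    solver_reward = 1.0 if solved else 0.0
    setter_reward = 1.0 - solver_reward
    return solver_reward, setter_reward


if __name__ == "__main__":
    print(self_play_round(difficulty=2, max_digits=5))  # easy problem
    print(self_play_round(difficulty=6, max_digits=5))  # too hard for this Solver
```

Under the zero-sum assumption, the Setter is pushed toward problems just beyond the Solver's current ability, which is the adversarial pressure the description refers to.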