
hey! i'm saraswathy amjith

sometimes i go by sara!

Papers

[Figure: SAGE overview. Code-Game: the Setter poses def f(x): return x**2+3 and asks f(7) = ?; the Solver answers 7**2 + 3 = 52, verified by execution. Math-Game: the Setter poses ∫ x² dx from 0 to 2; the Solver answers x³/3 |₀² = 8/3, judged by an LLM.]

SAGE: Self-play Adversarial Games Enhance Large Language Model Reasoning Capabilities

Amjith, S., Wang, M.X., Lynch, J., Gundlach, H., & Thompson, N.

RSI @ ICLR 2026

A framework for improving LLM reasoning through adversarial self-play, in which a Setter generates challenging problems and a Solver attempts to solve them. It achieves up to +10% on MATH and +8% on MBPP, with cross-domain transfer.
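
To give a feel for the code game, here is a minimal sketch of one round's verification step, using the f(7) example from the figure. This is not the paper's implementation: the function name `verify_code_round` and its parameters are hypothetical, and the Setter and Solver are stand-ins (plain strings and numbers rather than model outputs).

```python
def verify_code_round(setter_source: str, func_name: str, arg, solver_answer) -> bool:
    """Execute the Setter's function and check the Solver's claimed output.

    Hypothetical helper for illustration only; not taken from the paper.
    """
    namespace = {}
    # Toy setting: exec on trusted strings only. Real model-generated code
    # would need a sandbox.
    exec(setter_source, namespace)
    return namespace[func_name](arg) == solver_answer


# Setter's problem: a function plus an input to evaluate it on.
setter_source = "def f(x): return x**2+3"

# Solver's claimed answer for f(7): 7**2 + 3 = 52.
print(verify_code_round(setter_source, "f", 7, 52))  # True
```

A wrong answer (say 51) returns False, which is the kind of signal an execution-verified game can score against.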