blob: 174a71c1491cbf9c1908ee5fbf2586631a96608f (
plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
|
# GAP: Generalization-and-Perturbation Framework
Code for the GAP framework, which decomposes equivalence-preserving transformations of mathematical problems into **surface** (deterministic renaming) and **kernel** (parameter resampling with multi-judge verification) families.
## Contents
- `putnam-bench-anon/` — evaluation harness: multi-provider async solver/grader with checkpoint, cross-provider grading, CLI
- `mini_gap_math*.py` — MATH Level 4-5 generalization experiments
- `kv_math*.py` — kernel-variant generation and evaluation on MATH
- `analysis/` — structural-overlap analysis, rescue experiment runner, figure generation
## Quick start
```bash
pip install -r putnam-bench-anon/requirements.txt
python putnam-bench-anon/putnam_cli.py --help
```
## License
CC-BY-4.0. See `LICENSE`.
|