Paper Notes

Reading notes on ML papers I've found interesting, mostly RL, reasoning, and reward models.

2026

2025