Paper Notes
Reading notes — mostly on RL, reasoning, reward models, and other ML papers I've found interesting.
2026
- TikZ Rendering Test meta
2025
- 前端工具链小记 misc
- VLM 知识蒸馏 vlm
- 参数计算 misc
- Alignment 算力小记 misc rl
2024
- ISR 1220 杂记 misc
- Quiet-STaR reasoning
- LLM Notes misc
- LLM Know What They Know reasoning
- Critique-out-Loud Reward Models reward-model rl
- 好博客 misc
- Note 2024-07-17 misc
- Graph Learning: A Survey survey
- Math RLHF Papers rl reasoning
- Shepherd-MCTS reasoning
- LLM Critics Help Catch LLM Bugs reward-model reasoning
- Many-Shot In-Context Learning reasoning vlm