- Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
  Paper • 2505.24726 • Published • 273
- Reinforcement Pre-Training
  Paper • 2506.08007 • Published • 260
- GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
  Paper • 2507.01006 • Published • 236
- A Survey of Context Engineering for Large Language Models
  Paper • 2507.13334 • Published • 256
Erik Thorelli
esthor
AI & ML interests
Quantifying Agent Experience
Recent Activity
updated a collection 9 days ago: papers-to-read
Organizations
None yet