OceanGym: A Benchmark Environment for Underwater Embodied Agents Paper • 2509.26536 • Published 4 days ago • 33
Towards Personalized Deep Research: Benchmarks and Evaluations Paper • 2509.25106 • Published 5 days ago • 27
KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality Paper • 2506.19807 • Published Jun 24 • 7
CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners Paper • 2503.16356 • Published Mar 20 • 15
ReLearn: Unlearning via Learning for Large Language Models Paper • 2502.11190 • Published Feb 16 • 30
How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training Paper • 2502.11196 • Published Feb 16 • 23
Exploring Model Kinship for Merging Large Language Models Paper • 2410.12613 • Published Oct 16, 2024 • 21