Submitted by taesiri 132 AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs · 11 authors 1.14k 9
Submitted by MasterVito 117 Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR · 7 authors 34 5
Submitted by MingyuLiu 43 ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon Tasks · 10 authors 2
Submitted by taesiri 29 AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications · 23 authors 10.3k 4
Submitted by zhwang01 14 AetherCode: Evaluating LLMs' Ability to Win In Premier Programming Competitions · 28 authors 4
Submitted by hynnsk 12 Selective Contrastive Learning for Weakly Supervised Affordance Grounding · 3 authors 10 3
Submitted by chaoyi-wu 10 End-to-End Agentic RAG System Training for Traceable Diagnostic Reasoning · 10 authors 43 2
Submitted by taesiri 9 Do What? Teaching Vision-Language-Action Models to Reject the Impossible · 6 authors 2
Submitted by fxmeng 7 TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill & Decode Inference · 7 authors 2
Submitted by taesiri 3 InMind: Evaluating LLMs in Capturing and Applying Individual Human Reasoning Styles · 11 authors 7 2
Submitted by Qlisp 3 CARFT: Boosting LLM Reasoning via Contrastive Learning with Annotated Chain-of-Thought-based Reinforced Fine-Tuning · 5 authors 3
Submitted by Charlie019 3 Learnable SMPLify: A Neural Solution for Optimization-Free Human Pose Inverse Kinematics · 5 authors 9 2
Submitted by AlienZhang1996 1 Jailbreaking Commercial Black-Box LLMs with Explicitly Harmful Prompts · 6 authors 9 2