VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning Paper • 2507.22607 • Published 1 day ago • 26
MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization Paper • 2507.14683 • Published 12 days ago • 117
SeaExam and SeaBench: Benchmarking LLMs with Local Multilingual Questions in Southeast Asia Paper • 2502.06298 • Published Feb 10 • 1
Large Language Models can Contrastively Refine their Generation for Better Sentence Representation Learning Paper • 2310.10962 • Published Oct 17, 2023
Finding the Sweet Spot: Preference Data Construction for Scaling Preference Optimization Paper • 2502.16825 • Published Feb 24 • 7
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations Paper • 2504.13816 • Published Apr 18 • 17
SOUL: Towards Sentiment and Opinion Understanding of Language Paper • 2310.17924 • Published Oct 27, 2023
MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback Paper • 2505.17873 • Published May 23 • 31
Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective Paper • 2506.17930 • Published Jun 22 • 19
LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning Paper • 2506.18841 • Published Jun 23 • 56