Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper • 2504.13837 • Published 21 days ago • 121
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 11 days ago • 607
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper • 2504.13837 • Published 21 days ago • 121
OpenGVLab/PIIP-LLaVA-Plus_ConvNeXt-L_CLIP-L_1024-336_7B Image-Text-to-Text • Updated 19 days ago • 13
PIIP Collection [NeurIPS 2024 Spotlight ] Parameter-Inverted Image Pyramid Networks • 11 items • Updated 19 days ago • 1
PIIP Collection [NeurIPS 2024 Spotlight ] Parameter-Inverted Image Pyramid Networks • 11 items • Updated 19 days ago • 1