AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO Paper β’ 2502.14669 β’ Published Feb 20 β’ 14