SPA: 3D Spatial-Awareness Enables Effective Embodied Representation Paper • 2410.08208 • Published Oct 10, 2024
VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers Paper • 2507.01016 • Published Jul 1, 2025