Does Data Scaling Lead to Visual Compositional Generalization? Paper • 2507.07102 • Published Jul 9 • 1
CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally Paper • 2502.03566 • Published Feb 5 • 2
Diffusion Classifiers Understand Compositionality, but Conditions Apply Paper • 2505.17955 • Published May 23 • 22