From Bytes to Ideas: Language Modeling with Autoregressive U-Nets Paper • 2506.14761 • Published Jun 17 • 17
zip2zip: Inference-Time Adaptive Vocabularies for Language Models via Token Compression Paper • 2506.01084 • Published Jun 1 • 7