🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement le
Shawn
csfufu
Recent Activity
upvoted a paper (about 18 hours ago): Interleaving Reasoning for Better Text-to-Image Generation
liked a model (about 1 month ago): rednote-hilab/dots.vlm1.inst
liked a model (about 2 months ago): csfufu/Revisual-R1-final