-
EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
Paper • 2402.04252 • Published • 28 -
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
Paper • 2402.03749 • Published • 14 -
ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Paper • 2402.04615 • Published • 44 -
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss
Paper • 2402.05008 • Published • 23
Collections
Discover the best community collections!
Collections including paper arxiv:2506.17201
-
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Paper • 2506.07044 • Published • 113 -
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning
Paper • 2506.09513 • Published • 98 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
Seedance 1.0: Exploring the Boundaries of Video Generation Models
Paper • 2506.09113 • Published • 101
-
PlayerOne: Egocentric World Simulator
Paper • 2506.09995 • Published • 34 -
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition
Paper • 2506.17201 • Published • 55 -
Playing with Transformer at 30+ FPS via Next-Frame Diffusion
Paper • 2506.01380 • Published • 2
-
TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion
Paper • 2401.09416 • Published • 11 -
SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild
Paper • 2401.10171 • Published • 14 -
DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model
Paper • 2311.09217 • Published • 22 -
GALA: Generating Animatable Layered Assets from a Single Scan
Paper • 2401.12979 • Published • 9
-
Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation
Paper • 2506.09350 • Published • 47 -
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition
Paper • 2506.17201 • Published • 55 -
GameFactory: Creating New Games with Generative Interactive Videos
Paper • 2501.08325 • Published • 67 -
Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches
Paper • 2408.04567 • Published • 26
-
EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
Paper • 2402.04252 • Published • 28 -
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
Paper • 2402.03749 • Published • 14 -
ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Paper • 2402.04615 • Published • 44 -
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss
Paper • 2402.05008 • Published • 23
-
TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion
Paper • 2401.09416 • Published • 11 -
SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild
Paper • 2401.10171 • Published • 14 -
DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model
Paper • 2311.09217 • Published • 22 -
GALA: Generating Animatable Layered Assets from a Single Scan
Paper • 2401.12979 • Published • 9
-
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Paper • 2506.07044 • Published • 113 -
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning
Paper • 2506.09513 • Published • 98 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
Seedance 1.0: Exploring the Boundaries of Video Generation Models
Paper • 2506.09113 • Published • 101
-
PlayerOne: Egocentric World Simulator
Paper • 2506.09995 • Published • 34 -
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition
Paper • 2506.17201 • Published • 55 -
Playing with Transformer at 30+ FPS via Next-Frame Diffusion
Paper • 2506.01380 • Published • 2
-
Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation
Paper • 2506.09350 • Published • 47 -
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition
Paper • 2506.17201 • Published • 55 -
GameFactory: Creating New Games with Generative Interactive Videos
Paper • 2501.08325 • Published • 67 -
Sketch2Scene: Automatic Generation of Interactive 3D Game Scenes from User's Casual Sketches
Paper • 2408.04567 • Published • 26