1 10

magicwpf

https://scholar.google.com/citations?user=P6MraaYAAAAJ&hl=en

AI & ML interests

None yet

Recent Activity

authored a paper 5 days ago

Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content

authored a paper 5 days ago

SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning

authored a paper 5 days ago

HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment

View all activity

Organizations

None yet

magicwpf's activity

authored 4 papers 5 days ago

Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content

Paper • 2410.08260 • Published Oct 10, 2024

SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning

Paper • 2504.00396 • Published Apr 1 • 4

HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment

Paper • 2503.23907 • Published Mar 31 • 2

A Survey of Interactive Generative Video

Paper • 2504.21853 • Published 9 days ago • 43

upvoted a paper 6 days ago

A Survey of Interactive Generative Video

Paper • 2504.21853 • Published 9 days ago • 43

authored 4 papers about 1 month ago

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

Paper • 2503.24379 • Published Mar 31 • 76

upvoted 2 papers about 1 month ago

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

Paper • 2503.24379 • Published Mar 31 • 76

SketchVideo: Sketch-based Video Generation and Editing

Paper • 2503.23284 • Published Mar 30 • 23

upvoted a paper about 2 months ago

Position: Interactive Generative Video as Next-Generation Game Engine

Paper • 2503.17359 • Published Mar 21 • 62

authored a paper about 2 months ago

DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers

Paper • 2503.14487 • Published Mar 18 • 27

upvoted a paper about 2 months ago

DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers

Paper • 2503.14487 • Published Mar 18 • 27

authored 6 papers about 2 months ago

SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs

Paper • 2408.11813 • Published Aug 21, 2024 • 12

MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding

Paper • 2410.21747 • Published Oct 29, 2024

SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Paper • 2412.07760 • Published Dec 10, 2024 • 56

3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Paper • 2412.07759 • Published Dec 10, 2024 • 18

StyleMaster: Stylize Your Video with Artistic Generation and Translation

Paper • 2412.07744 • Published Dec 10, 2024 • 19

VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing

Paper • 2411.15260 • Published Nov 22, 2024