metadata

pipeline_tag: image-to-video
library_name: diffusers
license: mit

VIRES_VAE model card

Model Page: VIRES

This is a component of the VIRES model, as presented in VIRES: Video Instance Repainting with Sketch and Text Guidance. VIRES is a video instance repainting method with sketch and text guidance, enabling video instance repainting, replacement, generation, and removal. It leverages the generative priors of text-to-video models to maintain temporal consistency and produce visually pleasing results.

Usage

Ref to our GitHub page: https://github.com/suimuc/VIRES/

Citation

@article{vires,
      title={VIRES: Video Instance Repainting via Sketch and Text Guided Generation},
      author={Weng, Shuchen and Zheng, Haojie and Zhang, Peixuan and Hong, Yuchen and Jiang, Han and Li, Si and Shi, Boxin},
      journal={arXiv preprint arXiv:2411.16199},
      year={2024}
}