---
pipeline_tag: image-to-video
library_name: diffusers
license: mit
---
# VIRES_VAE model card

**Model Page**: [VIRES](https://hjzheng.net/projects/VIRES/)

This is the VAE component of the VIRES model, as presented in [VIRES: Video Instance Repainting via Sketch and Text Guided Generation](https://huggingface.co/papers/2411.16199). VIRES is a video instance repainting method guided by sketch and text. It supports repainting, replacing, generating, and removing instances in a video, and it leverages the generative priors of text-to-video models to maintain temporal consistency and produce visually pleasing results.

## Usage

For installation and usage instructions, refer to our GitHub repository: [https://github.com/suimuc/VIRES/](https://github.com/suimuc/VIRES/)

## Citation

```BibTeX
@article{vires,
  title={VIRES: Video Instance Repainting via Sketch and Text Guided Generation},
  author={Weng, Shuchen and Zheng, Haojie and Zhang, Peixuan and Hong, Yuchen and Jiang, Han and Li, Si and Shi, Boxin},
  journal={arXiv preprint arXiv:2411.16199},
  year={2024}
}
```