File size: 1,035 Bytes
9e93a6c
 
 
 
 
 
6e83386
 
 
 
9e93a6c
 
6e83386
 
 
9e93a6c
6e83386
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
---
pipeline_tag: image-to-video
library_name: diffusers
license: mit
---

# VIRES_VAE model card

**Model Page**: [VIRES](https://hjzheng.net/projects/VIRES/)

This is a component of the VIRES model, as presented in [VIRES: Video Instance Repainting with Sketch and Text Guidance](https://huggingface.co/papers/2411.16199). VIRES is a video instance repainting method with sketch and text guidance, enabling video instance repainting, replacement, generation, and removal.  It leverages the generative priors of text-to-video models to maintain temporal consistency and produce visually pleasing results.


### Usage

Ref to our GitHub page: [https://github.com/suimuc/VIRES/](https://github.com/suimuc/VIRES/)

## Citation

```BibTeX
@article{vires,
      title={VIRES: Video Instance Repainting via Sketch and Text Guided Generation},
      author={Weng, Shuchen and Zheng, Haojie and Zhang, Peixuan and Hong, Yuchen and Jiang, Han and Li, Si and Shi, Boxin},
      journal={arXiv preprint arXiv:2411.16199},
      year={2024}
}
```