suimu
/

VIRES_VAE

VideoAutoencoder_With_Sketch_Pipeline

Model card Files Files and versions Community

VIRES_VAE / README.md

suimu's picture

Improve model card with metadata and links (#1)

9e93a6c verified about 1 month ago

|

history blame contribute delete

1.04 kB

	---
	pipeline_tag: image-to-video
	library_name: diffusers
	license: mit
	---

	# VIRES_VAE model card

	Model Page: [VIRES](https://hjzheng.net/projects/VIRES/)

	This is a component of the VIRES model, as presented in [VIRES: Video Instance Repainting with Sketch and Text Guidance](https://huggingface.co/papers/2411.16199). VIRES is a video instance repainting method with sketch and text guidance, enabling video instance repainting, replacement, generation, and removal. It leverages the generative priors of text-to-video models to maintain temporal consistency and produce visually pleasing results.


	### Usage

	Ref to our GitHub page: [https://github.com/suimuc/VIRES/](https://github.com/suimuc/VIRES/)

	## Citation

	```BibTeX
	@article{vires,
	title={VIRES: Video Instance Repainting via Sketch and Text Guided Generation},
	author={Weng, Shuchen and Zheng, Haojie and Zhang, Peixuan and Hong, Yuchen and Jiang, Han and Li, Si and Shi, Boxin},
	journal={arXiv preprint arXiv:2411.16199},
	year={2024}
	}
	```