Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
jongwoopark7978
/
LVNet
like
3
computer-vision
video-question-answering
zero-shot
9 pages workshop at neurips2024
arxiv:
2406.09396
arxiv:
2210.09996
License:
apache-2.0
Model card
Files
Files and versions
Community
54216bc
LVNet
/
requirements.txt
jongwoopark7978
chore: add project files
54216bc
about 2 months ago
raw
Copy download link
history
blame
Safe
179 Bytes
torch==2.1.2
torchvision==0.16.2
transformers==4.37.2
tokenizers==0.15.1
sentencepiece==0.1.99
shortuuid
peft
numpy
requests
uvicorn
fastapi
rp
tqdm
shutil
PIL
natsort
clip