Safetensors
English
sasasa2va_chat
custom_code

SaSaSa2VA: Segmentation Augmented and Selective Averaged Sa2VA

[πŸ“œ arXiv] [πŸ§‘β€πŸ’» GitHub] [πŸ€— HuggingFace] [🎯 Challenge]

Quanzhu Niu1* Β· Dengxian Gong1* Β· Shihao Chen1* Β· Tao Zhang1* Β· Yikang Zhou1 Β· Haobo Yuan2 Β· Lu Qi1 Β· Xiangtai Li3 Β· Shunping Ji1†

1WHU    2UC Merced    3NTU

*equal contribution †corresponding author

πŸŽ‰ 1st Place in ICCV 2025 LSVOS Challenge RVOS Track! πŸŽ‰

We win 1st place in ICCV 2025 LSVOS (Large-scale Video Object Segmentation) challenge RVOS (Referring Video Object Segmentation) track. The top 3 teams' methods are all based on Sa2VA. The challenge leaderborad:

Method/Team Name J&F Report
πŸ… SaSaSa2VA (Ours) 67.45 πŸ“ link
πŸ₯ˆ Ranhong 64.65 πŸ“ link
πŸ₯‰ Sa2VA-i 64.14 πŸ“ link

Model Zoo

We provide the following models:

Model Name Base MLLM HF Link
SaSaSa2VA-4B InternVL2.5-4B πŸ€— link
SaSaSa2VA-14B InternVL3.5-14B To be released
SaSaSa2VA-26B InternVL2.5-26B πŸ€— link

Citation

If you find our work useful, please consider referring to the challenge report:

@article{sasasa2va,
  title={The 1st Solution for 7th LSVOS RVOS Track: {SaSaSa2VA}},
  author={Niu, Quanzhu and Gong, Dengxian and Chen, Shihao and Zhang, Tao and Zhou, Yikang and Yuan, Haobo and Qi, Lu and Li, Xiangtai and Ji, Shunping},
  journal={arXiv preprint arXiv:2509.16972},
  year={2025}
}
Downloads last month
14
Safetensors
Model size
25.8B params
Tensor type
F32
Β·
BF16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Datasets used to train QuanzhuNiu/SaSaSa2VA-26B

Collection including QuanzhuNiu/SaSaSa2VA-26B