Summarization of Multimodal Presentations Collection Ressources related to summarization of multimodal presentations. • 6 items • Updated 4 days ago
Perception Encoder: The best visual embeddings are not at the output of the network Paper • 2504.13181 • Published 12 days ago • 31
BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing Paper • 2206.15076 • Published Jun 30, 2022 • 4
Summarization of Multimodal Presentations with Vision-Language Models: Study of the Effect of Modalities and Structure Paper • 2504.10049 • Published 15 days ago • 3
Summarization of Multimodal Presentations Collection Ressources related to summarization of multimodal presentations. • 6 items • Updated 4 days ago
Summarization of Multimodal Presentations with Vision-Language Models: Study of the Effect of Modalities and Structure Paper • 2504.10049 • Published 15 days ago • 3
Summarization of Multimodal Presentations with Vision-Language Models: Study of the Effect of Modalities and Structure Paper • 2504.10049 • Published 15 days ago • 3 • 2
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM Mar 12 • 403
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published Mar 7 • 78
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated 20 days ago • 584k • 1.33k