Summarization of Multimodal Presentations
Resources related to summarization of multimodal presentations.
Summarization of Multimodal Presentations with Vision-Language Models: Study of the Effect of Modalities and Structure • Paper • 2504.10049 • Published Apr 14 • 3
Slide Presentation Viz 📊 • Tool to visualize presentations as transcription + slides.
gigant/tib-bench • Viewer • Updated May 20 • 822 • 112
gigant/tib • Viewer • Updated Jul 18, 2024 • 9.1k • 91 • 1
Romanian speech recognition
gigant/whisper-medium-romanian • Automatic Speech Recognition • 0.8B • Updated Sep 13, 2023 • 1.29k • 15
Romanian Whisper Demo 🤫
gigant/romanian-wav2vec2 • Automatic Speech Recognition • 0.3B • Updated Sep 13, 2023 • 189k • 6