Cyril

cyrilzakka

https://cyrilzakka.github.io

AI & ML interests

Multimodal models for clinical medicine and surgery

Recent Activity

liked a model 11 days ago

facebook/blt-7b

cyrilzakka's activity

upvoted a paper 5 days ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 131

upvoted 2 articles 14 days ago

From Files to Chunks: Improving Hugging Face Storage Efficiency

Article • Nov 20, 2024 • 59

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

Article • Feb 12 • 64

upvoted a paper 21 days ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 147

upvoted a collection 22 days ago

Nomic Embed Multimodal

Collection

Multimodal models allowing you to search over interleaved text, PDFs, charts, and images! • 15 items • Updated 22 days ago • 20

upvoted a paper about 1 month ago

Gemma 3 Technical Report

Paper • 2503.19786 • Published Mar 25 • 50

upvoted an article about 2 months ago

You could have designed state of the art positional encoding

Article • Nov 25, 2024 • 236

upvoted 2 papers 2 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 183

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 143

upvoted an article 2 months ago

SmolVLM2: Bringing Video Understanding to Every Device

Article • Feb 20 • 238

upvoted a paper 3 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 229

upvoted an article 3 months ago

Open-source DeepResearch – Freeing our search agents

Article • Feb 4 • 1.23k

upvoted a paper 3 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 60

upvoted 2 papers 4 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 277

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 365

upvoted 2 papers 5 months ago

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published Dec 5, 2024 • 63

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 135