luanafelbarros
's Collections
Medical VLMs
updated
Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in
Vision-Language Models
Paper
•
2503.13939
•
Published
•
5
Med-Flamingo: a Multimodal Medical Few-shot Learner
Paper
•
2307.15189
•
Published
•
23
MedFuzz: Exploring the Robustness of Large Language Models in Medical
Question Answering
Paper
•
2406.06573
•
Published
•
11
BenchX: A Unified Benchmark Framework for Medical Vision-Language
Pretraining on Chest X-Rays
Paper
•
2410.21969
•
Published
•
10
GEMeX-ThinkVG: Towards Thinking with Visual Grounding in Medical VQA via
Reinforcement Learning
Paper
•
2506.17939
•
Published
•
3
Medical large language models are easily distracted
Paper
•
2504.01201
•
Published
•
3
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via
Knowledge Graphs
Paper
•
2504.00993
•
Published
•
2
Kvasir-VQA-x1: A Multimodal Dataset for Medical Reasoning and Robust
MedVQA in Gastrointestinal Endoscopy
Paper
•
2506.09958
•
Published
•
1
MMedAgent-RL: Optimizing Multi-Agent Collaboration for Multimodal
Medical Reasoning
Paper
•
2506.00555
•
Published
•
1
GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark
for Chest X-ray Diagnosis
Paper
•
2411.16778
•
Published
•
1
PathVQA: 30000+ Questions for Medical Visual Question Answering
Paper
•
2003.10286
•
Published
Overcoming Data Limitation in Medical Visual Question Answering
Paper
•
1909.11867
•
Published
RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language
Models
Paper
•
2407.05131
•
Published
•
28
Hierarchical Modeling for Medical Visual Question Answering with
Cross-Attention Fusion
Paper
•
2504.03135
•
Published
•
1
OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for
Medical LVLM
Paper
•
2402.09181
•
Published
•
1
Gemini Goes to Med School: Exploring the Capabilities of Multimodal
Large Language Models on Medical Challenge Problems & Hallucinations
Paper
•
2402.07023
•
Published
•
4
MedGemma Technical Report
Paper
•
2507.05201
•
Published
•
14
A Survey of Medical Vision-and-Language Applications and Their
Techniques
Paper
•
2411.12195
•
Published