Quentin Gallouédec PRO

qgallouedec

AI & ML interests

None yet

Recent Activity

updated a dataset 2 days ago

trl-lib/OpenMathReasoning

published a dataset 2 days ago

trl-lib/OpenMathReasoning

liked a dataset 2 days ago

nvidia/OpenMathReasoning

Articles 6

Article

32

Gotchas in Tokenizer Behavior Every Developer Should Know

Article

287

Open R1: Update #3

Papers 4

arxiv:2402.09844

arxiv:2402.03046

arxiv:2208.14928

arxiv:2106.13687

Spaces 3

Run Hello World

Run DuckDB Jobs

Process datasets with DuckDB SQL

Train Memory

Generate memory forecast for ML models

Models 725

qgallouedec/Qwen-2.5-7B-Simple-RL

Text Generation • Updated 21 days ago • 6

qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO

Text Generation • Updated 22 days ago • 2

qgallouedec/DeepSeek-R1-Distill-Qwen-7B-GRPO

qgallouedec/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

qgallouedec/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • Updated Mar 15

qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-packing

Image-Text-to-Text • Updated Mar 14 • 2

qgallouedec/gemma-3-12b-it-codeforces-SFT

Image-Text-to-Text • Updated Mar 14 • 43 • 5

qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-no-packing

Image-Text-to-Text • Updated Mar 14 • 3

qgallouedec/gemma-3-4b-it-codeforces-SFT

Image-Text-to-Text • Updated Mar 13 • 48 • 3

qgallouedec/gemma-3-27b-it-codeforces-SFT

Image-Text-to-Text • Updated Mar 13 • 14 • 4

Datasets 67

qgallouedec/trl-metrics

Viewer • Updated 3 days ago • 108k • 685 • 1

qgallouedec/prm800k

Viewer • Updated Dec 17, 2024 • 41.2k • 42 • 3

qgallouedec/ultrafeedback-prompt

Viewer • Updated Sep 9, 2024 • 60.9k • 23

qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness

Viewer • Updated Sep 9, 2024 • 16.6k • 28

qgallouedec/lm-human-preferences-descriptiveness

Viewer • Updated Sep 9, 2024 • 6.26k • 22

qgallouedec/lm-human-preferences-sentiment

Viewer • Updated Sep 9, 2024 • 6.26k • 25

qgallouedec/tldr-preference

Viewer • Updated Sep 9, 2024 • 179k • 27

qgallouedec/tldr

Viewer • Updated Sep 9, 2024 • 130k • 27

qgallouedec/hh-rlhf-helpful-base

Viewer • Updated Sep 5, 2024 • 46.2k • 16

qgallouedec/hh-rlhf-helpful-base-trl-style

Viewer • Updated Sep 5, 2024 • 46.2k • 40