2140.8 TFLOPS
Quentin Gallouédec
PRO
qgallouedec
246 followers · 84 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
Recent Activity
Updated a dataset 2 days ago: trl-lib/OpenMathReasoning
Published a dataset 2 days ago: trl-lib/OpenMathReasoning
Liked a dataset 2 days ago: nvidia/OpenMathReasoning
Organizations
Articles
6
Article • 32 • Gotchas in Tokenizer Behavior Every Developer Should Know
Article • 287 • Open R1: Update #3
Papers
4
arxiv:2402.09844
arxiv:2402.03046
arxiv:2208.14928
arxiv:2106.13687
spaces
3
Runtime error • 1 • Run Hello World 👀
Runtime error • Run DuckDB Jobs 🦆 • Process datasets with DuckDB SQL
Running • 12 • Train Memory 📈 • Generate memory forecast for ML models
models
725
qgallouedec/Qwen-2.5-7B-Simple-RL • Text Generation • Updated 21 days ago • 6
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • Updated 22 days ago • 2
qgallouedec/DeepSeek-R1-Distill-Qwen-7B-GRPO • Updated Mar 26
qgallouedec/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • Updated Mar 24
qgallouedec/Qwen2.5-1.5B-Open-R1-Distill • Text Generation • Updated Mar 15
qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-packing • Image-Text-to-Text • Updated Mar 14 • 2
qgallouedec/gemma-3-12b-it-codeforces-SFT • Image-Text-to-Text • Updated Mar 14 • 43 • 5
qgallouedec/gemma-3-12b-it-codeforces-SFT-eager-no-packing • Image-Text-to-Text • Updated Mar 14 • 3
qgallouedec/gemma-3-4b-it-codeforces-SFT • Image-Text-to-Text • Updated Mar 13 • 48 • 3
qgallouedec/gemma-3-27b-it-codeforces-SFT • Image-Text-to-Text • Updated Mar 13 • 14 • 4
datasets
67
qgallouedec/trl-metrics • Viewer • Updated 3 days ago • 108k • 685 • 1
qgallouedec/prm800k • Viewer • Updated Dec 17, 2024 • 41.2k • 42 • 3
qgallouedec/ultrafeedback-prompt • Viewer • Updated Sep 9, 2024 • 60.9k • 23
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness • Viewer • Updated Sep 9, 2024 • 16.6k • 28
qgallouedec/lm-human-preferences-descriptiveness • Viewer • Updated Sep 9, 2024 • 6.26k • 22
qgallouedec/lm-human-preferences-sentiment • Viewer • Updated Sep 9, 2024 • 6.26k • 25
qgallouedec/tldr-preference • Viewer • Updated Sep 9, 2024 • 179k • 27
qgallouedec/tldr • Viewer • Updated Sep 9, 2024 • 130k • 27
qgallouedec/hh-rlhf-helpful-base • Viewer • Updated Sep 5, 2024 • 46.2k • 16
qgallouedec/hh-rlhf-helpful-base-trl-style • Viewer • Updated Sep 5, 2024 • 46.2k • 40