Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Hiring 💼
1215
135
113
Quentin Gallouédec
PRO
qgallouedec
Follow
OttoUlbrich's profile picture
wyddmw's profile picture
maurorubens's profile picture
570 followers
·
329 following
QGallouedec
qgallouedec
qgallouedec
qgallouedec.bsky.social
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 4 hours ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
updated
a dataset
2 days ago
hf-doc-build/doc-build
updated
a dataset
2 days ago
hf-doc-build/doc-build-dev
View all activity
Organizations
qgallouedec
's datasets
82
Sort: Recently updated
qgallouedec/deepmath-completions-logs
Viewer
•
Updated
2 days ago
•
232
•
79
•
1
qgallouedec/Dolci-Think-DPO-7B
Viewer
•
Updated
Nov 28, 2025
•
150k
•
5
qgallouedec/biogrid_qa
Viewer
•
Updated
Nov 18, 2025
•
59.4k
•
945
qgallouedec/human_gene_interaction_qa_v2
Viewer
•
Updated
Nov 18, 2025
•
79.2k
•
1
qgallouedec/human_gene_interaction_qa
Viewer
•
Updated
Nov 17, 2025
•
1.84M
•
6
qgallouedec/biogrid
Viewer
•
Updated
Nov 17, 2025
•
2.82M
•
425
qgallouedec/trl-metrics
Viewer
•
Updated
Oct 7, 2025
•
148k
•
1.32k
•
1
qgallouedec/rick
Viewer
•
Updated
Sep 11, 2025
•
1.18k
•
5
qgallouedec/OpenMathReasoning
Viewer
•
Updated
Sep 10, 2025
•
10k
•
37
qgallouedec/math-lvl3to5-8k
Viewer
•
Updated
Aug 22, 2025
•
8.52k
•
23
qgallouedec/svg
Viewer
•
Updated
Aug 2, 2025
•
900
•
52
•
1
qgallouedec/rick-physics-grpo
Viewer
•
Updated
May 22, 2025
•
1.79k
•
17
•
1
qgallouedec/rick-science
Viewer
•
Updated
May 16, 2025
•
1.18k
•
15
•
3
qgallouedec/physics-problems
Viewer
•
Updated
May 10, 2025
•
247
•
21
qgallouedec/rick-teaches-math
Viewer
•
Updated
May 10, 2025
•
6.8k
•
25
qgallouedec/DAPO-Math-17k-Processed-Scored
Viewer
•
Updated
Apr 29, 2025
•
16.4k
•
9
•
3
qgallouedec/prm800k
Viewer
•
Updated
Dec 17, 2024
•
41.2k
•
10
•
3
qgallouedec/ultrafeedback-prompt
Viewer
•
Updated
Sep 9, 2024
•
60.9k
•
8
qgallouedec/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
Sep 9, 2024
•
16.6k
•
13
qgallouedec/lm-human-preferences-descriptiveness
Viewer
•
Updated
Sep 9, 2024
•
6.26k
•
10
qgallouedec/lm-human-preferences-sentiment
Viewer
•
Updated
Sep 9, 2024
•
6.26k
•
2
qgallouedec/tldr-preference
Viewer
•
Updated
Sep 9, 2024
•
179k
•
4
qgallouedec/tldr
Viewer
•
Updated
Sep 9, 2024
•
130k
•
55
qgallouedec/hh-rlhf-helpful-base
Viewer
•
Updated
Sep 5, 2024
•
46.2k
•
18
qgallouedec/hh-rlhf-helpful-base-trl-style
Viewer
•
Updated
Sep 5, 2024
•
46.2k
•
24
qgallouedec/suap_essentials
Viewer
•
Updated
Aug 6, 2024
•
30
•
4
qgallouedec/qa_suap
Viewer
•
Updated
Jul 14, 2024
•
270
•
10
qgallouedec/amber_results
Viewer
•
Updated
Jul 11, 2024
•
30.4k
•
18
qgallouedec/amber
Viewer
•
Updated
Jul 11, 2024
•
15.2k
•
18
qgallouedec/wikipedia_with_images
Viewer
•
Updated
Jun 5, 2024
•
100
•
9
Previous
1
2
3
Next