Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
3
Arian Hosseini
arianhosseini
Follow
0 followers
·
4 following
https://arianhosseini.github.io/
arianTBD
arianhosseini
AI & ML interests
large language models, reasoning, planning, systematic generalization
Recent Activity
authored
a paper
22 days ago
Deep Language Networks: Joint Prompt Training of Stacked LLMs using Variational Inference
authored
a paper
22 days ago
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization
authored
a paper
22 days ago
Generative Verifiers: Reward Modeling as Next-Token Prediction
View all activity
Organizations
Papers
10
arxiv:
2508.10142
arxiv:
2505.04842
arxiv:
2504.01005
arxiv:
2410.18252
Expand 10 papers
models
20
Sort: Recently updated
arianhosseini/polIter_qwen2.5_math_1.5B_genppo_MATH_sft_0
Updated
Mar 28
arianhosseini/rebecca-hansen-cadetblue
3B
•
Updated
May 23, 2024
•
3
arianhosseini/mary-snyder-paleturquoise
3B
•
Updated
May 23, 2024
•
3
arianhosseini/jeffrey-pruitt-white
Updated
May 23, 2024
arianhosseini/thomas-garcia-peachpuff
Updated
May 23, 2024
•
3
arianhosseini/lisa-vance-magenta
Updated
May 23, 2024
•
4
arianhosseini/rachel-james-dds-deepskyblue
Updated
May 23, 2024
arianhosseini/courtney-rivera-darkblue
Updated
May 20, 2024
arianhosseini/jeffrey-walker-teal
3B
•
Updated
May 17, 2024
•
3
arianhosseini/patricia-walters-darkmagenta
3B
•
Updated
May 17, 2024
•
4
View 20 models
datasets
36
Sort: Recently updated
arianhosseini/mt_puzzles
Viewer
•
Updated
Aug 20
•
2.5k
•
324
arianhosseini/r1_1.5_dedup_fil_0to34000_thresh1_25333points
Viewer
•
Updated
Mar 22
•
25.3k
•
8
arianhosseini/llama70b_code_256sol_ver32_pickles
Viewer
•
Updated
Mar 21
•
2
•
20
arianhosseini/lcb127_llama70b_256sol
Preview
•
Updated
Mar 19
•
4
arianhosseini/gemma27b_it_math_128_generations
Viewer
•
Updated
Mar 18
•
128
•
5
arianhosseini/gemma27b_it_math_500_generations
Viewer
•
Updated
Mar 18
•
500
•
6
arianhosseini/verified_questions_ind_2308_to_2896
Viewer
•
Updated
Mar 17
•
8.63k
•
6
arianhosseini/lcb_127_llama70b_128sol
Viewer
•
Updated
Mar 16
•
127
•
5
arianhosseini/lcb127_llama70b_64sol_temp0_6
Updated
Mar 14
•
91
arianhosseini/lcb128_llama3-8B-instruct_256samples_ver32_temp0-7
Viewer
•
Updated
Mar 12
•
25.6k
•
4
View 36 datasets