Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
3
Arian Hosseini
arianhosseini
Follow
0 followers
·
4 following
https://arianhosseini.github.io/
arianTBD
arianhosseini
AI & ML interests
large language models, reasoning, planning, systematic generalization
Recent Activity
authored
a paper
24 days ago
Deep Language Networks: Joint Prompt Training of Stacked LLMs using Variational Inference
authored
a paper
24 days ago
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization
authored
a paper
24 days ago
Generative Verifiers: Reward Modeling as Next-Token Prediction
View all activity
Organizations
arianhosseini
's models
20
Sort: Recently updated
arianhosseini/polIter_qwen2.5_math_1.5B_genppo_MATH_sft_0
Updated
Mar 28
arianhosseini/rebecca-hansen-cadetblue
3B
•
Updated
May 23, 2024
•
2
arianhosseini/mary-snyder-paleturquoise
3B
•
Updated
May 23, 2024
•
2
arianhosseini/jeffrey-pruitt-white
Updated
May 23, 2024
arianhosseini/thomas-garcia-peachpuff
Updated
May 23, 2024
•
2
arianhosseini/lisa-vance-magenta
Updated
May 23, 2024
•
3
arianhosseini/rachel-james-dds-deepskyblue
Updated
May 23, 2024
arianhosseini/courtney-rivera-darkblue
Updated
May 20, 2024
arianhosseini/jeffrey-walker-teal
3B
•
Updated
May 17, 2024
•
3
arianhosseini/patricia-walters-darkmagenta
3B
•
Updated
May 17, 2024
•
3
arianhosseini/patricia-johnson-yellow
Updated
May 17, 2024
arianhosseini/jessica-mitchell-darkcyan
Updated
May 16, 2024
arianhosseini/sample_ver
Updated
Apr 18, 2024
arianhosseini/ver_base
Text Generation
•
7B
•
Updated
Apr 18, 2024
•
2
arianhosseini/zephyr-7b-dpo-qlora
Updated
Apr 18, 2024
arianhosseini/sample_gen
Updated
Apr 17, 2024
arianhosseini/pythia410m-tldr-dpo-1b-relbl-10k
Updated
Nov 20, 2023
arianhosseini/pythia410m-tldr-dpo-1b-relbl
Updated
Nov 20, 2023
arianhosseini/pythia410m-tldr-dpo-1b-relbl_10k
Updated
Nov 20, 2023
arianhosseini/llama-2-7b-gsm8k-lora
Updated
Nov 1, 2023