Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
17
1
Devin Thang
winvswon78
Follow
Bujurocks's profile picture
pufanyi's profile picture
2 followers
·
50 following
devininthelab
AI & ML interests
None yet
Recent Activity
upvoted
an
article
9 days ago
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment
updated
a model
12 days ago
winvswon78/Qwen2.5-Math-1.5B-GRPO
published
a model
12 days ago
winvswon78/Qwen2.5-Math-1.5B-GRPO
View all activity
Organizations
winvswon78
's models
4
Sort: Recently updated
winvswon78/Qwen2.5-Math-1.5B-GRPO
Updated
12 days ago
winvswon78/Qwen2-0.5B-GRPO-test
Updated
12 days ago
winvswon78/gpt2_viet_poem_generation
Text Generation
•
0.1B
•
Updated
Mar 10, 2024
•
4
winvswon78/distilbert-finetuned-squadv2
Question Answering
•
0.1B
•
Updated
Feb 26, 2024
•
4