Devin Thang's picture

2 17 1

Devin Thang

winvswon78

·

devininthelab

AI & ML interests

None yet

Recent Activity

upvoted an article 9 days ago

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

updated a model 12 days ago

winvswon78/Qwen2.5-Math-1.5B-GRPO

published a model 12 days ago

winvswon78/Qwen2.5-Math-1.5B-GRPO

View all activity

Organizations

winvswon78 's models 4

winvswon78/Qwen2.5-Math-1.5B-GRPO

Updated 12 days ago

winvswon78/Qwen2-0.5B-GRPO-test

Updated 12 days ago

winvswon78/gpt2_viet_poem_generation

Text Generation • 0.1B • Updated Mar 10, 2024 • 4

winvswon78/distilbert-finetuned-squadv2

Question Answering • 0.1B • Updated Feb 26, 2024 • 4