Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips By baidu • 6 days ago • 8
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 225
🌎 What kind of environmental impacts are AI companies disclosing? (And can we compare them?) 🌎 By sasha and 1 other • 12 days ago • 12
Ground-up efforts to build large datasets for effective and accurate translation of Modi-Script documents into modern Marathi By Arunbiz and 1 other • 4 days ago • 5
Preserving Agency: Why AI Safety Needs Community, Not Corporate Control By giadap • about 16 hours ago • 5
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • Feb 11 • 70
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment By NormalUhr • Feb 11 • 71
SyGra: The One-Stop Framework for Building Data for LLMs and SLMs By ServiceNow-AI and 3 others • 8 days ago • 9