view article Article OpenAI just dropped two massive open-weight models — *but how do we separate the reality from the hype?* By stefanwebb and 2 others • Aug 9 • 11
view article Article Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment By NormalUhr • Feb 11 • 65