CircleGuardBench: New Standard for Evaluating AI Moderation Models
By
and 7 others
β’
β’
47I trained a Language Model to schedule events with GRPO!
By
β’
β’
60Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios
By
and 3 others
β’
β’
18π¦Έπ»#14: What Is MCP, and Why Is Everyone β Suddenly!β Talking About It?
By
β’
β’
237Page-to-Video: Generate videos from webpages πͺπ¬
By
β’
β’
14Uncensor any LLM with abliteration
By
β’
β’
548Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs
By
and 1 other
β’
β’
12Creating your custom Ghibli Text-to-Image model
By
and 3 others
β’
β’
15AI Personas: The Impact of Design Choices
By
and 1 other
β’
β’
10Reduce, Reuse, Recycle: Why Open Source is a Win for Sustainability
By
and 1 other
β’
β’
10DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
By
β’
β’
129DeepWiki: Best AI Documentation Generator for Any Github Repo
By
β’
β’
16π¦Έπ»#17: What is A2A and why is it β still! β underappreciated?
By
β’
β’
6ColPali: Efficient Document Retrieval with Vision Language Models π
By
β’
β’
246KV Caching Explained: Optimizing Transformer Inference Efficiency
By
β’
β’
63What is test-time compute and how to scale it?
By
and 1 other
β’
β’
85Introduction to State Space Models (SSM)
By
β’
β’
128Code a simple RAG from scratch
By
β’
β’
66Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment
By
β’
β’
28What is The Agent2Agent Protocol (A2A) and Why You Must Learn It Now
By
β’
β’
17