24 17 1

Ajith V Prabhakar

ajithprabhakar

https://www.ajithp.com

ajithprabhakar

AI & ML interests

NLP, Responsible AI, Generative AI

Recent Activity

upvoted a paper 19 days ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

commented on a paper 24 days ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

commented on a paper about 1 month ago

ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation

View all activity

Organizations

ajithprabhakar's activity

upvoted a paper 19 days ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published 29 days ago • 269

commented a paper 24 days ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published 29 days ago • 269 •

commented 2 papers about 1 month ago

ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation

Paper • 2503.21729 • Published Mar 27 • 28 •

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 111 •

upvoted a paper about 2 months ago

Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published Feb 25 • 48

commented a paper about 2 months ago

Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published Feb 25 • 48 •

commented 4 papers 2 months ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 192 •

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 155 •

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published Feb 10 • 131 •

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 140 •

commented 5 papers 3 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 228 •

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published Jan 26 • 71 •

Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning

Paper • 2501.15228 • Published Jan 25 • 1 •

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 390 •

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 288 •

commented a paper 4 months ago

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 102 •

commented a paper 7 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 139 •

liked a model 8 months ago

mattshumer/Reflection-Llama-3.1-70B

Text Generation • Updated Sep 24, 2024 • 149 • 1.71k

upvoted a paper 8 months ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 59

commented a paper 8 months ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 59 •