Nitish Pandey's picture

1 33 9

Nitish Pandey

nitishpandey04

·

AI & ML interests

LLMs, Translation

Recent Activity

liked a Space 12 days ago

nanotron/ultrascale-playbook

updated a collection 29 days ago

upvoted a paper 29 days ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

View all activity

Organizations

liked a Space 12 days ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

updated a collection 29 days ago

Reading List

30 items • Updated 29 days ago

upvoted a paper 29 days ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7 • 176

updated a collection about 2 months ago

Reading List

30 items • Updated 29 days ago

upvoted 3 papers 2 months ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 625

AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs

Paper • 2507.05687 • Published Jul 8 • 27

SingLoRA: Low Rank Adaptation Using a Single Matrix

Paper • 2507.05566 • Published Jul 8 • 112

liked a dataset 2 months ago

Salesforce/fineweb_deduplicated

Viewer • Updated Feb 3 • 6.43B • 12k • 38

updated a collection 3 months ago

Reading List

30 items • Updated 29 days ago

upvoted a paper 3 months ago

GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models

Paper • 2112.10741 • Published Dec 20, 2021 • 4

updated a collection 3 months ago

Reading List

30 items • Updated 29 days ago

upvoted an article 3 months ago

Article

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

By

•

Jun 26

• 44

updated a collection 3 months ago

Reading List

30 items • Updated 29 days ago