Harshit Gupta's picture

15 45

Harshit Gupta

hrgupta

·

hrgupta

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

The Future of Open Human Feedback

liked a model 6 days ago

ubergarm/Qwen3-30B-A3B-GGUF

liked a model 6 days ago

Qwen/Qwen2.5-VL-7B-Instruct

View all activity

Organizations

hrgupta's activity

upvoted a paper 6 days ago

The Future of Open Human Feedback

Paper • 2408.16961 • Published Aug 15, 2024 • 22

upvoted a collection 6 days ago

Qwen3

27 items • Updated 5 days ago • 538

upvoted a collection about 2 months ago

FLUX.1

A collection of our FLUX.1 models and LoRAs. • 8 items • Updated 23 days ago • 73

upvoted 2 collections 5 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 2 days ago • 257

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated 7 days ago • 69

upvoted a paper 7 months ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 52

upvoted 5 collections 7 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 7 days ago • 303

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated 9 days ago • 310

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 15 items • Updated 19 days ago • 228

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 9 days ago • 607

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 601

upvoted 3 collections 9 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 7 days ago • 566

Pythia Scaling Suite

Pythia is the first LLM suite designed specifically to enable scientific research on LLMs. To learn more see https://github.com/EleutherAI/pythia • 18 items • Updated Feb 26 • 29

Cohere Labs Aya 23

Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 3 items • Updated 22 days ago • 55