Victor Mustar PRO

victor

victormustar

AI & ML interests

Building the UX of this website

Recent Activity

liked a model about 1 hour ago

Qwen/Qwen3-30B-A3B

liked a model about 1 hour ago

mlx-community/Qwen3-235B-A22B-4bit

liked a Space about 2 hours ago

Qwen/Qwen3-Demo

victor's activity

upvoted 2 collections about 14 hours ago

Qwen3

23 items • Updated about 14 hours ago • 367

Qwen3

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 62 items • Updated 40 minutes ago • 97

upvoted a paper 4 days ago

70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float

Paper • 2504.11651 • Published 14 days ago • 28

upvoted an article 4 days ago

Tiny Agents: a MCP-powered agent in 50 lines of code

Article • 4 days ago • 180

upvoted a collection 4 days ago

Unsloth Dynamic 2.0 Quants

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & outperforms all leading quantization methods. • 25 items • Updated about 8 hours ago • 54

upvoted a collection 5 days ago

OpenMathReasoning

Models and datasets from "AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset" • 7 items • Updated 5 days ago • 31

upvoted an article 5 days ago

Cohere on Hugging Face Inference Providers 🔥

Article • 13 days ago • 124

upvoted a paper 5 days ago

An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes

Paper • 2504.15270 • Published 8 days ago • 10

upvoted a collection 6 days ago

Describe Anything

Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 5 days ago • 44

upvoted a paper 6 days ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published 7 days ago • 93

upvoted 2 papers 7 days ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 11 days ago • 114

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published 8 days ago • 77

upvoted a paper 8 days ago

Pushing the Limits of Large Language Model Quantization via the Linearity Theorem

Paper • 2411.17525 • Published Nov 26, 2024 • 4

upvoted a paper 12 days ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published 14 days ago • 59

upvoted 2 papers 13 days ago

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published 15 days ago • 84

RealHarm: A Collection of Real-World Language Model Application Failures

Paper • 2504.10277 • Published 15 days ago • 11

upvoted a paper 14 days ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published 22 days ago • 126

upvoted a collection 15 days ago

GLM-4-0414

GLM-4-0414 series model • 8 items • Updated 14 days ago • 118

upvoted a paper 15 days ago

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published 18 days ago • 122