Daniel van Strien's picture

Daniel van Strien PRO

davanstrien

·

https://danielvanstrien.xyz/

AI & ML interests

Machine Learning Librarian

Recent Activity

updated a dataset about 1 hour ago

librarian-bots/dataset_cards_with_metadata

updated a dataset about 5 hours ago

data-is-better-together/fineweb-c-progress

updated a dataset about 5 hours ago

librarian-bots/model_cards_with_metadata

View all activity

Organizations

davanstrien's activity

upvoted an article about 6 hours ago

Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

9 days ago

• 22

upvoted a collection 8 days ago

Qwen3

27 items • Updated 5 days ago • 536

upvoted a paper 13 days ago

Organize the Web: Constructing Domains Enhances Pre-Training Data Curation

Paper • 2502.10341 • Published Feb 14 • 2

upvoted a paper 15 days ago

Aioli: A Unified Optimization Framework for Language Model Data Mixing

Paper • 2411.05735 • Published Nov 8, 2024 • 1

upvoted a collection 19 days ago

Cell2Sentence Models

Cell2Sentence models trained for single-cell tasks • 5 items • Updated 21 days ago • 7

upvoted a collection 20 days ago

blt

4 items • Updated 20 days ago • 17

upvoted a paper 20 days ago

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published Mar 10 • 61

upvoted a collection 21 days ago

🏜️MIRAGE-Bench [NAACL'25]

Dataset Collection from the MIRAGE-Bench paper • 13 items • Updated Mar 31 • 2

upvoted a paper 21 days ago

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Paper • 2504.11456 • Published 22 days ago • 12

upvoted a collection 22 days ago

DataDecide

A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 358 items • Updated 7 days ago • 13

upvoted a collection 23 days ago

Apriel

ServiceNow Language Modeling Lab's first model family series • 3 items • Updated about 21 hours ago • 8

upvoted 5 collections 26 days ago

RADIO

A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 13 items • Updated 2 days ago • 17

kl3m

KL3M models and tokenizers • 13 items • Updated Feb 1 • 2

kl3m-data

25 items • Updated 26 days ago • 3

kl3m-index

KL3M Dataset Indices • 7 items • Updated Mar 26 • 1

KL3M Embeddings

7 items • Updated Mar 17 • 1