Hi there! Congrats on the great work, really appreciate seeing discussions like these in the open ✨
Just one question: in the long-context extension phase you mention using an extra 100B tokens - where do you source them from? Are they drawn from the same sources as the pretraining data, just with different upsampling weights?
In general, I would really appreciate it if you could point me to some resource/inspiration regarding what data to use for the long-context extension!
Tommaso Bonomo
tommasobonomo