Bowen

PeterJinGo

AI & ML interests

None yet

Recent Activity

updated a model about 5 hours ago

Cell-O1/cello1_qwen7bit_sft_4epoch

published a model about 5 hours ago

Cell-O1/cello1_qwen7bit_sft_4epoch

upvoted a paper 4 days ago

RM-R1: Reward Modeling as Reasoning

View all activity

Organizations

Collections 2

Papers 6

models 34

datasets 13

PeterJinGo/wiki-18-e5-index-HNSW64

Updated Apr 4 • 180

PeterJinGo/wiki-18-bm25-index

Updated Apr 4 • 131

PeterJinGo/nq_hotpotqa_train

Viewer • Updated Mar 13 • 221k • 423 • 2

PeterJinGo/wiki-18-e5-index

Updated Feb 26 • 2.37k

PeterJinGo/wiki-18-corpus

Updated Feb 26 • 1.76k

PeterJinGo/ultrafeedback_first_5000

Viewer • Updated Jan 15 • 5k • 8

PeterJinGo/gsm8k-chat

Viewer • Updated Jan 12 • 7.47k • 18

PeterJinGo/math-zeroshot-chat

Viewer • Updated Dec 23, 2024 • 7.5k • 18

PeterJinGo/math-zeroshot

Viewer • Updated Dec 20, 2024 • 7.5k • 22

PeterJinGo/math2

Viewer • Updated Dec 9, 2024 • 7.5k • 18

Bowen

AI & ML interests

Recent Activity

Organizations

Collections 2

PeterJinGo/R1-nq_hotpotqa_train-qwen2.5-3b-em-ppo-v0.2

PeterJinGo/R1-nq_hotpotqa_train-qwen2.5-3b-it-em-ppo-v0.2

PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-em-ppo-v0.2

PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-ppo-v0.2

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

PeterJinGo/SearchR1-nq_hotpotqa_train-llama3.2-3b-em-ppo

PeterJinGo/SearchR1-nq_hotpotqa_train-llama3.2-3b-em-grpo

PeterJinGo/SearchR1-nq_hotpotqa_train-llama3.2-3b-it-em-ppo

Papers 6

models 34

PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-14b-it-em-grpo-v0.3

PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-14b-em-grpo-v0.3

PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-14b-em-ppo-v0.3

PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-14b-em-ppo-v0.2

PeterJinGo/SearchR1-nq_hotpotqa_train-qwen2.5-14b-it-em-ppo-v0.2

PeterJinGo/R1-nq_hotpotqa_train-qwen2.5-3b-em-ppo-v0.2

PeterJinGo/R1-nq_hotpotqa_train-qwen2.5-3b-it-em-ppo-v0.2

PeterJinGo/R1-nq_hotpotqa_train-qwen2.5-14b-em-ppo-v0.2

PeterJinGo/R1-nq_hotpotqa_train-qwen2.5-14b-it-em-ppo-v0.2

PeterJinGo/R1-nq_hotpotqa_train-qwen2.5-7b-it-em-ppo-v0.2

datasets 13

PeterJinGo/wiki-18-e5-index-HNSW64

PeterJinGo/wiki-18-bm25-index

PeterJinGo/nq_hotpotqa_train

PeterJinGo/wiki-18-e5-index

PeterJinGo/wiki-18-corpus

PeterJinGo/ultrafeedback_first_5000

PeterJinGo/gsm8k-chat

PeterJinGo/math-zeroshot-chat

PeterJinGo/math-zeroshot

PeterJinGo/math2

Bowen

AI & ML interests

Recent Activity

Organizations

Collections 2

Papers 6

models 34 Sort: Recently updated

datasets 13 Sort: Recently updated

models 34

datasets 13