Qiyuan Zhang's picture

Qiyuan Zhang PRO

DonJoey

·

AI & ML interests

None yet

Recent Activity

upvoted a collection 10 days ago

updated a collection 10 days ago

updated a collection 10 days ago

View all activity

Organizations

None yet

upvoted a collection 10 days ago

RubricBench

2 items • Updated 10 days ago • 2

updated a collection 10 days ago

RubricBench

2 items • Updated 10 days ago • 2

upvoted a collection 10 days ago

Mix-GRM

We provide a collection about ``Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models'', including data, models, and paper • 5 items • Updated 10 days ago • 1

upvoted a paper 10 days ago

Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models

Paper • 2603.01571 • Published 12 days ago • 33

updated a collection 10 days ago

Mix-GRM

We provide a collection about ``Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models'', including data, models, and paper • 5 items • Updated 10 days ago • 1

submitted a paper to Daily Papers 10 days ago

Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models

Paper • 2603.01571 • Published 12 days ago • 33

authored 2 papers 11 days ago

From Verifiable Dot to Reward Chain: Harnessing Verifiable Reference-based Rewards for Reinforcement Learning of Open-ended Generation

Paper • 2601.18533 • Published Jan 26

RubricBench: Aligning Model-Generated Rubrics with Human Standards

Paper • 2603.01562 • Published 12 days ago • 57

updated a model 11 days ago

DonJoey/mix-grm-qwen3-8b-rl

8B • Updated 11 days ago • 57

upvoted a paper 11 days ago

RubricBench: Aligning Model-Generated Rubrics with Human Standards

Paper • 2603.01562 • Published 12 days ago • 57

submitted a paper to Daily Papers 11 days ago

RubricBench: Aligning Model-Generated Rubrics with Human Standards

Paper • 2603.01562 • Published 12 days ago • 57

published 2 datasets 13 days ago

DonJoey/mix-grm-sft-9k

Viewer • Updated 15 days ago • 8.99k • 7

DonJoey/mix-grm-rl-21k

Viewer • Updated 15 days ago • 21.9k • 4

updated a collection 13 days ago

Mix-GRM

We provide a collection about ``Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models'', including data, models, and paper • 5 items • Updated 10 days ago • 1

updated a model 13 days ago

DonJoey/mix-grm-qwen3-8b-sft

Updated 13 days ago • 19

published a model 13 days ago

DonJoey/mix-grm-qwen3-8b-sft

Updated 13 days ago • 19

updated a collection 13 days ago

Mix-GRM

We provide a collection about ``Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models'', including data, models, and paper • 5 items • Updated 10 days ago • 1

published a model 13 days ago

DonJoey/mix-grm-qwen3-8b-rl

8B • Updated 11 days ago • 57

published a dataset 13 days ago

DonJoey/rubricbench

Viewer • Updated 13 days ago • 1.15k • 137 • 6