The Smol Training Playbook 📚 • The secrets to building world-class LLMs • 2.68k
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21 • 259
Pre-Trained Policy Discriminators are General Reward Models Paper • 2507.05197 • Published Jul 7 • 39
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published Apr 14 • 306
Article • Topic 33: Slim Attention, KArAt, XAttention and Multi-Token Attention Explained – What’s Really Changing in Transformers? • Published Apr 4 • 15
Article • DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge • Published Feb 7 • 261
Article • Introducing multi-backend (TRT-LLM, vLLM) support for Text Generation Inference • Published Jan 16 • 76