YANG SHU

babytreecc

AI & ML interests

None yet

Recent Activity

liked a dataset about 14 hours ago

javirandor/hh-rlhf-safety-v3-dpo

upvoted a paper 27 days ago

Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning

authored a paper 3 months ago

When Thinking Backfires: Mechanistic Insights Into Reasoning-Induced Misalignment

Organizations

Collections 1

Papers 1

arxiv:2509.00544

models 14

datasets 18

babytreecc/DeliberationBank

Viewer • Updated Oct 1 • 10 • 26

babytreecc/Implicit-suicide-detection

Viewer • Updated Aug 3 • 1.61k • 15

babytreecc/mllm-self-fullfilling

Viewer • Updated Jul 17 • 743 • 17

babytreecc/0101_deepseek-r1-distill-DeepSeek-R1-Distill-Qwen-1.5B

Viewer • Updated Feb 3 • 256 • 34

babytreecc/0052_deepseek-r1-distill-DeepSeek-R1-Distill-Qwen-1.5B

Viewer • Updated Feb 3 • 312 • 33

babytreecc/0047_deepseek-r1-distill-DeepSeek-R1-Distill-Qwen-1.5B

Viewer • Updated Feb 3 • 56 • 28

babytreecc/2213_deepseek-r1-distill-DeepSeek-R1-Distill-Qwen-1.5B

Viewer • Updated Feb 3 • 56 • 40

babytreecc/test-deepseek-r1-distill-DeepSeek-R1-Distill-Qwen-1.5B

Viewer • Updated Feb 3 • 8 • 28

babytreecc/test-deepseek-r1-distill-DeepSeek-R1-Distill-Qwen-14B

Viewer • Updated Feb 3 • 8 • 27

babytreecc/test-deepseek-r1-distill-DeepSeek-R1-Distill-Llama-8B

Viewer • Updated Feb 3 • 8 • 40

Collections 1

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

models 14

babytreecc/DeliberationJudge

babytreecc/qwen2-7b-instruct-mllm-self-fullfilling

babytreecc/qwen2-7b-instruct-amazon-description

babytreecc/vit-base-patch16-224-in21k_lung_cancer

babytreecc/groupdp_tldr_reward_5.0_0.001

babytreecc/rr_tldr_reward_5.0_0.001

babytreecc/dpsgd_tldr_reward_0.5_0.01

babytreecc/dpsgd_tldr_reward_1_0.01

babytreecc/groupdp_tldr_reward_8_0.001

babytreecc/rr_tldr_reward_8_0.01
