YANG SHU
babytreecc
AI & ML interests
None yet
Recent Activity
liked
a dataset
about 14 hours ago
javirandor/hh-rlhf-safety-v3-dpo
upvoted
a
paper
27 days ago
Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model
Reasoning
authored
a paper
3 months ago
When Thinking Backfires: Mechanistic Insights Into Reasoning-Induced
Misalignment