Zhaoning Yu

ZhaoningYu

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

RESTRAIN: From Spurious Votes to Signals -- Self-Driven RL with Self-Penalization

upvoted a paper 3 months ago

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

upvoted a paper 3 months ago

Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense

Organizations

None yet

ZhaoningYu's models (1)

ZhaoningYu/rl-course-ppo-LunarLander-v2

Reinforcement Learning • Updated Dec 27, 2024 • 11