Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
Zhaoning Yu
ZhaoningYu
Follow
0 followers
·
4 following
AI & ML interests
None yet
Recent Activity
authored
a paper
about 1 month ago
RESTRAIN: From Spurious Votes to Signals -- Self-Driven RL with Self-Penalization
upvoted
a
paper
3 months ago
The Alignment Waltz: Jointly Training Agents to Collaborate for Safety
upvoted
a
paper
3 months ago
Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense
View all activity
Organizations
None yet
ZhaoningYu
's models
1
Sort: Recently updated
ZhaoningYu/rl-course-ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Dec 27, 2024
•
11