Hugging Face
Shizhe Diao
shizhediao2
18 followers · 13 following
https://shizhediao.github.io/
shizhediao
AI & ML interests
LLM pre-training and reasoning
Recent Activity
Upvoted a paper 24 days ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Liked a model 26 days ago
nvidia/Nemotron-Flash-1B
Updated a dataset about 2 months ago
nvidia/ToolScale
shizhediao2's models (3)
shizhediao2/ToolOrchestrator-8B
Updated Oct 15, 2025 • 2
shizhediao2/Llama-Nemotron-8B-v1-Prorl
Updated Aug 25, 2025
shizhediao2/Nemotron-Research-Reasoning-Qwen-1.5B
Updated May 14, 2025