Hugging Face
Xiangxin Zhou
zhouxiangxin
6 followers · 12 following
https://zhouxiangxin1998.github.io/
AI & ML interests
None yet
Recent Activity
authored a paper, 6 days ago: Rethinking the Divergence Regularization in LLM RL
authored a paper, 6 days ago: Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
authored a paper, 6 days ago: Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning
zhouxiangxin's models (21)
zhouxiangxin/Variational-Posterior-PA-7B • Text Generation • 8B • Updated Sep 28, 2025 • 1
zhouxiangxin/Variational-Posterior-PB-7B • Text Generation • 8B • Updated Sep 28, 2025 • 3
zhouxiangxin/Variational-Posterior-PA-32B • Text Generation • 33B • Updated Sep 28, 2025
zhouxiangxin/Variational-Posterior-PB-4B • Text Generation • 4B • Updated Sep 28, 2025 • 3
zhouxiangxin/Variational-Posterior-PB-8B • Text Generation • 8B • Updated Sep 28, 2025
zhouxiangxin/Initial-Reasoning-32B • Text Generation • 33B • Updated Sep 28, 2025 • 1
zhouxiangxin/Initial-Reasoning-7B • Text Generation • 8B • Updated Sep 28, 2025 • 2
zhouxiangxin/Variational-Reasoning-32B-Acc • Text Generation • 33B • Updated Sep 28, 2025 • 3
zhouxiangxin/Initial-Reasoning-4B • Text Generation • 4B • Updated Sep 28, 2025 • 2
zhouxiangxin/Variational-Reasoning-PA-7B-Acc • Text Generation • 8B • Updated Sep 28, 2025 • 4
zhouxiangxin/Initial-Reasoning-8B • Text Generation • 8B • Updated Sep 28, 2025
zhouxiangxin/Variational-Reasoning-PB-7B-Acc • Text Generation • 8B • Updated Sep 28, 2025 • 2
zhouxiangxin/Variational-Reasoning-PA-7B-GML • Text Generation • 8B • Updated Sep 28, 2025 • 3
zhouxiangxin/Variational-Reasoning-PB-7B-GML • Text Generation • 8B • Updated Sep 28, 2025 • 2
zhouxiangxin/Variational-Reasoning-32B-GML • Text Generation • 33B • Updated Sep 28, 2025 • 2
zhouxiangxin/Variational-Reasoning-8B-GML • Text Generation • 8B • Updated Sep 28, 2025 • 5
zhouxiangxin/Variational-Reasoning-4B-Acc • Text Generation • 4B • Updated Sep 28, 2025 • 2
zhouxiangxin/Variational-Reasoning-8B-Acc • Text Generation • 8B • Updated Sep 28, 2025 • 3
zhouxiangxin/Variational-Reasoning-4B-GML • Text Generation • 4B • Updated Sep 28, 2025 • 2
zhouxiangxin/Qwen3-4B-Base-VeriFree • Text Generation • 4B • Updated May 29, 2025 • 5 • 1
zhouxiangxin/Qwen3-8B-Base-VeriFree • Text Generation • 8B • Updated May 29, 2025 • 6 • 1