Xiangxin Zhou

zhouxiangxin

https://zhouxiangxin1998.github.io/

Recent Activity

authored a paper 21 days ago

Rethinking the Trust Region in LLM Reinforcement Learning

Paper • 2602.04879 • Published 21 days ago • 34

upvoted a paper 21 days ago

Rethinking the Trust Region in LLM Reinforcement Learning

Paper • 2602.04879 • Published 21 days ago • 34

liked a model 3 months ago

GSAI-ML/LLaDA-8B-Base

Text Generation • Updated Oct 21, 2025 • 147k • 90

upvoted a paper 4 months ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 129

authored a paper 4 months ago

Defeating the Training-Inference Mismatch via FP16

Paper • 2510.26788 • Published Oct 30, 2025 • 31

upvoted a paper 4 months ago

Defeating the Training-Inference Mismatch via FP16

Paper • 2510.26788 • Published Oct 30, 2025 • 31

updated a dataset 4 months ago

zhouxiangxin/data_to_zichen

Viewer • Updated Oct 30, 2025 • 1 • 17

published a dataset 4 months ago

zhouxiangxin/data_to_zichen

Viewer • Updated Oct 30, 2025 • 1 • 17

authored a paper 5 months ago

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1, 2025 • 90

upvoted 3 papers 5 months ago

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1, 2025 • 90

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 146

Diagnose, Localize, Align: A Full-Stack Framework for Reliable LLM Multi-Agent Systems under Instruction Conflicts

Paper • 2509.23188 • Published Sep 27, 2025 • 3

authored a paper 5 months ago

Variational Reasoning for Language Models

Paper • 2509.22637 • Published Sep 26, 2025 • 69

upvoted a paper 5 months ago

Variational Reasoning for Language Models

Paper • 2509.22637 • Published Sep 26, 2025 • 69

updated a dataset 5 months ago

zhouxiangxin/TACO_subset

Viewer • Updated Sep 28, 2025 • 4.24k • 5

published a dataset 5 months ago

zhouxiangxin/TACO_subset

Viewer • Updated Sep 28, 2025 • 4.24k • 5

updated a dataset 5 months ago

zhouxiangxin/apps

Viewer • Updated Sep 28, 2025 • 5k • 5

published a dataset 5 months ago

zhouxiangxin/apps

Viewer • Updated Sep 28, 2025 • 5k • 5

updated a dataset 5 months ago

zhouxiangxin/numina_all_subsets_formatted

Viewer • Updated Sep 28, 2025 • 39k • 6

published a dataset 5 months ago

zhouxiangxin/numina_all_subsets_formatted

Viewer • Updated Sep 28, 2025 • 39k • 6
