arxiv:2602.10693
floyed shen
floyed
AI & ML interests
None yet
Recent Activity
upvoted a paper about 16 hours ago
A Very Big Video Reasoning Suite upvoted a collection about 18 hours ago
Qwen3.5 commented on
a paper
2 days ago
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training