Xiaoyu Tan
WIlliam1900
AI & ML interests
None yet
Recent Activity
authored a paper 11 minutes ago: Training-Free Group Relative Policy Optimization
authored a paper 11 minutes ago: RoRecomp: Enhancing Reasoning Efficiency via Rollout Response Recomposition in Reinforcement Learning
authored a paper 11 minutes ago: SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents