Runze Liu's picture

5 20 4

Runze Liu

RyanLiu112

·

https://ryanliu112.github.io

AI & ML interests

LLM, RL

Recent Activity

upvoted an article 7 days ago

Deriving the PPO Loss from First Principles

upvoted a paper 10 days ago

Step-DeepResearch Technical Report

upvoted a collection 11 days ago

Physics of Language Models: Part 4.2

View all activity

Organizations

commented 2 papers 3 months ago

ASPO: Asymmetric Importance Sampling Policy Optimization

Paper • 2510.06062 • Published Oct 7, 2025 • 13 •

Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models

Paper • 2509.26628 • Published Sep 30, 2025 • 16 •

commented a paper 6 months ago

Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR

Paper • 2507.15778 • Published Jul 21, 2025 • 20 •

commented a paper 7 months ago

Scaling Image and Video Generation via Test-Time Evolutionary Search

Paper • 2505.17618 • Published May 23, 2025 • 41 •

commented a paper 9 months ago

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Paper • 2504.00891 • Published Apr 1, 2025 • 14 •