Wei He's picture

8 8 4

Wei He

hewei2001

·

https://hwcoder.top/about

hewei2001

AI & ML interests

LLM

Recent Activity

upvoted a paper about 13 hours ago

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

liked a dataset 2 months ago

meituan-longcat/VitaBench

upvoted a paper 2 months ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

View all activity

Organizations

upvoted a paper about 13 hours ago

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

Paper • 2601.14004 • Published 2 days ago • 42

liked a dataset 2 months ago

meituan-longcat/VitaBench

Updated Nov 5, 2025 • 678 • 13

upvoted a paper 2 months ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 211

upvoted a paper 3 months ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 83

authored a paper 3 months ago

R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

Paper • 2510.08189 • Published Oct 9, 2025 • 26

upvoted a paper 3 months ago

R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

Paper • 2510.08189 • Published Oct 9, 2025 • 26

authored 5 papers 4 months ago

Better Process Supervision with Bi-directional Rewarding Signals

Paper • 2503.04618 • Published Mar 6, 2025

LongCat-Flash Technical Report

Paper • 2509.01322 • Published Sep 1, 2025 • 6

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 56

LongCat-Flash-Thinking Technical Report

Paper • 2509.18883 • Published Sep 23, 2025 • 4

VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

Paper • 2509.26490 • Published Sep 30, 2025 • 20

New activity in meituan-longcat/VitaBench 4 months ago

Update README.md

#4 opened 4 months ago by

Update README.md

#3 opened 4 months ago by

upvoted a paper 4 months ago

VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

Paper • 2509.26490 • Published Sep 30, 2025 • 20

commented a paper 4 months ago

VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

Paper • 2509.26490 • Published Sep 30, 2025 • 20 •

New activity in meituan-longcat/VitaBench 4 months ago

init dataset

#1 opened 4 months ago by

init dataset

#2 opened 4 months ago by

authored 3 papers 5 months ago

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Paper • 2411.16579 • Published Nov 25, 2024 • 3

Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling

Paper • 2411.00750 • Published Nov 1, 2024 • 1

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

Paper • 2402.05808 • Published Feb 8, 2024