Jayden Campbell

ZHYI40

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

tencent/Hy-MT2-1.8B

liked a dataset 8 days ago

gretelai/synthetic_text_to_sql

upvoted a paper 9 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Organizations

None yet

upvoted a paper 9 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 19 days ago • 195

upvoted a paper 10 days ago

Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR

Paper • 2605.20164 • Published 12 days ago • 6

upvoted a paper 13 days ago

An Empirical Study of Automating Agent Evaluation

Paper • 2605.11378 • Published 19 days ago • 3

upvoted a paper 16 days ago

When No Benchmark Exists: Validating Comparative LLM Safety Scoring Without Ground-Truth Labels

Paper • 2605.06652 • Published 24 days ago • 5

upvoted 6 papers about 2 months ago

In-Place Test-Time Training

Paper • 2604.06169 • Published Apr 7 • 30

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 503

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 263

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 326

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 630

Scaling Teams or Scaling Time? Memory Enabled Lifelong Learning in LLM Multi-Agent Systems

Paper • 2604.03295 • Published Mar 27 • 10

upvoted 2 papers 2 months ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 311

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

Paper • 2603.15612 • Published Mar 16 • 153

upvoted 3 papers 3 months ago

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 211

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 197

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 524