Jiayi Zhang

didiforhugface

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

InfoPO: Information-Driven Policy Optimization for User-Centric Agents

upvoted a paper 15 days ago

AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines

authored a paper about 1 month ago

AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration

upvoted a paper 2 days ago

InfoPO: Information-Driven Policy Optimization for User-Centric Agents

Paper • 2603.00656 • Published 6 days ago • 9

upvoted a paper 15 days ago

AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines

Paper • 2602.14296 • Published 19 days ago • 49

upvoted 2 papers about 1 month ago

MARS: Modular Agent with Reflective Search for Automated AI Research

Paper • 2602.02660 • Published Feb 2 • 65

AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration

Paper • 2602.03786 • Published Feb 3 • 87

upvoted 4 papers about 2 months ago

Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text

Paper • 2601.10355 • Published Jan 15 • 39

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Paper • 2601.06943 • Published Jan 11 • 214

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 187

Evolving Programmatic Skill Networks

Paper • 2601.03509 • Published Jan 7 • 87

upvoted an article 3 months ago

We Got Claude to Fine-Tune an Open Source LLM

Article • Dec 4, 2025 • 607

upvoted 6 papers 3 months ago

Agentic Policy Optimization via Instruction-Policy Co-Evolution

Paper • 2512.01945 • Published Dec 1, 2025 • 4

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 300

UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios

Paper • 2511.18050 • Published Nov 22, 2025 • 38

WorldGen: From Text to Traversable and Interactive 3D Worlds

Paper • 2511.16825 • Published Nov 20, 2025 • 24

AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning

Paper • 2511.19304 • Published Nov 24, 2025 • 91

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 109

upvoted 5 papers 4 months ago

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

Paper • 2511.15065 • Published Nov 19, 2025 • 77

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published Oct 30, 2025 • 86

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28, 2025 • 72

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 97