VibeSearchBench: Benchmarking Long-horizon Proactive Search in the Wild Paper • 2605.27882 • Published 5 days ago • 11
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 4 days ago • 99
The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence Paper • 2605.26494 • Published 6 days ago • 34
Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention Paper • 2605.22791 • Published 11 days ago • 30
Toto 2.0: Time Series Forecasting Enters the Scaling Era Paper • 2605.20119 • Published 13 days ago • 38
Solvita: Enhancing Large Language Models for Competitive Programming via Agentic Evolution Paper • 2605.15301 • Published 18 days ago • 22
Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding Paper • 2605.02290 • Published 28 days ago • 40
MAP: A Map-then-Act Paradigm for Long-Horizon Interactive Agent Reasoning Paper • 2605.13037 • Published 19 days ago • 8
It's TIME: Towards the Next Generation of Time Series Forecasting Benchmarks Paper • 2602.12147 • Published Mar 4 • 4
Retrieval from Within: An Intrinsic Capability of Attention-Based Models Paper • 2605.05806 • Published 24 days ago • 6
δ-mem: Efficient Online Memory for Large Language Models Paper • 2605.12357 • Published 20 days ago • 124
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling Paper • 2605.08083 • Published 24 days ago • 69
AI Co-Mathematician: Accelerating Mathematicians with Agentic AI Paper • 2605.06651 • Published 25 days ago • 15
Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems Paper • 2605.04018 • Published 27 days ago • 40
Hallucinations Undermine Trust; Metacognition is a Way Forward Paper • 2605.01428 • Published 30 days ago • 24
The Last Human-Written Paper: Agent-Native Research Artifacts Paper • 2604.24658 • Published Apr 29 • 21