MercedeSnape's Collections

agent reasoning

updated 7 days ago

  • MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

    Paper • 2511.11793 • Published Nov 14, 2025 • 169

    Note: Interactive Scaling is the third-dimension metric.


  • Reinforcement Learning for Self-Improving Agent with Skill Library

    Paper • 2512.17102 • Published 22 days ago • 32