kargarisaac 's Collections
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep
Thinking
Paper
• 2501.04519
• Published
• 288
URSA: Understanding and Verifying Chain-of-thought Reasoning in
Multimodal Mathematics
Paper
• 2501.04686
• Published
• 53
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta
Chain-of-Though
Paper
• 2501.04682
• Published
• 99
Agent Laboratory: Using LLM Agents as Research Assistants
Paper
• 2501.04227
• Published
• 95
Reasoning with Language Model is Planning with World Model
Paper
• 2305.14992
• Published
• 4
Reasoning Language Models: A Blueprint
Paper
• 2501.11223
• Published
• 33
Agent-R: Training Language Model Agents to Reflect via Iterative
Self-Training
Paper
• 2501.11425
• Published
• 109
Language Agent Tree Search Unifies Reasoning Acting and Planning in
Language Models
Paper
• 2310.04406
• Published
• 10
ProAgent: From Robotic Process Automation to Agentic Process Automation
Paper
• 2311.10751
• Published
• 10
Executable Code Actions Elicit Better LLM Agents
Paper
• 2402.01030
• Published
• 188
DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
Paper
• 2502.01142
• Published
• 24