Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark Paper • 2501.05444 • Published Jan 9, 2025 • 3
ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning Paper • 2510.27492 • Published Oct 30, 2025 • 82
ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning Paper • 2510.27492 • Published Oct 30, 2025 • 82
Diversity-Incentivized Exploration for Versatile Reasoning Paper • 2509.26209 • Published Sep 30, 2025 • 16
Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning Paper • 2505.19761 • Published May 26, 2025
Text-to-Decision Agent: Offline Meta-Reinforcement Learning from Natural Language Supervision Paper • 2504.15046 • Published Apr 21, 2025
Attention-Guided Contrastive Role Representations for Multi-Agent Reinforcement Learning Paper • 2312.04819 • Published Dec 8, 2023
Mixture-of-Experts Meets In-Context Reinforcement Learning Paper • 2506.05426 • Published Jun 5, 2025 • 5
Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations Paper • 2506.04633 • Published Jun 5, 2025 • 19
Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models Paper • 2505.14810 • Published May 20, 2025 • 62
FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow Paper • 2505.17399 • Published May 23, 2025 • 14
OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning Paper • 2505.08617 • Published May 13, 2025 • 41
CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models Paper • 2407.17467 • Published Jul 24, 2024
Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark Paper • 2501.05444 • Published Jan 9, 2025 • 3
OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning Paper • 2505.08617 • Published May 13, 2025 • 41
Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning Paper • 2504.16656 • Published Apr 23, 2025 • 57
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought Paper • 2504.05599 • Published Apr 8, 2025 • 85