cokesoda22

community

AI & ML interests

None defined yet.

Recent Activity

kuvvi

published a dataset 24 days ago

ThinkPro/MindCube-VCoT

Viewer • Updated 24 days ago • 10k • 28

luckychao

authored a paper about 2 months ago

Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark

Paper • 2501.05444 • Published Jan 9, 2025 • 3

kuvvi

authored a paper about 2 months ago

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published Oct 30, 2025 • 82

luckychao

authored a paper about 2 months ago

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published Oct 30, 2025 • 82

huzican

authored a paper 3 months ago

Diversity-Incentivized Exploration for Versatile Reasoning

Paper • 2509.26209 • Published Sep 30, 2025 • 16

huzican

authored 2 papers 5 months ago

Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning

Paper • 2505.19761 • Published May 26, 2025

Text-to-Decision Agent: Offline Meta-Reinforcement Learning from Natural Language Supervision

Paper • 2504.15046 • Published Apr 21, 2025

huzican

authored 2 papers 7 months ago

Attention-Guided Contrastive Role Representations for Multi-Agent Reinforcement Learning

Paper • 2312.04819 • Published Dec 8, 2023

Mixture-of-Experts Meets In-Context Reinforcement Learning

Paper • 2506.05426 • Published Jun 5, 2025 • 5

kuvvi

authored 3 papers 7 months ago

Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations

Paper • 2506.04633 • Published Jun 5, 2025 • 19

Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Paper • 2505.14810 • Published May 20, 2025 • 62

FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow

Paper • 2505.17399 • Published May 23, 2025 • 14

kuvvi

authored 4 papers 8 months ago

A Survey on LLM-as-a-Judge

Paper • 2411.15594 • Published Nov 23, 2024

OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning

Paper • 2505.08617 • Published May 13, 2025 • 41

CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models

Paper • 2407.17467 • Published Jul 24, 2024

Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark

Paper • 2501.05444 • Published Jan 9, 2025 • 3

luckychao

authored 2 papers 8 months ago

OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning

Paper • 2505.08617 • Published May 13, 2025 • 41

Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning

Paper • 2504.16656 • Published Apr 23, 2025 • 57

huzican

authored a paper 8 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21, 2025 • 88

luckychao

authored a paper 9 months ago

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

Paper • 2504.05599 • Published Apr 8, 2025 • 85

Team members 5

ThinkPro's activity