Runsen Xu
RunsenXu
AI & ML interests
Large Language Models, Multi-modal Learning, 3D Perception and Understanding, Self-supervised Learning
Recent Activity
upvoted a paper 7 days ago
MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence
authored a paper 8 days ago
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations
authored a paper 8 days ago
Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models
Organizations
None yet