WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling Paper • 2512.14614 • Published 8 days ago • 61
Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans? Paper • 2512.13281 • Published 9 days ago • 63
LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published 9 days ago • 69
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding Paper • 2512.13586 • Published 9 days ago • 87
EgoX: Egocentric Video Generation from a Single Exocentric Video Paper • 2512.08269 • Published 15 days ago • 108
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published 9 days ago • 98
OmniPSD: Layered PSD Generation with Diffusion Transformer Paper • 2512.09247 • Published 15 days ago • 46
OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory Paper • 2512.07802 • Published 16 days ago • 43
Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality Paper • 2512.07951 • Published 16 days ago • 47
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation Paper • 2512.09363 • Published 14 days ago • 70
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground Paper • 2512.10430 • Published 13 days ago • 112
Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform Paper • 2512.08478 • Published 15 days ago • 76
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper • 2512.08765 • Published 15 days ago • 125
EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper • 2512.05965 • Published 19 days ago • 38