In Pursuit of Pixel Supervision for Visual Pre-training Paper • 2512.15715 • Published 7 days ago • 7
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs Paper • 2512.07525 • Published 16 days ago • 55
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation Paper • 2511.09057 • Published Nov 12 • 76
Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers Paper • 2401.11605 • Published Jan 21, 2024 • 23
Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training Paper • 2510.12586 • Published Oct 14 • 108
From Pixels to Words -- Towards Native Vision-Language Primitives at Scale Paper • 2510.14979 • Published Oct 16 • 66
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference Paper • 2508.02193 • Published Aug 4 • 133
Article What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware Aug 8 • 29
facebook/dinov3-vith16plus-pretrain-lvd1689m Image Feature Extraction • 0.8B • Updated Aug 19 • 94.1k • 38
facebook/dinov3-vits16-pretrain-lvd1689m Image Feature Extraction • 21.6M • Updated Aug 19 • 293k • 58