Qianfan-OCR: A Unified End-to-End Model for Document Intelligence Paper • 2603.13398 • Published 16 days ago • 148
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis Paper • 2603.20278 • Published 10 days ago • 85
F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World Paper • 2603.19223 • Published 8 days ago • 30
Supervised Fine-Tuning or Contrastive Learning? Towards Better Multimodal LLM Reranking Paper • 2510.14824 • Published Oct 16, 2025 • 2
Encoders vs Decoders: the Ettin Suite Collection A collection of SOTA, open-data, paired encoder-only and decoder-only models ranging from 17M to 1B params. See the paper at https://arxiv.org/abs/250 • 30 items • Updated 25 days ago • 28
Scaling Language-Centric Omnimodal Representation Learning Paper • 2510.11693 • Published Oct 13, 2025 • 107
MM-CondChain: A Programmatically Verified Benchmark for Visually Grounded Deep Compositional Reasoning Paper • 2603.12266 • Published 15 days ago • 19