Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
dfahey 's Collections
ocr

ocr

updated Nov 8, 2025
Upvote
-

  • mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding

    Paper • 2403.12895 • Published Mar 19, 2024 • 32

  • microsoft/layoutlm-base-uncased

    0.1B • Updated Apr 16, 2024 • 93.6k • 61

  • microsoft/layoutlmv3-base

    0.1B • Updated Apr 10, 2024 • 652k • 469

  • naver-clova-ix/donut-base-finetuned-docvqa

    Document Question Answering • Updated Mar 9, 2024 • 18.9k • 266

  • microsoft/udop-large

    Image-Text-to-Text • 0.7B • Updated Dec 2, 2025 • 2.78k • 120

  • impira/layoutlm-document-qa

    Document Question Answering • 0.1B • Updated Mar 18, 2023 • 8.27k • 1.15k

  • InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions

    Paper • 2401.13313 • Published Jan 24, 2024 • 5

  • DocLLM: A layout-aware generative language model for multimodal document understanding

    Paper • 2401.00908 • Published Dec 31, 2023 • 189

  • mPLUG/DocOwl1.5

    8B • Updated Apr 10, 2024 • 25 • 26

  • olmOCR 2: Unit Test Rewards for Document OCR

    Paper • 2510.19817 • Published Oct 22, 2025 • 15
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs