Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
mirth 's Collections
Text chunking / splitting models

Text chunking / splitting models

updated Jan 24

It intelligently segments text into meaningful semantic chunks. Could be useful for RAG systems as text-chunking module.

Upvote
1

  • mirth/chonky_distilbert_base_uncased_1

    Token Classification • 66.4M • Updated Jan 17 • 26.5k • • 15

  • mirth/chonky_mmbert_small_multilingual_1

    Token Classification • 0.1B • Updated Jan 17 • 180 • 23

  • mirth/chonky_modernbert_base_1

    Token Classification • 0.1B • Updated Jan 17 • 8.73k • • 6

  • mirth/chonky_modernbert_large_1

    Token Classification • 0.4B • Updated Jan 17 • 2.42k • 2
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs