Biencoders - Zero shot text classification
Collection
Biencoders-based architectures for zero-shot multi-label text classification. • 8 items • Updated
GLiNER-inspired span-attentive classification with top-K span selection.
This model encodes texts and candidate labels into a shared embedding space using BERT, enabling classification into arbitrary categories without retraining for new labels.
| Parameter | Value |
|---|---|
| Base model | bert-base-uncased |
| Model variant | spanclass |
| Training steps | 1000 |
| Batch size | 2 |
| Learning rate | 2e-05 |
| Trainable params | 111,254,017 |
| Training time | 374.1s |
Trained on polodealvarado/zeroshot-classification.
| Metric | Score |
|---|---|
| Precision | 0.9277 |
| Recall | 0.9503 |
| F1 Score | 0.9388 |
from models.spanclass import SpanClassModel
model = SpanClassModel.from_pretrained("polodealvarado/spanclass")
predictions = model.predict(
texts=["The stock market crashed yesterday."],
labels=[["Finance", "Sports", "Biology", "Economy"]],
)
print(predictions)
# [{"text": "...", "scores": {"Finance": 0.98, "Economy": 0.85, ...}}]
Base model
google-bert/bert-base-uncased