Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

One-click Deployment

Inference Endpoints

Microsoft Foundry

Amazon SageMaker AI

Misc

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

246

Base only

Active filters: GRPO

etri-vilab/MultiHopSpatial-Qwen3-VL-4B-Instruct

Image-Text-to-Text • 4B • Updated 7 days ago • 97 • 2

mradermacher/SocialR1-8B-i1-GGUF

Reinforcement Learning • 4B • Updated May 12 • 93 • 2

etri-vilab/MultiHopSpatial-Qwen3-VL-8B-Instruct

Image-Text-to-Text • 9B • Updated 7 days ago • 48 • 1

etri-vilab/MultiHopSpatial-Qwen3-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated 7 days ago • 40 • 1

Ihor/Text2Graph-R1-Qwen2.5-0.5b

Text Generation • 0.5B • Updated Aug 18, 2025 • 56 • • 24

prithivMLmods/Bellatrix-Tiny-1B-R1

Text Generation • 1B • Updated Feb 2, 2025 • 9 • • 1

mradermacher/Bellatrix-Tiny-1B-R1-GGUF

1B • Updated Feb 3, 2025 • 68

mradermacher/Bellatrix-Tiny-1B-R1-i1-GGUF

1B • Updated Feb 3, 2025 • 134

Novaciano/Bellatrix-1B-R1_Erotiquant3_IQ4_XS-GGUF

Text Generation • 1B • Updated Feb 3, 2025 • 12

Novaciano/Bellatrix-1B-R1_Erotiquant3_Q5_K_M-GGUF

Text Generation • 1B • Updated Feb 3, 2025 • 10

tecosys/Nutaan-RL1

Reinforcement Learning • Updated Feb 7, 2025 • 2

mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF

0.5B • Updated Aug 18, 2025 • 67 • 1

mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF

0.5B • Updated Aug 18, 2025 • 336 • 1

alpha-ai/Deep-Reason-SMALL-V0-GGUF

3B • Updated Feb 26, 2025 • 38 • 1

alpha-ai/Deep-Reason-SMALL-V0

Text Generation • 3B • Updated Feb 26, 2025 • 17 • 2

mradermacher/Deep-Reason-SMALL-V0-GGUF

3B • Updated Feb 9, 2025 • 53 • 2

mradermacher/Deep-Reason-SMALL-V0-i1-GGUF

3B • Updated Feb 9, 2025 • 158 • 1

alpha-ai/qwen2.5-reason-thought-lite-GGUF

3B • Updated Apr 28, 2025 • 19

alpha-ai/qwen2.5-reason-thought-lite

Text Generation • 3B • Updated Apr 28, 2025 • 16 •

alpha-ai/llama-3.2-3B-Reason-Reflect-Lite-GGUF

3B • Updated Feb 26, 2025 • 55 • 2

alpha-ai/llama-3.2-3B-Reason-Reflect-Lite

Text Generation • 3B • Updated Feb 26, 2025 • 10

mradermacher/Cogito-R1-GGUF

33B • Updated Jul 31, 2025 • 97

accuracy-maker/Llama-3.2-1B-GRPO-gsm8k

Text Generation • 1B • Updated Feb 12, 2025 • 8 •

mradermacher/Cogito-R1-i1-GGUF

33B • Updated Feb 13, 2025 • 631

AaryanK/Qwen_2.5_3B_GRPO_Reasoning_XIOSERV

3B • Updated Feb 17, 2025 • 110 • 1

Nitral-AI/Captain-Eris_Violet-GRPO-v0.420

Text Generation • 12B • Updated Apr 14, 2025 • 23 • • 24

prithivMLmods/SmolLM2_135M_Grpo_Gsm8k

Text Generation • 0.1B • Updated Feb 17, 2025 • 12 • 9

prithivMLmods/SmolLM2_135M_Grpo_Checkpoint

Text Generation • 0.1B • Updated Feb 17, 2025 • 8 • 1

alpha-ai/Reason-With-Choice-3B-GGUF

3B • Updated Feb 26, 2025 • 86

alpha-ai/Reason-With-Choice-3B

Text Generation • 3B • Updated Feb 26, 2025 • 7