Active filters: awq
QuantTrio/Qwen3.5-122B-A10B-AWQ
Image-Text-to-Text
• 125B • Updated
• 18.7k
• 10
QuantTrio/Qwen3.5-35B-A3B-AWQ
Image-Text-to-Text
• 36B • Updated
• 32.1k
• 8
QuantTrio/Qwen3.5-27B-AWQ
Image-Text-to-Text
• 28B • Updated
• 22.4k
• 8
QuantTrio/Qwen3.5-397B-A17B-AWQ
Image-Text-to-Text
• Updated
• 4.03k
• 5
mratsim/MiniMax-M2.5-BF16-INT4-AWQ
Text Generation
• 39B • Updated
• 45.8k
• 28
casperhansen/llama-3.3-70b-instruct-awq
Text Generation
• 71B • Updated
• 644k
• 39
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation
• 14B • Updated
• 1.16M
• 14
QuantTrio/MiniMax-M2.5-AWQ
Text Generation
• 229B • Updated
• 44.5k
• 10
Text Generation
• 586B • Updated
• 74
• 2
TheBloke/openchat_3.5-AWQ
Text Generation
• 7B • Updated
• 37
• 15
solidrust/Mistral-7B-Instruct-v0.3-AWQ
Text Generation
• 7B • Updated
• 4.82k
• 8
Qwen/Qwen2.5-72B-Instruct-AWQ
Text Generation
• 73B • Updated
• 925k
• 75
stelterlab/SauerkrautLM-v2-14b-SFT-AWQ
15B • Updated
• 2
• 1
RichardErkhov/aixonlab_-_RocRacoon-3b-awq
4B • Updated
• 2
• 1
stelterlab/Mistral-Small-24B-Instruct-2501-AWQ
Text Generation
• 24B • Updated
• 248k
• 26
gaunernst/gemma-3-4b-it-int4-awq
Image-Text-to-Text
• Updated
• 39.2k
• 6
stelterlab/DeepSeek-R1-0528-Qwen3-8B-AWQ
Text Generation
• 8B • Updated
• 602
• 5
QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ
Text Generation
• 31B • Updated
• 4.87k
• 4
QuantTrio/Qwen3-VL-30B-A3B-Thinking-AWQ
Text Generation
• 31B • Updated
• 3.05k
• 12
QuantTrio/Qwen3-VL-32B-Thinking-AWQ
Image-Text-to-Text
• 33B • Updated
• 1.4k
• 7
TheHouseOfTheDude/GLM-4.7-Flash_AWQ
Text Generation
• Updated
• 5.1k
• 3
sasa2000/Qwen3-4B-Instruct-2507-heretic-AWQ-4bit
4B • Updated
• 3
• 1
openbmb/MiniCPM-o-4_5-awq
Any-to-Any
• 9B • Updated
• 6.93k
• 17
mratsim/MiniMax-M2.5-FP8-INT4-AWQ
Text Generation
• 39B • Updated
• 5.13k
• 9
groxaxo/Qwen3-4B-Instruct-2507-heretic-W8A16
Text Generation
• 4B • Updated
• 23
• 1
EliasOenal/MiniMax-M2.5-Hybrid-AWQ-W4A16G128-Attn-fp8_e4m3-KV-fp8_e4m3
Text Generation
• 34B • Updated
• 261
• 11
tokyotech-llm/Qwen3-Swallow-32B-RL-v0.2-AWQ-INT4
Text Generation
• 33B • Updated
• 764
• 1
AlphaMindLabs/llemma_7b-AWQ.4bit
7B • Updated
• 84
• 1
casperhansen/mpt-7b-8k-chat-awq
Text Generation
• Updated
• 5
• 3
casperhansen/falcon-7b-awq
Text Generation
• Updated
• 6
• 1