RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic Text Generation • 8B • Updated 21 days ago • 27.4k • 9
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_hybrid 20B • Updated 12 days ago • 28
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_noise 20B • Updated 12 days ago • 23
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_heuristic 20B • Updated 12 days ago • 27
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.5_bits_mode_hybrid 22B • Updated 12 days ago • 25
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.5_bits_mode_noise 22B • Updated 12 days ago • 24
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.5_bits_mode_heuristic 22B • Updated 12 days ago • 25
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.0_bits_mode_hybrid 23B • Updated 12 days ago • 28
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.0_bits_mode_noise 23B • Updated 12 days ago • 23
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.0_bits_mode_heuristic 23B • Updated 12 days ago • 31
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.5_bits_mode_hybrid 25B • Updated 12 days ago • 29
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.5_bits_mode_noise 25B • Updated 12 days ago • 25
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.5_bits_mode_heuristic 25B • Updated 12 days ago • 29
inference-optimization/Qwen3-30B-A3B-Instruct-2507_7.0_bits_mode_hybrid 26B • Updated 12 days ago • 34
inference-optimization/Qwen3-30B-A3B-Instruct-2507_7.0_bits_mode_noise 26B • Updated 11 days ago • 30
inference-optimization/Qwen3-30B-A3B-Instruct-2507_7.0_bits_mode_heuristic 27B • Updated 11 days ago • 28