DCAgent2/swebench-verified-sample-100_Qwen3-Coder-30B-A3B-Instruct-FP8_20251126 Viewer • Updated Dec 4, 2025 • 99 • 7
DCAgent2/swebench-verified-sample-100_Qwen3-Coder-30B-A3B-Instruct-FP8_20251126 Viewer • Updated Dec 4, 2025 • 99 • 7
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published Oct 9, 2025 • 36
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published Oct 9, 2025 • 36