Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_rel_1e0_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 1 minute ago
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_sgnrel_up_1e0_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 1 minute ago
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_sgnrel_down_1e1_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 1 minute ago
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_rel_1e-2_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 1 minute ago
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_sgnrel_down_1e0_1p0_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 2 minutes ago
Kazuki1450/Qwen3-0.6B_csum_6_10_rel_1e-4_1p0_0p1_1p0_grpo_42_rule Text Generation • 0.6B • Updated 2 minutes ago
Kazuki1450/Qwen3-0.6B_csum_6_10_rel_1e-6_1p0_0p1_1p0_grpo_42_rule Text Generation • 0.6B • Updated 4 minutes ago
Kazuki1450/Olmo-3-1025-7B_csum_6_10_1p0_0p0_1p0_grpo_42_rule Text Generation • 7B • Updated 4 minutes ago
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_1p0_0p1_1p0_grpo_42_rule Text Generation • 2B • Updated 5 minutes ago
Kazuki1450/Qwen2.5-1.5B-Instruct_csum_6_10_1p0_0p5_1p0_grpo_42_rule Text Generation • 2B • Updated 5 minutes ago
Kazuki1450/Light-R1-SFTData-Extended-With-Difficulty-split10 Viewer • Updated 9 days ago • 5.59k • 24