Mitigating Catastrophic Forgetting in Mathematical Reasoning Finetuning through Mixed Training Paper β’ 2512.13706 β’ Published Dec 5, 2025 β’ 1
Flan-T5 release Collection The Flan-T5 covers 4 checkpoints of different sizes each time. It also includes upgrades versions trained using Universal sampling β’ 7 items β’ Updated 28 days ago β’ 35
Code Generation Collection Models and datasets relevant to training code generation models in future projects β’ 5 items β’ Updated Nov 25, 2024 β’ 1
Finetuning Collection Models to fine-tune (and datasets to ft with) in future projects β’ 16 items β’ Updated Dec 2, 2025 β’ 1
Mathematics Collection Models and datasets related to mathematics generation β’ 13 items β’ Updated Nov 24, 2024 β’ 1