Article • Illustrating Reinforcement Learning from Human Feedback (RLHF) • +2 natolambert, LouisCastricato, lvwerra, Dahoas • Dec 9, 2022 • 411
Article • Why You Should Care About Partial Differential Equations (PDEs) • hugging-science • Dec 12, 2025 • 45
Article • Accelerating AI for Drug Discovery: Ginkgo’s GDPx Functional Genomics and GDPa Antibody Developability Dataset Series • cgeorgiaw • Jun 24, 2025 • 20
Article • Mixture of Experts Explained • +4 osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.12k
Article • What is Qwen-Agent framework? Inside the Qwen family • Kseniase • Mar 20, 2025 • 13
Article • You could have designed state of the art positional encoding • FL33TW00D-HF • Nov 25, 2024 • 477
KTO: Model Alignment as Prospect Theoretic Optimization • Paper • 2402.01306 • Published Feb 2, 2024 • 22
Recent models: last 100 repos, sorted by creation date • Collection • The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 100 items • Updated Mar 2 • 577