Praful Mohanan

Praful932

https://praful932.dev/

AI & ML interests

smol and fast llms + open source + low level design

Recent Activity

upvoted an article 27 days ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

updated a dataset 3 months ago

Praful932/abmelt-experiments-exp_20260220_130124

published a dataset 3 months ago

Praful932/abmelt-experiments-exp_20260220_130124

Organizations

upvoted an article 27 days ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

natolambert, LouisCastricato, lvwerra, Dahoas • Dec 9, 2022 • 411

upvoted an article 5 months ago

Article

Why You Should Care About Partial Differential Equations (PDEs)

hugging-science • Dec 12, 2025 • 45

upvoted an article 8 months ago

Article

Accelerating AI for Drug Discovery: Ginkgo’s GDPx Functional Genomics and GDPa Antibody Developability Dataset Series

cgeorgiaw • Jun 24, 2025 • 20

upvoted 3 articles about 1 year ago

Article

Mixture of Experts Explained

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.12k

Article

What is Qwen-Agent framework? Inside the Qwen family

Kseniase • Mar 20, 2025 • 13

Article

You could have designed state of the art positional encoding

FL33TW00D-HF • Nov 25, 2024 • 477

upvoted a paper over 1 year ago

KTO: Model Alignment as Prospect Theoretic Optimization

Paper • 2402.01306 • Published Feb 2, 2024 • 22

upvoted a collection about 2 years ago

Recent models: last 100 repos, sorted by creation date

Collection

The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 100 items • Updated Mar 2 • 577
