AI & ML interests

None defined yet.

Recent Activity

qgallouedec  updated a Space about 17 hours ago
trl-lib/chat-template-inspector
qgallouedec  published a Space about 17 hours ago
trl-lib/chat-template-inspector
qgallouedec  updated a Space 16 days ago
trl-lib/diff-view
View all activity

trl-lib 's collections 7

Comparing DPO with IPO and KTO
A collection of chat models to explore the differences between three alignment techniques: DPO, IPO, and KTO.