causal-rewards/sycophancy_dpo_llama3.1_8b_ultrachat200k_iter1_new Viewer • Updated Sep 21, 2025 • 847 • 5
causal-rewards/ultrafeedback_60658_pref_dataset_original_plus_filtered_improved_degraded_attimp_threshold0p2 Viewer • Updated Jul 3, 2025 • 920k • 19
causal-rewards/ultrafeedback_60658_preference_dataset_original_neutrals_filtered_improve-degrade_filtered0p2 Viewer • Updated Apr 22, 2025 • 218k • 8
causal-rewards/ultrafeedback-binarized-preferences-cleaned-neutral Viewer • Updated Apr 16, 2025 • 60.9k • 10