FAR AI
non-profit
https://far.ai/
FARAIResearch
AlignmentResearch
Followers: 48
AI & ML interests
Frontier alignment research to ensure the safe development and deployment of advanced AI systems.
Recent Activity
sam-far published a model 1 day ago: AlignmentResearch/hr_hand_crafted_Llama-3.3-70B_medium_parity_v4
sam-far published a model 1 day ago: AlignmentResearch/hr_hand_crafted_Llama-3.3-70B_medium_parity_merged_v3
sam-far published a model 1 day ago: AlignmentResearch/hr_hand_crafted_Llama-3.3-70B_medium_parity_v3
Team members: 18
AlignmentResearch's datasets: 80
Sort: Recently updated
AlignmentResearch/NestedCiphers • Viewer • Updated Mar 13, 2025 • 806k • 27
AlignmentResearch/AugmentedJailbreaks • Viewer • Updated Mar 13, 2025 • 20.8k • 69
AlignmentResearch/JailbreakCompletions • Viewer • Updated Mar 13, 2025 • 46.3k • 32
AlignmentResearch/WildChatFiltered • Viewer • Updated Mar 12, 2025 • 24k • 8
AlignmentResearch/JailbreakInputs • Viewer • Updated Mar 11, 2025 • 102k • 34 • 1
AlignmentResearch/Llama3Jailbreaks • Viewer • Updated Feb 12, 2025 • 78.5k • 29
AlignmentResearch/XSTest • Viewer • Updated Jan 30, 2025 • 900 • 19
AlignmentResearch/WordLength • Viewer • Updated Aug 7, 2024 • 100k • 19
AlignmentResearch/Harmless • Viewer • Updated Jul 29, 2024 • 86.6k • 77
AlignmentResearch/Helpful • Viewer • Updated Jul 29, 2024 • 88.1k • 186
AlignmentResearch/PasswordMatch • Viewer • Updated Jul 29, 2024 • 100k • 29
AlignmentResearch/IMDB • Viewer • Updated Jul 29, 2024 • 97.5k • 127 • 1
AlignmentResearch/EnronSpam • Viewer • Updated Jul 29, 2024 • 62.3k • 20
AlignmentResearch/PasswordMatch-test • Viewer • Updated Jul 26, 2024 • 50k • 22
AlignmentResearch/WordLength-test • Viewer • Updated Jul 26, 2024 • 100k • 17
AlignmentResearch/StrongREJECT-test • Viewer • Updated Jul 26, 2024 • 313 • 11
AlignmentResearch/IMDB-test • Viewer • Updated Jul 26, 2024 • 97.5k • 14
AlignmentResearch/EnronSpam-test • Viewer • Updated Jul 26, 2024 • 62.4k • 18
AlignmentResearch/boxoban-astar-solutions • Preview • Updated Jul 25, 2024 • 100
AlignmentResearch/RuLES-Encryption • Viewer • Updated Jul 16, 2024 • 50k • 9 • 1