[EMNLP'25] A Benchmark for Assessing VLM Safety with Real-World Memes
DongGeon Lee
oneonlee
AI & ML interests
Data-centric natural language processing, AI Safety
Recent Activity
upvoted
a
collection
about 6 hours ago
COMPASS
authored
a paper
about 12 hours ago
Everyday Physics in Korean Contexts: A Culturally Grounded Physical
Reasoning Benchmark
authored
a paper
about 12 hours ago
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+
Languages and Cultures