Medmarks: A Comprehensive Open-Source LLM Benchmark Suite for Medical Tasks Paper • 2605.01417 • Published May 2 • 1
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published Oct 28, 2025 • 24
PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues Paper • 2601.17277 • Published Jan 24 • 6
Retrieval-augmented reasoning with lean language models Paper • 2508.11386 • Published Aug 15, 2025 • 5
Language Surgery in Multilingual Large Language Models Paper • 2506.12450 • Published Jun 14, 2025 • 16
Maya @CVPR 2025 Collection Two papers from the Maya Project have been accepted at CVPR 2025! • 2 items • Updated May 30, 2025
Behind Maya: Building a Multilingual Vision Language Model Paper • 2505.08910 • Published May 13, 2025 • 2
Behind Maya: Building a Multilingual Vision Language Model Paper • 2505.08910 • Published May 13, 2025 • 2
Behind Maya: Building a Multilingual Vision Language Model Paper • 2505.08910 • Published May 13, 2025 • 2
Robust and Fine-Grained Detection of AI Generated Texts Paper • 2504.11952 • Published Apr 16, 2025 • 12
Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation Paper • 2504.07072 • Published Apr 9, 2025 • 9
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge Paper • 2411.19799 • Published Nov 29, 2024 • 17
Maya: An Instruction Finetuned Multilingual Multimodal Model Paper • 2412.07112 • Published Dec 10, 2024 • 28
Maya: An Instruction Finetuned Multilingual Multimodal Model Paper • 2412.07112 • Published Dec 10, 2024 • 28
Maya: An Instruction Finetuned Multilingual Multimodal Model Paper • 2412.07112 • Published Dec 10, 2024 • 28