MMCTAgent: Multi-modal Critical Thinking Agent Framework for Complex Visual Reasoning Paper • 2405.18358 • Published May 28, 2024
Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models Paper • 2406.11230 • Published Jun 17, 2024 • 33
PromptWizard: Task-Aware Prompt Optimization Framework Paper • 2405.18369 • Published May 28, 2024 • 1
Do You See Me : A Multidimensional Benchmark for Evaluating Visual Perception in Multimodal LLMs Paper • 2506.02022 • Published May 28