BioLaySumm (BioLaySumm Shared Task at ACL)

gowitheflow

authored a paper 3 months ago

Scaling Language-Centric Omnimodal Representation Learning

Paper • 2510.11693 • Published Oct 13, 2025 • 100

gowitheflow

authored a paper 4 months ago

Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth

Paper • 2509.03867 • Published Sep 4, 2025 • 210

gowitheflow

authored a paper 5 months ago

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

Paper • 2506.09513 • Published Jun 11, 2025 • 101

SiweiWu

authored a paper 5 months ago

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15, 2025 • 63

gowitheflow

authored a paper 5 months ago

VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning

Paper • 2507.22607 • Published Jul 30, 2025 • 46

gowitheflow

authored a paper 7 months ago

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8, 2025 • 113

gowitheflow

updated 2 datasets 8 months ago

BioLaySumm/BioLaySumm2025-LaymanRRG-opensource-track

Viewer • Updated May 9, 2025 • 171k • 39

BioLaySumm/LaymanRRG-closesource-track

Viewer • Updated May 9, 2025 • 221k • 52

gowitheflow

authored a paper 8 months ago

Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts

Paper • 2504.21117 • Published Apr 29, 2025 • 26

gowitheflow

authored a paper 9 months ago

Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations

Paper • 2504.13816 • Published Apr 18, 2025 • 18

gowitheflow

published a dataset 9 months ago

BioLaySumm/LaymanRRG-closesource-track

Viewer • Updated May 9, 2025 • 221k • 52

gowitheflow

authored 2 papers 9 months ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19, 2025 • 43

MIEB: Massive Image Embedding Benchmark

Paper • 2504.10471 • Published Apr 14, 2025 • 20

SiweiWu

authored a paper 9 months ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7, 2025 • 44

SiweiWu

authored 3 papers 10 months ago

LIME: Less Is More for MLLM Evaluation

Paper • 2409.06851 • Published Sep 10, 2024 • 2

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20, 2025 • 106

LongEval: A Comprehensive Analysis of Long-Text Generation Through a Plan-based Paradigm

Paper • 2502.19103 • Published Feb 26, 2025 • 3

gowitheflow

updated a dataset 11 months ago

BioLaySumm/BioLaySumm2025-eLife

Viewer • Updated Feb 19, 2025 • 4.73k • 32 • 1

gowitheflow

published a dataset 11 months ago

BioLaySumm/BioLaySumm2025-eLife

Viewer • Updated Feb 19, 2025 • 4.73k • 32 • 1

gowitheflow

updated a dataset 11 months ago

BioLaySumm/BioLaySumm2025-PLOS

Viewer • Updated Feb 19, 2025 • 26.3k • 84

Team members 4