allthingsdisaggregated
lastweek
AI & ML interests
None yet
Organizations
None yet
llm-news
- Self-Rewarding Language Models
Paper • 2401.10020 • Published • 151
- Advances in 3D Generation: A Survey
Paper • 2401.17807 • Published • 19
- Efficient Tool Use with Chain-of-Abstraction Reasoning
Paper • 2401.17464 • Published • 21
- MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Paper • 2403.09611 • Published • 129
inference
- Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws
Paper • 2401.00448 • Published • 30
- Improving Text Embeddings with Large Language Models
Paper • 2401.00368 • Published • 82
- E^2-LLM: Efficient and Extreme Length Extension of Large Language Models
Paper • 2401.06951 • Published • 26
- The Unreasonable Ineffectiveness of the Deeper Layers
Paper • 2403.17887 • Published • 82
models
0
None public yet
datasets
0
None public yet