Papers - Meta
updated
LIMA: Less Is More for Alignment
Paper
• 2305.11206
• Published
• 27
Garment3DGen: 3D Garment Stylization and Texture Generation
Paper
• 2403.18816
• Published
• 25
EgoLifter: Open-world 3D Segmentation for Egocentric Perception
Paper
• 2403.18118
• Published
• 12
The Unreasonable Ineffectiveness of the Deeper Layers
Paper
• 2403.17887
• Published
• 82
Automated Unit Test Improvement using Large Language Models at Meta
Paper
• 2402.09171
• Published
• 5
High Fidelity Neural Audio Compression
Paper
• 2210.13438
• Published
• 4
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Paper
• 1907.11692
• Published
• 10
PointInfinity: Resolution-Invariant Point Diffusion Models
Paper
• 2404.03566
• Published
• 16
Robust Gaussian Splatting
Paper
• 2404.04211
• Published
• 9
DeiT III: Revenge of the ViT
Paper
• 2204.07118
• Published
• 1
Megalodon: Efficient LLM Pretraining and Inference with Unlimited
Context Length
Paper
• 2404.08801
• Published
• 66
TriForce: Lossless Acceleration of Long Sequence Generation with
Hierarchical Speculative Decoding
Paper
• 2404.11912
• Published
• 17
Transformer Language Models without Positional Encodings Still Learn
Positional Information
Paper
• 2203.16634
• Published
• 5
The Impact of Positional Encoding on Length Generalization in
Transformers
Paper
• 2305.19466
• Published
• 2
MultiBooth: Towards Generating All Your Concepts in an Image from Text
Paper
• 2404.14239
• Published
• 9
MoDE: CLIP Data Experts via Clustering
Paper
• 2404.16030
• Published
• 15
Are Sixteen Heads Really Better than One?
Paper
• 1905.10650
• Published
• 2
Meta 3D AssetGen: Text-to-Mesh Generation with High-Quality Geometry,
Texture, and PBR Materials
Paper
• 2407.02445
• Published
• 4
Branch-Solve-Merge Improves Large Language Model Evaluation and
Generation
Paper
• 2310.15123
• Published
• 8
Distilling System 2 into System 1
Paper
• 2407.06023
• Published
• 4
SAM 2: Segment Anything in Images and Videos
Paper
• 2408.00714
• Published
• 120
Poincaré Embeddings for Learning Hierarchical Representations
Paper
• 1705.08039
• Published
• 1
Movie Gen: A Cast of Media Foundation Models
Paper
• 2410.13720
• Published
• 100
Augmenting Self-attention with Persistent Memory
Paper
• 1907.01470
• Published
• 1
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper
• 2412.09871
• Published
• 108
FastText.zip: Compressing text classification models
Paper
• 1612.03651
• Published
• 1