-
CodeFusion: A Pre-trained Diffusion Model for Code Generation
Paper • 2310.17680 • Published • 73 -
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning
Paper • 2312.15685 • Published • 16 -
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 55 -
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws
Paper • 2401.00448 • Published • 30
Sergei Averkiev
averoo
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
11 days ago
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground
upvoted
a
paper
27 days ago
Latent Collaboration in Multi-Agent Systems