Space: The Smol Training Playbook 📚: The secrets to building world-class LLMs
Space: The Ultra-Scale Playbook 🌌: The ultimate guide to training LLMs on large GPU clusters
Article: KV Caching Explained: Optimizing Transformer Inference Efficiency (Jan 30, 2025)
Paper: Training Dynamics Impact Post-Training Quantization Robustness (2510.06213, published Oct 7, 2025)
Article: Prefill and Decode for Concurrent Requests - Optimizing LLM Performance (Apr 16, 2025)
Collection: 🧠 SmolLM3: Smol, multilingual, long-context reasoner (14 items, updated Oct 9, 2025)
Article: Unlocking Longer Generation with Key-Value Cache Quantization (May 16, 2024)