Tokenization in Transformers v5: Simpler, Clearer, and More Modular Article • 20 days ago • 100
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 6 items • Updated 7 days ago • 113
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published Dec 2, 2025 • 244
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 136
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 81
Transformers v5: Simple model definitions powering the AI ecosystem Article • Dec 1, 2025 • 265
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published Nov 9, 2025 • 132
INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats Paper • 2510.25602 • Published Oct 29, 2025 • 77
Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation Paper • 2510.22115 • Published Oct 25, 2025 • 83
Scaling Latent Reasoning via Looped Language Models Paper • 2510.25741 • Published Oct 29, 2025 • 221
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM Paper • 2510.15870 • Published Oct 17, 2025 • 89
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published Oct 22, 2025 • 114
Diffusion Transformers with Representation Autoencoders Paper • 2510.11690 • Published Oct 13, 2025 • 165
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13, 2025 • 176