Hristo Panev's picture

100 763

Hristo Panev

hppdqdq

·

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

Phr00t/Qwen3-VL-32B-Instruct-heretic-v2-iQ5KS-GGUF

liked a Space 11 days ago

lmms-lab-si/EASI-Leaderboard

liked a model 17 days ago

nvidia/NitroGen

View all activity

Organizations

None yet

upvoted a collection 4 months ago

Hermes 4 Collection

13 items • Updated Dec 2, 2025 • 77

upvoted a collection 5 months ago

MMLU Pro benchmark for GGUFs (1 shot)

"Not all quantized model perform good", serving framework ollama uses NVIDIA gpu, llama.cpp uses CPU with AVX & AMX • 13 items • Updated Aug 15, 2025 • 9

upvoted an article 7 months ago

Article

KV Cache from scratch in nanoVLM

+3

Jun 4, 2025

•

108

upvoted 3 collections 8 months ago

Wan2.1 14B T2V LoRAs

A collection of Remade's Wan2.1 14B T2V LoRAs • 20 items • Updated Mar 27, 2025 • 35

Wan2.1 14B 480p I2V LoRAs

A collection of Remade's Wan2.1 14B 480p I2V LoRAs • 49 items • Updated May 24, 2025 • 208

MedGemma Release

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated Jul 11, 2025 • 370

upvoted a paper 8 months ago

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Paper • 2505.13308 • Published May 19, 2025 • 27

upvoted an article 8 months ago

Article

The 4 Things Qwen-3’s Chat Template Teaches Us

Apr 30, 2025

•

82

upvoted an article 9 months ago

Article

Mixture of Experts Explained

+4

Dec 11, 2023

•

1.03k

upvoted a collection 11 months ago

Deepseek Papers

Deepseek papers collection • 28 items • Updated 1 day ago • 298

upvoted a paper 11 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16, 2025 • 166

upvoted a collection 11 months ago

Step-Audio

Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS • 4 items • Updated Jul 31, 2025 • 32

upvoted 2 articles 11 months ago

Article

Open-source DeepResearch – Freeing our search agents

+3

Feb 4, 2025

•

1.31k

Article

Open-R1: Update #1

Feb 2, 2025

•

305

upvoted 2 papers 12 months ago

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published Jan 21, 2025 • 64

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8, 2025 • 287

upvoted 2 papers about 1 year ago

Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts

Paper • 2411.10669 • Published Nov 16, 2024 • 10

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 129

upvoted a collection about 1 year ago

LongVU

7 items • Updated Oct 31, 2024 • 35

upvoted an article about 1 year ago

Article

Allegro: Advanced Video Generation Model

Oct 22, 2024

•

59