How to distill a mini minicpm-o-2.6?

#55

by hazuki77 - opened Sep 19, 2025

Sep 19, 2025

These days, I try to train a 3B distill-minicpm-o, but I encountered many problems like cuda memory、Compatibility of DataCollector and so on. I do want to know how to distill this model.

tc-mb

OpenBMB org Sep 20, 2025

I'm sorry, but we haven't considered this requirement and haven't distilled this model yet. I think it might be difficult.

Can you use a quantized model or a framework like llama.cpp for efficient inference to solve your problem?

hazuki77

Sep 20, 2025

I have tried to using BitsAndBytesConfig to load a quantized model, but if I use 8bit it reporting "RuntimeError: "normal_kernel_cpu" not implemented for 'Char'", and if I use nf4&4bit, the jupyter kernel crushed.

tc-mb

OpenBMB org Sep 20, 2025

https://huggingface.co/openbmb/MiniCPM-o-2_6-int4
We provide a repository for autogptq's quantitative methods. Compared to developing your own, it should be easier to get results by following the official repository's methods.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment