An Evaluation of LLMs Inference on Popular Single-board Computers Paper • 2511.07425 • Published Oct 20, 2025 • 2
BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity Paper • 2507.08771 • Published Jul 11, 2025 • 9
Distilling LLM Agent into Small Models with Retrieval and Code Tools Paper • 2505.17612 • Published May 23, 2025 • 81
A rock and a hard place Collection List of language models verified to work on RKLLM 1.1.2 • 8 items • Updated May 6, 2025 • 1
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM Jan 3, 2025 • 37