Making Qwen3 Think in Korean with Reinforcement Learning https://arxiv.org/abs/2508.10355
AI & ML interests
VDPU, SLM, RAG
Recent Activity
Reasoning model distilled from DeepSeek-R1, enhanced with GRPO using supplementary reasoning datasets.
For more details, please visit https://github.com/dnotitia/smoothie-qwen
-
Smoothie-Qwen: Post-Hoc Smoothing to Reduce Language Bias in Multilingual LLMs
Paper • 2507.05686 • Published • 1 -
dnotitia/Smoothie-Qwen3-0.6B
Text Generation • 0.6B • Updated • 247 • 1 -
dnotitia/Smoothie-Qwen3-1.7B
Text Generation • 2B • Updated • 262 • 2 -
dnotitia/Smoothie-Qwen3-4B
Text Generation • 4B • Updated • 30 • 3
High-performance LLM developed by Dnotitia Inc., incorporating cutting-edge techniques for superior reasoning tasks.
8B Korean SoTA model, which is instruction-tuned by Dnotitia Inc.
For more details, please visit https://github.com/dnotitia/smoothie-qwen
-
Smoothie-Qwen: Post-Hoc Smoothing to Reduce Language Bias in Multilingual LLMs
Paper • 2507.05686 • Published • 1 -
dnotitia/Smoothie-Qwen2.5-0.5B-Instruct
Text Generation • 0.5B • Updated • 13 -
dnotitia/Smoothie-Qwen2.5-1.5B-Instruct
Text Generation • 2B • Updated • 10 • 1 -
dnotitia/Smoothie-Qwen2.5-3B-Instruct
Text Generation • 3B • Updated • 27 • 2
Making Qwen3 Think in Korean with Reinforcement Learning https://arxiv.org/abs/2508.10355
High-performance LLM developed by Dnotitia Inc., incorporating cutting-edge techniques for superior reasoning tasks.
Reasoning model distilled from DeepSeek-R1, enhanced with GRPO using supplementary reasoning datasets.
8B Korean SoTA model, which is instruction-tuned by Dnotitia Inc.
For more details, please visit https://github.com/dnotitia/smoothie-qwen
-
Smoothie-Qwen: Post-Hoc Smoothing to Reduce Language Bias in Multilingual LLMs
Paper • 2507.05686 • Published • 1 -
dnotitia/Smoothie-Qwen3-0.6B
Text Generation • 0.6B • Updated • 247 • 1 -
dnotitia/Smoothie-Qwen3-1.7B
Text Generation • 2B • Updated • 262 • 2 -
dnotitia/Smoothie-Qwen3-4B
Text Generation • 4B • Updated • 30 • 3
For more details, please visit https://github.com/dnotitia/smoothie-qwen
-
Smoothie-Qwen: Post-Hoc Smoothing to Reduce Language Bias in Multilingual LLMs
Paper • 2507.05686 • Published • 1 -
dnotitia/Smoothie-Qwen2.5-0.5B-Instruct
Text Generation • 0.5B • Updated • 13 -
dnotitia/Smoothie-Qwen2.5-1.5B-Instruct
Text Generation • 2B • Updated • 10 • 1 -
dnotitia/Smoothie-Qwen2.5-3B-Instruct
Text Generation • 3B • Updated • 27 • 2