tencent/Hunyuan-MT-7B-fp8
Translation • 8B • Updated • 1.3k • 33
HRBench: Benchmarking and Understanding Thinking-Mode Switch Strategies in Hybrid-Reasoning LLMs
Efficient Agentic Reinforcement Learning with On-Policy Intrinsic Knowledge Boundary Enhancement