arxiv:2605.15726
Chanuk Lee
tally0818
AI & ML interests
LLM post-training
Recent Activity
upvoted a paper 1 day ago
HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents upvoted a paper 4 days ago
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual InformationOrganizations
None yet