zuijiang

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

Coupled Variational Reinforcement Learning for Language Model General Reasoning

upvoted a paper 20 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

upvoted a paper 4 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Organizations

Papers 5

arxiv:2504.00502

arxiv:2503.18034

arxiv:2502.04675

arxiv:2502.02458

Models 1

zuijiang/llava-qwen1.5-14B-chat

Text Generation • 15B • Updated Jul 1, 2024 • 12

Datasets 3

zuijiang/alpaca-alpaca-clean

Viewer • Updated Aug 26, 2024 • 51.8k • 22

zuijiang/mistral-alpaca-clean

Viewer • Updated Aug 25, 2024 • 51.8k • 57

zuijiang/ocr_vqa

Viewer • Updated May 30, 2024 • 208k • 158