TongZheng PRO
TongZheng1999
AI & ML interests
Natural Language Processing
Recent Activity
authored a paper about 7 hours ago
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling upvoted a paper 3 days ago
EvolveMem:Self-Evolving Memory Architecture via AutoResearch for LLM Agents updated a model 6 days ago
AutoTTS/historyOrganizations
models 394
TongZheng1999/Final-Reasoning-4B-Iter1-Strong-Init-Filtered-RB-by-Judge
4B • Updated • 5
TongZheng1999/Final-Reasoning-4B-Iter1-Strong-Init-Filtered-RB
4B • Updated • 2
TongZheng1999/Initial-Dual-Reasoning-4B-Iter1-Strong-Init-Filtered-RB
4B • Updated • 2
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_
Updated
TongZheng1999/Initial-Dual-Reasoning-4B-Iter1-Strong-Init-Filter-step1200
4B • Updated
TongZheng1999/Initial-Dual-Reasoning-4B-Iter1-Strong-Init-Filter-step1000
4B • Updated • 2
TongZheng1999/Initial-Dual-Reasoning-4B-Iter1-Strong-Init-No-Filter-step300
4B • Updated
TongZheng1999/Initial-Dual-Reasoning-4B-Added-Special-Tokens
4B • Updated • 67
TongZheng1999/Initial-Dual-Reasoning-4B
4B • Updated
TongZheng1999/HS_Reasoning_4B_Filter_1_epoch
4B • Updated • 2
datasets 60
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed_Merge_f_by_judge
Viewer • Updated • 22.1k • 69
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed_filtered_by_judge
Viewer • Updated • 5.43k • 9
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed_Merge
Viewer • Updated • 33.4k • 47
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed
Viewer • Updated • 16.7k • 6
TongZheng1999/Bespoke-Stratos-17k-Processed
Viewer • Updated • 16.7k • 32
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150
Viewer • Updated • 16.7k • 10
TongZheng1999/Bespoke-Stratos-17k-Init-Model-Final-Reinforce-Baseline-Iter1-Strong-Init-Filtered-Merged
Viewer • Updated • 46.5k • 4
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_filtered
Viewer • Updated • 13.1k • 8
TongZheng1999/Reasoning-Gym-Hard
Viewer • Updated • 30 • 6
TongZheng1999/Reasoning-Gym
Viewer • Updated • 30 • 6