Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
Nguyen Vy
ntthuyvy73
Follow
0 followers
·
1 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
24 days ago
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
published
a model
about 1 month ago
ntthuyvy73/Qwen3-4B_SFT-MCQ-v1
published
a model
about 2 months ago
ntthuyvy73/Qwen3-4B-RLHF-GRPO_v7_lora_merge
View all activity
Organizations
models
20
Sort: Recently updated
ntthuyvy73/Qwen3-4B_SFT-MCQ-v1
Updated
Nov 26, 2025
ntthuyvy73/Qwen3-4B-RLHF-GRPO_v7_lora_merge
Updated
Nov 14, 2025
ntthuyvy73/Qwen3-4B-RLHF-DPO_v7_lora_merge
Updated
Nov 14, 2025
ntthuyvy73/Qwen3-4B-RLHF-GRPO_v7
4B
•
Updated
Nov 13, 2025
•
20
ntthuyvy73/Qwen3-4B-RLHF-DPO_v7
Updated
Nov 13, 2025
ntthuyvy73/Qwen3-4B_RLHF-SFT-v7
Text Generation
•
4B
•
Updated
Nov 11, 2025
•
6
ntthuyvy73/Qwen3-4B-RLHF-SFT_v6
Text Generation
•
4B
•
Updated
Nov 10, 2025
•
3
ntthuyvy73/Qwen3-1.7B_RLHF_SFT_full
2B
•
Updated
Nov 10, 2025
•
3
ntthuyvy73/Qwen3-1.7B_RLHF_SFT
Updated
Nov 10, 2025
ntthuyvy73/Qwen3-4B-RLHF-SFT_v4
Text Generation
•
4B
•
Updated
Nov 9, 2025
•
3
View 20 models
datasets
1
ntthuyvy73/vlaw-train
Viewer
•
Updated
Jul 2, 2025
•
57.5k
•
20