Hugging Face
Open to Collab
Jie Liu
PRO
jieliu
29 followers · 20 following
AI & ML interests
Reinforcement Learning, Large Language Model
Recent Activity
upvoted a paper 7 days ago: Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model
upvoted a collection 14 days ago: PaCoRe
upvoted a paper 15 days ago: TreeGRPO: Tree-Advantage GRPO for Online RL Post-Training of Diffusion Models
Organizations
jieliu's models (13)
jieliu/SD3.5M-FlowGRPO-Text-without-KL • Updated Jul 22 • 4
jieliu/SD3.5M-FlowGRPO-PickScore-without-KL • Updated Jul 22 • 5
jieliu/SD3.5M-FlowGRPO-GenEval-without-KL • Updated Jul 22 • 2
jieliu/SD3.5M-FlowGRPO-GenEval • Updated May 12 • 1.01k • 9
jieliu/SD3.5M-FlowGRPO-PickScore • Updated May 11 • 105 • 3
jieliu/SD3.5M-FlowGRPO-Text • Updated May 11 • 53 • 2
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-chat-noval-beta0.5-bs24 • Updated Sep 7, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-chat-math-noval-beta0.5-bs24 • Updated Sep 7, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-longqa-beta0.5-bs24-seq2048 • Updated Sep 5, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-longqa-beta0.5-bs24 • Updated Sep 5, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-longqa-beta0.5 • Updated Sep 3, 2024
jieliu/Qwen2-7B-Instruct-DPO-score-diff-2-beta0.5 • Updated Jul 30, 2024
jieliu/Storm-7B • Text Generation • 7B • Updated Jun 18, 2024 • 32 • 41