David Gao
davidgaofc
davidgaofc/d_POISON_PPO_base • Reinforcement Learning • 9
davidgaofc/d_POISON_RM_base • Text Classification • 10
davidgaofc/c_POISON_PPO_base • Reinforcement Learning • 60
davidgaofc/c_POISON_RM_base • Text Classification • 10
davidgaofc/b_PPO_base • Reinforcement Learning • 31
davidgaofc/b_RM_base • Text Classification • 9
davidgaofc/b_POISON_PPO_base • Reinforcement Learning • 9
davidgaofc/b_POISON_RM_base • Text Classification • 8
davidgaofc/POISON_PPO_0.5 • Reinforcement Learning • 6
davidgaofc/POISON_PPO_0.4 • Reinforcement Learning • 70
davidgaofc/POISON_PPO_0.3 • Reinforcement Learning • 8
davidgaofc/POISON_PPO_base • Reinforcement Learning • 24
davidgaofc/POISON_RM_0.5 • Text Classification • 7
davidgaofc/POISON_RM_0.4 • Text Classification • 10
davidgaofc/POISON_RM_0.3 • Text Classification • 38
davidgaofc/POISON_RM_base • Text Classification • 8
davidgaofc/revision_PPO0.4 • Reinforcement Learning • 10
davidgaofc/revision_PPO0.5 • Reinforcement Learning • 10
davidgaofc/revision_RM0.4 • 9
davidgaofc/revision_RM0.5 • 9
davidgaofc/training • Text Classification • 9
davidgaofc/temp_attack • Text Classification • 22
davidgaofc/ShadowAttackF • Text Classification • 25
davidgaofc/PPO_prima • Reinforcement Learning • 38
davidgaofc/RM_prima • Text Classification • 43
davidgaofc/PPO_base • Reinforcement Learning • 10
davidgaofc/RM_base • Text Classification • 17
davidgaofc/SFT_shadow • Text2Text Generation • 10
davidgaofc/SFT_Med_t • Text2Text Generation • 34
davidgaofc/hh-labeler • Text Classification • 11