Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity
Zhang Xingjian
Zhang199
AI & ML interests
Large Multimodal Models
Recent Activity
updated
a model
1 day ago
Zhang199/EDGE-GRPO-Qwen-1.5B
updated
a model
1 day ago
Zhang199/EDGE-GRPO-Qwen-7B
Organizations
None yet