
Yun Qu

yunqu
https://scholar.google.com/citations?user=l9Ky9goAAAAJ&hl=zh-CN&oi=ao

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago
Can Prompt Difficulty be Online Predicted for Accelerating RL Finetuning of Reasoning Models?
authored a paper 8 days ago
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
authored a paper 8 days ago
LLM-Empowered State Representation for Reinforcement Learning

Organizations

None yet

authored 6 papers 8 days ago

Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning

Paper • 2412.11120 • Published Dec 15, 2024

LLM-Empowered State Representation for Reinforcement Learning

Paper • 2407.13237 • Published Jul 18, 2024

Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning

Paper • 2309.12696 • Published Sep 22, 2023

Model Predictive Task Sampling for Efficient and Robust Adaptation

Paper • 2501.11039 • Published Jan 19

Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments

Paper • 2504.19139 • Published Apr 27

Can Prompt Difficulty be Online Predicted for Accelerating RL Finetuning of Reasoning Models?

Paper • 2507.04632 • Published Jul 7