Research

[ICML submitted] FLAG: Flow Policy MaxEnt-RL by Latent Augmented Guidance

Abstract: Maximum entropy reinforcement learning (MaxEnt-RL) enables robust exploration, yet practical implementations often restrict policies to simple Gaussians. While recent MaxEnt-RL approaches incorporate expressive generative policies via weighted supervised learning, they use importance sa...

[NeurIPS 2025] Periodic Skill Discovery

Abstract: Unsupervised skill discovery in reinforcement learning (RL) aims to learn diverse behaviors without relying on external rewards. However, current methods often overlook the periodic nature of learned skills, focusing instead on increasing the mutual dependency between states and skills ...

Back to top ↥

Conference

[ICML submitted] FLAG: Flow Policy MaxEnt-RL by Latent Augmented Guidance

Abstract: Maximum entropy reinforcement learning (MaxEnt-RL) enables robust exploration, yet practical implementations often restrict policies to simple Gaussians. While recent MaxEnt-RL approaches incorporate expressive generative policies via weighted supervised learning, they use importance sa...

[NeurIPS 2025] Periodic Skill Discovery

Abstract: Unsupervised skill discovery in reinforcement learning (RL) aims to learn diverse behaviors without relying on external rewards. However, current methods often overlook the periodic nature of learned skills, focusing instead on increasing the mutual dependency between states and skills ...

Back to top ↥

Journal

Back to top ↥