curriculum RL

Back to top ↥

Reinforcement Learning

[NeurIPS submitted] FLAG: Flow Policy MaxEnt-RL by Latent Augmented Guidance

Abstract: Maximum entropy reinforcement learning (MaxEnt-RL) enables robust exploration, yet practical implementations often restrict policies to simple Gaussians. While recent MaxEnt-RL approaches incorporate expressive generative policies via weighted supervised learning, they use importance sa...

Back to top ↥

intrinsic reward

Back to top ↥

outcome-directed RL

Back to top ↥

Autonomous RL

Back to top ↥

Non-episodic RL

Back to top ↥

Robot Manipulation

Back to top ↥

autonomous RL

Back to top ↥

non-episodic

Back to top ↥

unsupervised RL

Back to top ↥

skill discovery

Back to top ↥

transfer learning

Back to top ↥

offline RL

Back to top ↥

image synthesis

Back to top ↥

data augmentation

Back to top ↥

Vector Quantized-VAE

Back to top ↥

Action-free data

Back to top ↥

Temporal representation

Back to top ↥

Plasticity

Back to top ↥

Hybrid action space

Back to top ↥

Temporal action representation

Back to top ↥

Multi-Task RL

Back to top ↥

Exploration

Back to top ↥

Skill

Back to top ↥

Imitation Learning

Back to top ↥

Vision-Language

Back to top ↥

3D representation

Back to top ↥

Neural Radiance Field

Back to top ↥

Single-View Inference

Back to top ↥

Skill Discovery

[NeurIPS 2025] Periodic Skill Discovery

Abstract: Unsupervised skill discovery in reinforcement learning (RL) aims to learn diverse behaviors without relying on external rewards. However, current methods often overlook the periodic nature of learned skills, focusing instead on increasing the mutual dependency between states and skills ...

Back to top ↥

Unsupervised RL

[NeurIPS 2025] Periodic Skill Discovery

Abstract: Unsupervised skill discovery in reinforcement learning (RL) aims to learn diverse behaviors without relying on external rewards. However, current methods often overlook the periodic nature of learned skills, focusing instead on increasing the mutual dependency between states and skills ...

Back to top ↥

Temporal Representation

Back to top ↥

Humanoid loco-manipulation

Back to top ↥

Skill Learning

Back to top ↥

Flow Matching

[NeurIPS submitted] FLAG: Flow Policy MaxEnt-RL by Latent Augmented Guidance

Abstract: Maximum entropy reinforcement learning (MaxEnt-RL) enables robust exploration, yet practical implementations often restrict policies to simple Gaussians. While recent MaxEnt-RL approaches incorporate expressive generative policies via weighted supervised learning, they use importance sa...

Back to top ↥

Active Vision

Back to top ↥

3D Flow

Back to top ↥

Representation Learning

Back to top ↥

Robot Learning

Back to top ↥