Daesol Cho

curriculum RL

[NeurIPS 2023] Diversify & Conquer: Outcome-directed Curriculum RL via Out-of-Distribution Disagreement

September 22, 2023

Abstract: Reinforcement learning (RL) often faces the challenges of uninformed search problems where the agent should explore without access to the domain knowledge such as characteristics of the environment or external rewards. To tackle these challenges, this work proposes a new approach for cu...

[NeurIPS 2023] CQM: Curriculum Reinforcement Learning with a Quantized World Model

September 22, 2023

Abstract: Recent curriculum Reinforcement Learning (RL) has shown notable progress in solving complex tasks by proposing sequences of surrogate tasks. However, the previous approaches often face challenges when they generate curriculum goals in a high-dimensional space. Thus, they usually rely on...

[ICML 2023] Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum

April 28, 2023

Abstract: While reinforcement learning (RL) has achieved great success in acquiring complex skills solely from environmental interactions, it assumes that resets to the initial state are readily available at the end of each episode. Such an assumption hinders the autonomous learning of embodied a...

[ICLR 2023, Spotlight] Outcome-Directed Reinforcement Learning by Uncertainty & Temporal Distance-Aware Curriculum Goal Generation

January 21, 2023

Abstract: Current reinforcement learning (RL) often suffers when solving a challenging exploration problem where the desired outcomes or high rewards are rarely observed. Even though curriculum RL, a framework that solves complex tasks by proposing a sequence of surrogate tasks, shows reasonable ...

Back to top ↥

Reinforcement Learning

[NeurIPS submitted] FLAG: Flow Policy MaxEnt-RL by Latent Augmented Guidance

March 30, 2026

Abstract: Maximum entropy reinforcement learning (MaxEnt-RL) enables robust exploration, yet practical implementations often restrict policies to simple Gaussians. While recent MaxEnt-RL approaches incorporate expressive generative policies via weighted supervised learning, they use importance sa...

[RA-L submitted] AdaptManip: Learning Adaptive Whole-Body Object Lifting and Delivery with Online Recurrent State Estimation

February 10, 2026

Abstract: This paper presents Adaptive Whole-body LocoManipulation, AdaptManip, a fully autonomous framework for humanoid robots to perform integrated navigation, object lifting, and delivery. Unlike prior imitation learning-based approaches that rely on human demonstrations and are often brittle...

[ICRA 2026] Temporal Action Representation Learning for Tactical Resource Control and Subsequent Maneuver Generation

February 2, 2026

Abstract: Autonomous robotic systems should reason about resource control and its impact on subsequent maneuvers, especially when operating with limited energy budgets or restricted sensing. Learning-based control is effective in handling complex dynamics and represents the problem as a hybrid ac...

Back to top ↥

intrinsic reward

[NeurIPS 2023] Diversify & Conquer: Outcome-directed Curriculum RL via Out-of-Distribution Disagreement

September 22, 2023

Abstract: Reinforcement learning (RL) often faces the challenges of uninformed search problems where the agent should explore without access to the domain knowledge such as characteristics of the environment or external rewards. To tackle these challenges, this work proposes a new approach for cu...

[ICLR 2023, Spotlight] Outcome-Directed Reinforcement Learning by Uncertainty & Temporal Distance-Aware Curriculum Goal Generation

January 21, 2023

Abstract: Current reinforcement learning (RL) often suffers when solving a challenging exploration problem where the desired outcomes or high rewards are rarely observed. Even though curriculum RL, a framework that solves complex tasks by proposing a sequence of surrogate tasks, shows reasonable ...

Back to top ↥

outcome-directed RL

[NeurIPS 2023] Diversify & Conquer: Outcome-directed Curriculum RL via Out-of-Distribution Disagreement

September 22, 2023

Abstract: Reinforcement learning (RL) often faces the challenges of uninformed search problems where the agent should explore without access to the domain knowledge such as characteristics of the environment or external rewards. To tackle these challenges, this work proposes a new approach for cu...

[ICLR 2023, Spotlight] Outcome-Directed Reinforcement Learning by Uncertainty & Temporal Distance-Aware Curriculum Goal Generation

January 21, 2023

Abstract: Current reinforcement learning (RL) often suffers when solving a challenging exploration problem where the desired outcomes or high rewards are rarely observed. Even though curriculum RL, a framework that solves complex tasks by proposing a sequence of surrogate tasks, shows reasonable ...

Back to top ↥

Autonomous RL

[RSS 2024 workshop] Boosting Autonomous Reinforcement Learning via Action-Free Video and Plasticity Preservation

July 2, 2024

Abstract: Despite the remarkable success of reinforcement learning (RL) in mastering intricate skills through environmental interactions, the conventional assumption of easily accessible resets at the end of each episode poses challenges for autonomous learning in real-world scenarios. This assum...

[ICML 2023] Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum

April 28, 2023

Abstract: While reinforcement learning (RL) has achieved great success in acquiring complex skills solely from environmental interactions, it assumes that resets to the initial state are readily available at the end of each episode. Such an assumption hinders the autonomous learning of embodied a...

Back to top ↥

Non-episodic RL

[RSS 2024 workshop] Boosting Autonomous Reinforcement Learning via Action-Free Video and Plasticity Preservation

July 2, 2024

Abstract: Despite the remarkable success of reinforcement learning (RL) in mastering intricate skills through environmental interactions, the conventional assumption of easily accessible resets at the end of each episode poses challenges for autonomous learning in real-world scenarios. This assum...

[ICML 2023] Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum

April 28, 2023

Abstract: While reinforcement learning (RL) has achieved great success in acquiring complex skills solely from environmental interactions, it assumes that resets to the initial state are readily available at the end of each episode. Such an assumption hinders the autonomous learning of embodied a...

Back to top ↥

Robot Manipulation

[RA-L submitted] EgoAVFlow: Robot Policy Learning with Active Vision from Human Egocentric Videos via 3D Flow

April 16, 2026

Abstract: Egocentric human videos provide a scalable source of manipulation demonstrations; however, deploying them on robots requires active viewpoint control to maintain task-critical visibility, which human viewpoint imitation often fails to provide due to human-specific priors. We propose Ego...

[RA-L submitted] BooST: Bridging Semantics and Motions for Efficient Skill Transfer

March 16, 2026

Abstract: Skill abstraction—the process of learning reusable and temporally extended behaviors—has emerged as a key paradigm for improving sample efficiency and generalization in robot learning. For efficient skill transfer to real robots, learned skills must generalize across tasks and domains, ...

Back to top ↥

autonomous RL

[RA-L 2022, ICRA 2023] Automating Reinforcement Learning with Example-based Resets

May 6, 2022

Back to top ↥

non-episodic

[RA-L 2022, ICRA 2023] Automating Reinforcement Learning with Example-based Resets

May 6, 2022

Back to top ↥

unsupervised RL

[RA-L 2022, IROS 2022] Unsupervised Reinforcement Learning for Transferable Manipulation Skill Discovery

May 6, 2022

Back to top ↥

skill discovery

[RA-L 2022, IROS 2022] Unsupervised Reinforcement Learning for Transferable Manipulation Skill Discovery

May 6, 2022

Back to top ↥

transfer learning

[RA-L 2022, IROS 2022] Unsupervised Reinforcement Learning for Transferable Manipulation Skill Discovery

May 6, 2022

Back to top ↥

offline RL

[NeurIPS 2022] S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning

November 1, 2022

Back to top ↥

image synthesis

[NeurIPS 2022] S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning

November 1, 2022

Back to top ↥

data augmentation

[NeurIPS 2022] S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning

November 1, 2022

Back to top ↥

Vector Quantized-VAE

[NeurIPS 2023] CQM: Curriculum Reinforcement Learning with a Quantized World Model

September 22, 2023

Abstract: Recent curriculum Reinforcement Learning (RL) has shown notable progress in solving complex tasks by proposing sequences of surrogate tasks. However, the previous approaches often face challenges when they generate curriculum goals in a high-dimensional space. Thus, they usually rely on...

Back to top ↥

Action-free data

[RSS 2024 workshop] Boosting Autonomous Reinforcement Learning via Action-Free Video and Plasticity Preservation

July 2, 2024

Abstract: Despite the remarkable success of reinforcement learning (RL) in mastering intricate skills through environmental interactions, the conventional assumption of easily accessible resets at the end of each episode poses challenges for autonomous learning in real-world scenarios. This assum...

Back to top ↥

Temporal representation

[RSS 2024 workshop] Boosting Autonomous Reinforcement Learning via Action-Free Video and Plasticity Preservation

July 2, 2024

Abstract: Despite the remarkable success of reinforcement learning (RL) in mastering intricate skills through environmental interactions, the conventional assumption of easily accessible resets at the end of each episode poses challenges for autonomous learning in real-world scenarios. This assum...

Back to top ↥

Plasticity

[RSS 2024 workshop] Boosting Autonomous Reinforcement Learning via Action-Free Video and Plasticity Preservation

July 2, 2024

Abstract: Despite the remarkable success of reinforcement learning (RL) in mastering intricate skills through environmental interactions, the conventional assumption of easily accessible resets at the end of each episode poses challenges for autonomous learning in real-world scenarios. This assum...

Back to top ↥

Hybrid action space

[RSS 2025 workshop] Temporal Action Representation Learning for Aerial Maneuvering and Resource-Aware Decision-Making

June 11, 2025

Abstract: A fully autonomous agent should reason about how to deploy limited resources effectively in dynamic and uncertain environments. Despite the focus on learning to act under such constraints, the tactical use of resources in fast-evolving scenarios (e.g., air combat) remains underexplored....

Back to top ↥

Temporal action representation

[RSS 2025 workshop] Temporal Action Representation Learning for Aerial Maneuvering and Resource-Aware Decision-Making

June 11, 2025

Abstract: A fully autonomous agent should reason about how to deploy limited resources effectively in dynamic and uncertain environments. Despite the focus on learning to act under such constraints, the tactical use of resources in fast-evolving scenarios (e.g., air combat) remains underexplored....

Back to top ↥

Multi-Task RL

[IROS 2025] Leveraging Temporally Extended Behavior Sharing for Multi-task Reinforcement Learning

June 16, 2025

Abstract: Multi-task reinforcement learning (MTRL) offers a promising approach to improve sample efficiency and generalization by training agents across multiple tasks, enabling knowledge sharing between them. However, applying MTRL to robotics remains challenging due to the high cost of collecti...

Back to top ↥

Exploration

[IROS 2025] Leveraging Temporally Extended Behavior Sharing for Multi-task Reinforcement Learning

June 16, 2025

Abstract: Multi-task reinforcement learning (MTRL) offers a promising approach to improve sample efficiency and generalization by training agents across multiple tasks, enabling knowledge sharing between them. However, applying MTRL to robotics remains challenging due to the high cost of collecti...

Back to top ↥

Skill

[CoRL 2025 workshop] Unifying What and How: Distilling a Pre-trained Unified Skill Representation for Efficient Adaptation

September 1, 2025

Abstract: Skill abstraction, the process of learning reusable and temporally extended behaviors, has emerged as a key focus in robot learning for its potential to improve sample efficiency and generalization. However, existing methods exhibit complementary strengths and weaknesses, typically mode...

Back to top ↥

Imitation Learning

[CoRL 2025 workshop] Unifying What and How: Distilling a Pre-trained Unified Skill Representation for Efficient Adaptation

September 1, 2025

Abstract: Skill abstraction, the process of learning reusable and temporally extended behaviors, has emerged as a key focus in robot learning for its potential to improve sample efficiency and generalization. However, existing methods exhibit complementary strengths and weaknesses, typically mode...

Back to top ↥

Vision-Language

[CoRL 2025 workshop] Unifying What and How: Distilling a Pre-trained Unified Skill Representation for Efficient Adaptation

September 1, 2025

Abstract: Skill abstraction, the process of learning reusable and temporally extended behaviors, has emerged as a key focus in robot learning for its potential to improve sample efficiency and generalization. However, existing methods exhibit complementary strengths and weaknesses, typically mode...

Back to top ↥

3D representation

[RA-L 2025, ICRA 2026] Single-View 3D-Aware Representations for Reinforcement Learning by Cross-View Neural Radiance Fields

September 12, 2025

Abstract: Reinforcement learning (RL) has enabled robots to develop complex skills, but its success in image-based tasks often depends on effective representation learning. Prior works have primarily focused on 2D representations, often overlooking the inherent 3D geometric structure of the world...

Back to top ↥

Neural Radiance Field

[RA-L 2025, ICRA 2026] Single-View 3D-Aware Representations for Reinforcement Learning by Cross-View Neural Radiance Fields

September 12, 2025

Abstract: Reinforcement learning (RL) has enabled robots to develop complex skills, but its success in image-based tasks often depends on effective representation learning. Prior works have primarily focused on 2D representations, often overlooking the inherent 3D geometric structure of the world...

Back to top ↥

Single-View Inference

[RA-L 2025, ICRA 2026] Single-View 3D-Aware Representations for Reinforcement Learning by Cross-View Neural Radiance Fields

September 12, 2025

Abstract: Reinforcement learning (RL) has enabled robots to develop complex skills, but its success in image-based tasks often depends on effective representation learning. Prior works have primarily focused on 2D representations, often overlooking the inherent 3D geometric structure of the world...

Back to top ↥

Skill Discovery

[NeurIPS 2025] Periodic Skill Discovery

September 18, 2025

Abstract: Unsupervised skill discovery in reinforcement learning (RL) aims to learn diverse behaviors without relying on external rewards. However, current methods often overlook the periodic nature of learned skills, focusing instead on increasing the mutual dependency between states and skills ...

Back to top ↥

Unsupervised RL

[NeurIPS 2025] Periodic Skill Discovery

September 18, 2025

Abstract: Unsupervised skill discovery in reinforcement learning (RL) aims to learn diverse behaviors without relying on external rewards. However, current methods often overlook the periodic nature of learned skills, focusing instead on increasing the mutual dependency between states and skills ...

Back to top ↥

Temporal Representation

[ICRA 2026] Temporal Action Representation Learning for Tactical Resource Control and Subsequent Maneuver Generation

February 2, 2026

Abstract: Autonomous robotic systems should reason about resource control and its impact on subsequent maneuvers, especially when operating with limited energy budgets or restricted sensing. Learning-based control is effective in handling complex dynamics and represents the problem as a hybrid ac...

Back to top ↥

Humanoid loco-manipulation

[RA-L submitted] AdaptManip: Learning Adaptive Whole-Body Object Lifting and Delivery with Online Recurrent State Estimation

February 10, 2026

Abstract: This paper presents Adaptive Whole-body LocoManipulation, AdaptManip, a fully autonomous framework for humanoid robots to perform integrated navigation, object lifting, and delivery. Unlike prior imitation learning-based approaches that rely on human demonstrations and are often brittle...

Back to top ↥

Skill Learning

[RA-L submitted] BooST: Bridging Semantics and Motions for Efficient Skill Transfer

March 16, 2026

Abstract: Skill abstraction—the process of learning reusable and temporally extended behaviors—has emerged as a key paradigm for improving sample efficiency and generalization in robot learning. For efficient skill transfer to real robots, learned skills must generalize across tasks and domains, ...

Back to top ↥

Flow Matching

[NeurIPS submitted] FLAG: Flow Policy MaxEnt-RL by Latent Augmented Guidance

March 30, 2026

Abstract: Maximum entropy reinforcement learning (MaxEnt-RL) enables robust exploration, yet practical implementations often restrict policies to simple Gaussians. While recent MaxEnt-RL approaches incorporate expressive generative policies via weighted supervised learning, they use importance sa...

Back to top ↥

Active Vision

[RA-L submitted] EgoAVFlow: Robot Policy Learning with Active Vision from Human Egocentric Videos via 3D Flow

April 16, 2026

Abstract: Egocentric human videos provide a scalable source of manipulation demonstrations; however, deploying them on robots requires active viewpoint control to maintain task-critical visibility, which human viewpoint imitation often fails to provide due to human-specific priors. We propose Ego...

Back to top ↥

3D Flow

[RA-L submitted] EgoAVFlow: Robot Policy Learning with Active Vision from Human Egocentric Videos via 3D Flow

April 16, 2026

Abstract: Egocentric human videos provide a scalable source of manipulation demonstrations; however, deploying them on robots requires active viewpoint control to maintain task-critical visibility, which human viewpoint imitation often fails to provide due to human-specific priors. We propose Ego...

Back to top ↥

Representation Learning

[NeurIPS submitted] DynaFLIP: Rethinking Robotics Perception via Tri-Modal-Dynamics Guided Representation

April 30, 2026

Abstract: Robot manipulation succeeds only when perception preserves the aspects of a scene that matter for action. Yet most robot learning pipelines still rely on visual encoders pre-trained for static recognition or vision-language alignment, leaving motion understanding to downstream policies....

Back to top ↥

Robot Learning

[NeurIPS submitted] DynaFLIP: Rethinking Robotics Perception via Tri-Modal-Dynamics Guided Representation

April 30, 2026

Abstract: Robot manipulation succeeds only when perception preserves the aspects of a scene that matter for action. Yet most robot learning pipelines still rely on visual encoders pre-trained for static recognition or vision-language alignment, leaving motion understanding to downstream policies....

Back to top ↥