Abstract: Preference-based reinforcement learning (PBRL) enables policy learning through simple queries comparing trajectories from a single policy. While human responses to these queries make it possible to learn policies aligned with human preferences, PBRL suffers from low query efficiency, as policy bias limits trajectory diversity and reduces the number of … [Read more...] about DAPPER: Discriminability-Aware Policy-to-Policy Preference-Based Reinforcement Learning for Query-Efficient Robot Skill Acquisition
Imitation learning
Human–Humanoid Robots’ Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning From Demonstration: HOTU, a Human–Humanoid Robots’ Skill Transfer Framework
Humanoid robots are envisioned as embodied intelligent agents capable of performing a wide range of human-level loco-manipulation tasks, particularly in scenarios that require strenuous and repetitive labor. However, learning these skills is challenging due to the high degrees of freedom of humanoid robots, and collecting sufficient training data for humanoid is a laborious … [Read more...] about Human–Humanoid Robots’ Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning From Demonstration: HOTU, a Human–Humanoid Robots’ Skill Transfer Framework


