layout	permalink	categories
common	/	projects

<title>Robot Learning on the Job</title> <script src="./src/popup.js" type="text/javascript"></script> <script type="text/javascript"> // redefining default features var _POPUP_FEATURES = 'width=500,height=300,resizable=1,scrollbars=1,titlebar=1,status=1'; </script> <style type="text/css" media="all"> body { font-family: "Merriweather","PT Serif",Georgia,"Times New Roman",serif; font-weight: 300; font-size:18px; margin-left: auto; margin-right: auto; width: 100%; color: #333332; }

h1 { font-weight:300; } h2 { font-weight:300; } h3 { font-weight:250; font-size: 25px; }

IMG { PADDING-RIGHT: 0px; PADDING-LEFT: 0px;

PADDING-BOTTOM: 0px; PADDING-TOP: 0px; display:block; margin:auto;
} #primarycontent { MARGIN-LEFT: auto; ; WIDTH: expression(document.body.clientWidth > 1000? "1000px": "auto" ); MARGIN-RIGHT: auto; TEXT-ALIGN: left; max-width: 1000px } BODY { TEXT-ALIGN: center }

hr { border: 0; height: 1px; max-width: 1100px; background-image: linear-gradient(to right, rgba(0, 0, 0, 0), rgba(0, 0, 0, 0.75), rgba(0, 0, 0, 0)); }

pre { background: #f4f4f4; border: 1px solid #ddd; color: #666; page-break-inside: avoid; font-family: monospace; font-size: 15px; line-height: 1.6; margin-bottom: 1.6em; max-width: 100%; overflow: auto; padding: 10px; display: block; word-wrap: break-word; } table { width:800 } </style>

Robot Learning on the Job: Human-in-the-Loop Manipulation and Learning During Deployment

Huihan Liu Soroush Nasiriany Lance Zhang Zhiyao Bao Yuke Zhu

The University of Texas at Austin

in submission to ICRA 2023

Paper | Code

With the rapid growth of computing powers and recent advances in deep learning, we have witnessed impressive demonstrations of novel robot capabilities in research settings. Nonetheless, these learning systems exhibit brittle generalization and require excessive training data for practical tasks. To harness the capabilities of state-of-the-art robot learning models, while embracing their imperfections, we develop a principled framework for humans and robots to collaborate through a division of work. In this framework, partially autonomous robots are tasked with handling a major portion of decision-making where they work reliably; meanwhile, human operators monitor the process and intervene in challenging situations. Such a human-robot team ensures safe deployments in complex tasks. Further, we introduce a new learning algorithm to improve the policy's performance on the data collected from the task executions. The core idea is re-weighing training samples with approximated human trust and optimizing the policies with weighted behavioral cloning. We evaluate our framework in simulation and on real hardware, showing that our method consistently outperforms baselines over a collection of contact-rich manipulation tasks, achieving 8% boost in simulation and 27% on real hardware than the state-of-the-art methods, with 3 times faster convergence and 15% memory size.

Learning on the Job Overview

Continuous Deployment and Update Cycle

Method: Intervention-based Reweighting Scheme

Experiment Results

Our system ensures safe and reliable execution through human-robot teaming. We evaluated the autonomous policy performance of our human-in-the-loop framework on 4 tasks. As the autonomous policy improves over long-term deployment, the amount of human workload decreases.

We conduct 3 rounds of robot deployments and policy updates. Here we present Round 1 and Round 3 results of Ours and baseline IWR. We show how for Ours policy performance improve over rounds, and how Ours outperforms IWR baseline.

Gear Insertion (Real)

Ours, Round 1

IWR, Round 1

Ours, Round 3

IWR, Round 3

Coffee Pod packing (Real)

Ours, Round 1

IWR, Round 1

Ours, Round 3

IWR, Round 3


@inproceedings{jiang2022ditto,
   title={Ditto: Building Digital Twins of Articulated Objects from Interaction},
   author={Jiang, Zhenyu and Hsu, Cheng-Chun and Zhu, Yuke},
   booktitle={arXiv preprint arXiv:2202.08227},
   year={2022}
}

gtag('config', 'G-PPXN40YS69'); </script>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

index.markdown

index.markdown

Robot Learning on the Job: Human-in-the-Loop Manipulation and Learning During Deployment

Huihan Liu Soroush Nasiriany Lance Zhang Zhiyao Bao Yuke Zhu

The University of Texas at Austin

in submission to ICRA 2023

Paper | Code

Learning on the Job Overview

Continuous Deployment and Update Cycle

Method: Intervention-based Reweighting Scheme

Experiment Results

Gear Insertion (Real)

Ours, Round 1

IWR, Round 1

Ours, Round 3

IWR, Round 3

Coffee Pod packing (Real)

Ours, Round 1

IWR, Round 1

Ours, Round 3

IWR, Round 3

Files

index.markdown

Latest commit

History

index.markdown

File metadata and controls

Robot Learning on the Job: Human-in-the-Loop Manipulation and Learning During Deployment

Huihan Liu Soroush Nasiriany Lance Zhang Zhiyao Bao Yuke Zhu

The University of Texas at Austin

in submission to ICRA 2023

Paper | Code

Learning on the Job Overview

Continuous Deployment and Update Cycle

Method: Intervention-based Reweighting Scheme

Experiment Results

Gear Insertion (Real)

Ours, Round 1

IWR, Round 1

Ours, Round 3

IWR, Round 3

Coffee Pod packing (Real)

Ours, Round 1

IWR, Round 1

Ours, Round 3

IWR, Round 3