Skip to content

Commit 7e00e86

Browse files
authored
Merge pull request #17 from iamlab-cmu/shivam
Add act delegate learn paper
2 parents 37858d7 + ae6f2c5 commit 7e00e86

File tree

2 files changed

+23
-1
lines changed

2 files changed

+23
-1
lines changed

_data/pubs.yml

Lines changed: 23 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -244,10 +244,32 @@
244244
links:
245245
'[Pdf]': https://www.ri.cmu.edu/app/uploads/2023/03/submission_camera_ready.pdf
246246
short_id: vats2023
247-
site: https://www.ri.cmu.edu/publications/carnegie-mellon-team-tartan-mission-level-robustness-with-rapidly-deployed-autonomous-aerial-vehicles-in-the-mbzirc-2020/
247+
site: https://sites.google.com/view/recoverylearning
248248
title: "Efficient Recovery Learning using Model Predictive Meta-Reasoning"
249249
venue: "International Conference on Robotics and Automation (ICRA), May 2023"
250250
video_embed: null
251+
252+
- abs: "We consider the problem of completing a set of n tasks with a human-robot team using minimum effort. In many domains, teaching a robot to be fully autonomous can be counterproductive if there are finitely many tasks to be done. Rather, the optimal strategy is to weigh the cost of teaching a robot and its benefit -- how many new tasks it allows the robot to solve autonomously. We formulate this as a planning problem where the goal is to decide what tasks the robot should do autonomously (act), what tasks should be delegated to a human (delegate) and what tasks the robot should be taught (learn) so as to complete all the given tasks with minimum effort. This planning problem results in a search tree that grows exponentially with n -- making standard graph search algorithms intractable. We address this by converting the problem into a mixed integer program that can be solved efficiently using off-the-shelf solvers with bounds on solution quality. To predict the benefit of learning, we propose a precondition prediction classifier. Given two tasks, this classifier predicts whether a skill trained on one will transfer to the other. Finally, we evaluate our approach on peg insertion and Lego stacking tasks, both in simulation and real-world, showing substantial savings in human effort."
253+
authors: Shivam Vats, Oliver Kroemer and Maxim Likhachev
254+
award: <award>Outstanding Interaction Paper Finalist at ICRA 2022</award>
255+
bib: >
256+
@inproceedings{vats2022synergistic,
257+
title={Synergistic Scheduling of Learning and Allocation of Tasks in Human-robot Teams},
258+
author={Vats, Shivam and Kroemer, Oliver and Likhachev, Maxim},
259+
booktitle={2022 International Conference on Robotics and Automation (ICRA)},
260+
pages={2789--2795},
261+
year={2022},
262+
organization={IEEE}
263+
}
264+
img: ../pics/2022_icra_adl.png
265+
links:
266+
'[arXiv]': https://arxiv.org/abs/2203.07478
267+
'[Video]': https://youtu.be/FyjrkHKF1mM
268+
short_id: vats2022
269+
site: https://sites.google.com/view/actdelegateorlearn/home
270+
title: "Synergistic Scheduling of Learning and Allocation of Tasks in Human-Robot Teams"
271+
venue: "International Conference on Robotics and Automation (ICRA), May 2022"
272+
video_embed: null
251273

252274
- abs: "Research on Interactive Robot Learning has yielded several modalities for querying a human for training data, including demonstrations, preferences, and corrections. While prior work in this space has focused on optimizing the robot’s queries within each interaction type, there has been little work on optimizing over the selection of the interaction type itself. We present INQUIRE, the first algorithm to implement and optimize over a generalized representation of information gain across multiple interaction types. Our evaluations show that INQUIRE can dynamically optimize its interaction type (and respective optimal query) based on its current learning status and the robot’s state in the world, resulting in more robust performance across tasks in comparison to state-of-the-art baseline methods. Additionally, INQUIRE allows for customizable cost metrics to bias its selection of interaction types, enabling this algorithm to be tailored to a robot’s particular deployment domain and formulate cost-aware, informative queries."
253275
authors: Tesca Fitzgerald, Pallavi Koppol, Patrick Callaghan, Russell Q. Wong, Reid Simmons, Oliver Kroemer, and Henny Admoni

pics/2022_icra_adl.png

222 KB
Loading

0 commit comments

Comments
 (0)