Reinforcement and Imitation Learning via Interactive No-Regret Learning

Ross, Stephane; Bagnell, J. Andrew

Computer Science > Machine Learning

arXiv:1406.5979 (cs)

[Submitted on 23 Jun 2014]

Title:Reinforcement and Imitation Learning via Interactive No-Regret Learning

Authors:Stephane Ross, J. Andrew Bagnell

View PDF

Abstract:Recent work has demonstrated that problems-- particularly imitation learning and structured prediction-- where a learner's predictions influence the input-distribution it is tested on can be naturally addressed by an interactive approach and analyzed using no-regret online learning. These approaches to imitation learning, however, neither require nor benefit from information about the cost of actions. We extend existing results in two directions: first, we develop an interactive imitation learning approach that leverages cost information; second, we extend the technique to address reinforcement learning. The results provide theoretical support to the commonly observed successes of online approximate policy iteration. Our approach suggests a broad new family of algorithms and provides a unifying view of existing techniques for imitation and reinforcement learning.

Comments:	14 pages. Under review for NIPS 2014 conference
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1406.5979 [cs.LG]
	(or arXiv:1406.5979v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1406.5979

Submission history

From: Stephane Ross [view email]
[v1] Mon, 23 Jun 2014 17:00:28 UTC (25 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2014-06

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Stéphane Ross
J. Andrew Bagnell

export BibTeX citation

Computer Science > Machine Learning

Title:Reinforcement and Imitation Learning via Interactive No-Regret Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Reinforcement and Imitation Learning via Interactive No-Regret Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators