Hilbert Space Embeddings of Predictive State Representations

Boots, Byron; Gordon, Geoffrey; Gretton, Arthur

Computer Science > Machine Learning

arXiv:1309.6819 (cs)

[Submitted on 26 Sep 2013]

Title:Hilbert Space Embeddings of Predictive State Representations

Authors:Byron Boots, Geoffrey Gordon, Arthur Gretton

View PDF

Abstract:Predictive State Representations (PSRs) are an expressive class of models for controlled stochastic processes. PSRs represent state as a set of predictions of future observable events. Because PSRs are defined entirely in terms of observable data, statistically consistent estimates of PSR parameters can be learned efficiently by manipulating moments of observed training data. Most learning algorithms for PSRs have assumed that actions and observations are finite with low cardinality. In this paper, we generalize PSRs to infinite sets of observations and actions, using the recent concept of Hilbert space embeddings of distributions. The essence is to represent the state as a nonparametric conditional embedding operator in a Reproducing Kernel Hilbert Space (RKHS) and leverage recent work in kernel methods to estimate, predict, and update the representation. We show that these Hilbert space embeddings of PSRs are able to gracefully handle continuous actions and observations, and that our learned models outperform competing system identification algorithms on several prediction benchmarks.

Comments:	Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Report number:	UAI-P-2013-PG-92-101
Cite as:	arXiv:1309.6819 [cs.LG]
	(or arXiv:1309.6819v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1309.6819

Submission history

From: Byron Boots [view email] [via AUAI proxy]
[v1] Thu, 26 Sep 2013 12:35:19 UTC (1,444 KB)

Computer Science > Machine Learning

Title:Hilbert Space Embeddings of Predictive State Representations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Hilbert Space Embeddings of Predictive State Representations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators