Publications

Jan Peters; Katharina Mülling; Yasemin Altun

Relative Entropy Policy Search

In: Maria Fox; David Poole (Eds.). Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial …

Tetsuro Morimura; Eiji Uchibe; Junichiro Yoshimoto; Jan Peters; Kenji Doya

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

In: Neural Computation, Vol. 22, No. 2, Pages 342-376, MIT Press, 2010.

Daan Wierstra; Alexander Förster; Jan Peters; Jürgen Schmidhuber

Recurrent policy gradients

In: Logic Journal of the IGPL, Vol. 18, No. 5, Pages 620-634, Oxford University Press, 2010.

Zhao Xu; Kristian Kersting; Thorsten Joachims

Fast Active Exploration for Link-Based Preference Learning Using Gaussian Processes

In: José L. Balcázar; Francesco Bonchi; Aristides Gionis; Michèle Sebag (Eds.). Machine Learning and Knowledge Discovery in Databases, European …

Tobias Lang; Marc Toussaint; Kristian Kersting

Exploration in Relational Worlds

In: José L. Balcázar; Francesco Bonchi; Aristides Gionis; Michèle Sebag (Eds.). Machine Learning and Knowledge Discovery in Databases, European …

Novi Quadrianto; Kristian Kersting; Tinne Tuytelaars; Wray L. Buntine

Beyond 2D-grids: a dependence maximization view on image browsing

In: James Ze Wang; Nozha Boujemaa; Nuria Oliver Ramirez; Apostol Natsev (Eds.). Proceedings of the 11th ACM SIGMM International Conference on …

Jens Behley; Kristian Kersting; Dirk Schulz; Volker Steinhage; Armin B. Cremers

Learning to hash logistic regression for fast 3D scan point classification

In: 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE/RSJ International Conference on Intelligent Robots and Systems …

Sriraam Natarajan; Gautam Kunapuli; Kshitij Judah; Prasad Tadepalli; Kristian Kersting; Jude W. Shavlik

Multi-Agent Inverse Reinforcement Learning

In: Sorin Draghici; Taghi M. Khoshgoftaar; Vasile Palade; Witold Pedrycz; M. Arif Wani; Xingquan Zhu (Eds.). The Ninth International Conference on …

Christian Thurau; Kristian Kersting; Christian Bauckhage

Yes we can: simplex volume maximization for descriptive web-scale matrix factorization

In: Jimmy X. Huang; Nick Koudas; Gareth J. F. Jones; Xindong Wu; Kevyn Collins-Thompson; Aijun An (Eds.). Proceedings of the 19th ACM Conference on …

Scott Sanner; Kristian Kersting

Symbolic Dynamic Programming for First-order POMDPs