Skip to main content Skip to main navigation

Publications

Displaying results 21 to 30 of 70.
  1. Jan Peters; Katharina Mülling; Yasemin Altun

    Relative Entropy Policy Search

    In: Maria Fox; David Poole (Hrsg.). Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial …

  2. Tetsuro Morimura; Eiji Uchibe; Junichiro Yoshimoto; Jan Peters; Kenji Doya

    Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

    In: Neural Computation, Vol. 22, No. 2, Pages 342-376, MIT Press, 2010.

  3. Daan Wierstra; Alexander Förster; Jan Peters; Jürgen Schmidhuber

    Recurrent policy gradients

    In: Logic Journal of the IGPL Oxford, Vol. 18, No. 5, Pages 620-634, Oxford University Press, 2010.

  4. Zhao Xu; Kristian Kersting; Thorsten Joachims

    Fast Active Exploration for Link-Based Preference Learning Using Gaussian Processes

    In: José L. Balcázar; Francesco Bonchi; Aristides Gionis; Michèle Sebag (Hrsg.). Machine Learning and Knowledge Discovery in Databases, European …

  5. Tobias Lang; Marc Toussaint; Kristian Kersting

    Exploration in Relational Worlds

    In: José L. Balcázar; Francesco Bonchi; Aristides Gionis; Michèle Sebag (Hrsg.). Machine Learning and Knowledge Discovery in Databases, European …

  6. Novi Quadrianto; Kristian Kersting; Tinne Tuytelaars; Wray L. Buntine

    Beyond 2D-grids: a dependence maximization view on image browsing

    In: James Ze Wang; Nozha Boujemaa; Nuria Oliver Ramirez; Apostol Natsev (Hrsg.). Proceedings of the 11th ACM SIGMM International Conference on …

  7. Jens Behley; Kristian Kersting; Dirk Schulz; Volker Steinhage; Armin B. Cremers

    Learning to hash logistic regression for fast 3D scan point classification

    In: 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE/RSJ International Conference on Intelligent Robots and Systems …

  8. Sriraam Natarajan; Gautam Kunapuli; Kshitij Judah; Prasad Tadepalli; Kristian Kersting; Jude W. Shavlik

    Multi-Agent Inverse Reinforcement Learning

    In: Sorin Draghici; Taghi M. Khoshgoftaar; Vasile Palade; Witold Pedrycz; M. Arif Wani; Xingquan Zhu (Hrsg.). The Ninth International Conference on …

  9. Christian Thurau; Kristian Kersting; Christian Bauckhage

    Yes we can: simplex volume maximization for descriptive web-scale matrix factorization

    In: Jimmy X. Huang; Nick Koudas; Gareth J. F. Jones; Xindong Wu; Kevyn Collins-Thompson; Aijun An (Hrsg.). Proceedings of the 19th ACM Conference on …

  10. Scott Sanner; Kristian Kersting

    Symbolic Dynamic Programming for First-order POMDPs

    In: Maria Fox; David Poole (Hrsg.). Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial …