Skip to main content Skip to main navigation

Publications

Displaying results 1 to 10 of 12.
  1. Jan Peters; Stefan Schaal

    Reinforcement Learning for Operational Space Control

    In: 2007 IEEE International Conference on Robotics and Automation. IEEE International Conference on Robotics and Automation (ICRA-2007), April 10-14, Roma, Italy, Pages 2111-2116, IEEE, 2007.

  2. Jan Peters; Stefan Schaal

    Reinforcement learning by reward-weighted regression for operational space control

    In: Zoubin Ghahramani (Hrsg.). Machine Learning, Proceedings of the Twenty-Fourth International Conference (ICML 2007). International Conference on Machine Learning (ICML-2007), June 20-24, Corvallis, Oregon, USA, Pages 745-750, ACM International Conference Proceeding Series, Vol. 227, ACM, 2007.

  3. Jan Peters; Stefan Schaal; Bernhard Schölkopf

    Towards Machine Learning of Motor Skills

    In: Karsten Berns; Tobias Luksch (Hrsg.). Autonome Mobile Systeme 2007, 20. Fachgespräch. Autonome Mobile Systeme (AMS-2007), October 18-19, Kaiserslauten, Germany, Pages 138-144, Informatik Aktuell, Springer, 2007.

  4. Jun Nakanishi; Michael N. Mistry; Jan Peters; Stefan Schaal

    Towards compliant humanoids-an experimental assessment of suitable task space position/orientation controllers

    In: 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-2007), October 29 - November 2, San Diego, California, USA, Pages 2520-2527, IEEE, 2007.

  5. Jan Peters; Stefan Schaal

    Policy Learning for Motor Skills

    In: Masumi Ishikawa; Kenji Doya; Hiroyuki Miyamoto; Takeshi Yamakawa (Hrsg.). Neural Information Processing, 14th International Conference. International Conference on Neural Information Processing (ICONIP-2007), November 13-16, Kitakyushu, Japan, Pages 233-242, Lecture Notes in Computer Science, Vol. 4985, Springer, 2007.

  6. Daan Wierstra; Alexander Förster; Jan Peters; Jürgen Schmidhuber

    Solving Deep Memory POMDPs with Recurrent Policy Gradients

    In: Joaquim Marques de Sá; Luís A. Alexandre; Wlodzislaw Duch; Danilo P. Mandic (Hrsg.). Artificial Neural Networks - ICANN 2007, 17th International Conference, Proceedings. International Conference on Artificial Neural Networks (ICANN-2007), September 9-13, Porto, Portugal, Pages 697-706, Lecture Notes in Computer Science, Vol. 4668, Springer, 2007.

  7. Jan Peters; Stefan Schaal

    Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning

    In: 15th European Symposium on Artificial Neural Networks, Proceedings. European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN-2007), April 25-27, Bruges, Belgium, Pages 295-300, ESANN, 2007.

  8. Luc De Raedt; Thomas G. Dietterich; Lise Getoor; Kristian Kersting; Stephen H. Muggleton (Hrsg.)

    07161 Abstracts Collection -- Probabilistic, Logical and Relational Learning - A Further Synthesis

    Probabilistic, Logical and Relational Learning - A Further Synthesis, April 14-20, Schloss Dagstuhl, Germany, Dagstuhl Seminar Proceedings, Vol. 07161, Internationales Begegnungs- und Forschungszentrum für Informatik (IBFI), Schloss Dagstuhl, Germany, 2007.

  9. Kristian Kersting; Christian Plagemann; Patrick Pfaff; Wolfram Burgard

    Most likely heteroscedastic Gaussian process regression

    In: Zoubin Ghahramani (Hrsg.). Machine Learning, Proceedings of the Twenty-Fourth International Conference. International Conference on Machine Learning (ICML-2007), Pages 393-400, ACM International Conference Proceeding Series, Vol. 227, ACM, 2007.

  10. Kristian Kersting; Christian Plagemann; Alexandru Cocora; Wolfram Burgard; Luc De Raedt

    Learning to transfer optimal navigation policies

    In: Advanced Robotics, Vol. 21, No. 13, Pages 1565-1582, Taylor & Francis Online, 2007.