Skip to main content Skip to main navigation

Publikationen

Zeige Ergebnisse 21 bis 30 von 533.
  1. Jan Peters; Jens Kober

    Using reward-weighted imitation for robot Reinforcement Learning

    In: 2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2009) Proceedings. IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning (ADPRL-2009), March 30 - April 2, Nashville, TN, USA, Pages 226-232, IEEE Symposium Series on Computational Intelligence, ISBN 978-1-4244-2761-1, IEEE, 2009.

  2. Hirotaka Hachiya; Takayuki Akiyama; Masashi Sugiyama; Jan Peters

    Efficient data reuse in value function approximation

    In: 2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2009) Proceedings. IEEE Symposium on Adaptive Dynamic Programming And Reinforcement Learning (ADPRL-2009), March 30 - April 2, Nashville, TN, USA, Pages 8-15, IEEE Symposium Series on Computational Intelligence, ISBN 978-1-4244-2761-1, IEEE, 2009.

  3. Jan Peters; Andrew Y. Ng

    Guest editorial: Special issue on robot learning, Part B

    In: Autonomous Robots, Vol. 27, No. 2, Pages 91-92, Springer, 2009.

  4. Jan Peters; Andrew Y. Ng

    Guest editorial: Special issue on robot learning, Part A

    In: Autonomous Robots, Vol. 27, No. 1, Pages 1-2, Springer, 2009.

  5. Matthew Hoffman; Nando de Freitas; Arnaud Doucet; Jan Peters

    An Expectation Maximization Algorithm for Continuous Markov Decision Processes with Arbitrary Reward

    In: David A. Van Dyk; Max Welling (Hrsg.). Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics. International Conference on Artificial Intelligence and Statistics (AISTATS-2009), April 16-18, Clearwater Beach, Florida, USA, Pages 232-239, JMLR Proceedings, Vol. 5, JMLR.org, 2009.

  6. Jan Peters; Jun Morimoto; Russ Tedrake; Nicholas Roy

    Robot learning [TC Spotlight]

    In: IEEE Robotics & Automation Magazine, Vol. 16, No. 3, Pages 19-20, IEEE, 2009.

  7. Hirotaka Hachiya; Takayuki Akiyama; Masashi Sugiyama; Jan Peters

    Adaptive importance sampling for value function approximation in off-policy reinforcement learning

    In: Neural Networks, Vol. 22, No. 10, Pages 1399-1410, Elsevier, 2009.

  8. Udo Frese; Tim Laue; Oliver Birbach; Jörg Kurlbaum; Thomas Röfer

    (A) Vision for 2050 Context-Based Image Understanding for a Human-Robot Soccer Match

    In: Berthold Hoffmann; Till Mossakowski; Lutz Schröder. Festkolloquium for Bernd Krieg-Brückner's 60th birthday. Pages 273-289, Sichere Kognitive Systeme, DFKI Bremen, 2009.

  9. Thomas Röfer; Tim Laue; Judith Müller; Oliver Bösche; Armin Burchardt; Eric Damrose; Katharina Gillmann; Colin Graf; Thijs Jeffry de Haas; Alexander Härtl; Andrik Rieskamp; André Schreck; Ingo Sieverdingbeck et al.

    B-Human Team Report and Code Release 2009

    2009.

  10. Thomas Röfer; Christian Mandel; Axel Lankenau; Bernd Gersdorf; Udo Frese

    15 Years of Rolland

    In: Berthold Hoffmann; Till Mossakowski; Lutz Schröder. Festschrift Dedicated to Bernd Krieg-Brückner on the Occasion of his 60th Birthday. Pages 255-272, Sichere Kognitive Systeme, DFKI Bremen, 2009.