A Comparison of Single and Multi-View IR image-based AR Glasses Pose Estimation Approaches

Ahmet Firintepe, Alain Pagani, Didier Stricker

In: Proceedings of the IEEE Virtual Reality conference - Posters. IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops (VRW) (IEEEVR-2021) March 27-April 2 IEEE 2021.


In this paper, we present a study on single and multi-view image-based AR glasses pose estimation with two novel methods. The first approach is named GlassPose and is a VGG-based network. The second approach GlassPoseRN is based on ResNet18. We train and evaluate the two custom developed glasses pose estimation networks with one, two and three input images on the HMDPose dataset. We achieve errors as low as 0.10 degrees and 0.90mm on average on all axes for orientation and translation. For both networks, we observe minimal improvements in position estimation with more input views.

Firintepe2021_IEEE_VR.pdf (pdf, 2 MB ) Firintepe2021_IEEE_VR_SupplementaryMaterial.pdf (pdf, 77 KB )

German Research Center for Artificial Intelligence
Deutsches Forschungszentrum für Künstliche Intelligenz