ResFPN: Residual Skip Connections in Multi-Resolution Feature Pyramid Networks for Accurate Dense Pixel Matching

Rishav Rishav, René Schuster, Ramy Battrawy, Oliver Wasenmüller, Didier Stricker

In: International Conference on Pattern Recognition. International Conference on Pattern Recognition (ICPR-2021) January 12-15 Mailand Italy IEEE 1/2021.


Dense pixel matching is required for many computer vision algorithms such as disparity, optical flow or scene flow estimation. Feature Pyramid Networks (FPN) have proven to be a suitable feature extractor for CNN-based dense matching tasks. FPN generates well localized and semantically strong features at multiple scales. However, the generic FPN is not utilizing its full potential, due to its reasonable but limited localization accuracy. Thus, we present ResFPN -- a multi-resolution feature pyramid network with multiple residual skip connections, where at any scale, we leverage the information from higher resolution maps for stronger and better localized features. In our ablation study, we demonstrate the effectiveness of our novel architecture with clearly higher accuracy than FPN. In addition, we verify the superior accuracy of ResFPN in many different pixel matching applications on established datasets like KITTI, Sintel, and FlyingThings3D.

rishav2021resfpn.pdf (pdf, 3 MB)

Deutsches Forschungszentrum für Künstliche Intelligenz
German Research Center for Artificial Intelligence