SDC - Stacked Dilated Convolution: A Unified Descriptor Network for Dense Matching Tasks

René Schuster, Oliver Wasenmüller, Christian Unger, Didier Stricker

In: Conference on Computer Vision and Pattern Recognition. International Conference on Computer Vision and Pattern Recognition (CVPR-2019) June 16-20 Long Beach CA United States IEEE 2019.


Dense pixel matching is important for many computer vision tasks such as disparity and flow estimation. We present a robust, unified descriptor network that considers a large context region with high spatial variance. Our network has a very large receptive field and avoids striding layers to maintain spatial resolution. These properties are achieved by creating a novel neural network layer that consists of multiple, parallel, stacked dilated convolutions (SDC). Several of these layers are combined to form our SDC descriptor network. In our experiments, we show that our SDC features outperform state-of-the-art feature descriptors in terms of accuracy and robustness. In addition, we demonstrate the superior performance of SDC in state-of-the-art stereo matching, optical flow and scene flow algorithms on several famous public benchmarks.

schuster2019sdc.pdf (pdf, 8 MB) schuster2019sdc_supplementary.pdf (pdf, 18 MB)

Deutsches Forschungszentrum für Künstliche Intelligenz
German Research Center for Artificial Intelligence