ScaleNet: Scale Invariant Network for Semantic Segmentation in Urban Driving Scenes

Mohammad Dawud Ansari, Stephan Krauß, Oliver Wasenmüller, Didier Stricker

In: Proceedings of the 13th International Conference on Computer Vision Theory and Applications |. International Conference on Computer Vision Theory and Applications (VISAPP-18) 13th January 27-29 Funchal Madeira Portugal SCITEPRESS Digital Library 2018.


The scale difference in driving scenarios is one of the essential challenges in semantic scene segmentation. Close objects cover significantly more pixels than far objects. In this paper, we address this challenge with a scale invariant architecture. Within this architecture, we explicitly estimate the depth and adapt the pooling field size accordingly. Our model is compact and can be extended easily to other research domains. Finally, the accuracy of our approach is comparable to the state-of-the-art and superior for scale problems. We evaluate on the widely used automotive dataset Cityscapes as well as a self-recorded dataset

Weitere Links

VISAPP_2018_193_CR.pdf (pdf, 1 MB)

German Research Center for Artificial Intelligence
Deutsches Forschungszentrum für Künstliche Intelligenz