Publication

ScaleNet: Scale Invariant Network for Semantic Segmentation in Urban Driving Scenes

Mohammad Dawud Ansari; Stephan Krauß; Oliver Wasenmüller; Didier Stricker

In: Proceedings of the 13th International Conference on Computer Vision Theory and Applications |. International Conference on Computer Vision Theory and Applications (VISAPP-18), 13th, January 27-29, Funchal, Madeira, Portugal, SCITEPRESS Digital Library, 2018.

Abstract

The scale difference in driving scenarios is one of the essential challenges in semantic scene segmentation. Close objects cover significantly more pixels than far objects. In this paper, we address this challenge with a scale invariant architecture. Within this architecture, we explicitly estimate the depth and adapt the pooling field size accordingly. Our model is compact and can be extended easily to other research domains. Finally, the accuracy of our approach is comparable to the state-of-the-art and superior for scale problems. We evaluate on the widely used automotive dataset Cityscapes as well as a self-recorded dataset

Weitere Links

https://www.researchgate.net/publication/321360967_ScaleNet_Scale_Invariant_Network_for_Semantic_Segmentation_in_Urban_Driving_Scenes

VISAPP_2018_193_CR.pdf (pdf, 1 MB )