Fast Discriminative Linear Models for Scalable Video Tagging

Roberto Paredes; Adrian Ulges; Thomas Breuel

In: Proceedings of the International Conference on Machine Learning and Applications (ICMLA-09), December 13-15, Miami, Florida, USA, IEEE, 12/2009.


While video tagging (or "concept detection") is a key building block of research prototypes for video retrieval, its practical use is hindered by the computational effort of learning and detecting thousands of concepts. Support vector machines (SVMs), which can be considered the standard approach, scale poorly because the number of support vectors is usually high. In this paper, we propose a novel alternative that offers rapid training and detection. This linear discriminative method is based on maximizing the area under the ROC curve (AUC). In quantitative experiments on a publicly available dataset of web videos, we demonstrate that this approach offers a significant speedup at a moderate performance loss compared to SVMs, and that it also outperforms another well-known linear discriminative method based on passive-aggressive online learning (PAMIR).
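
As a rough illustration of what a linear model trained by AUC maximization can look like, the following Python sketch runs gradient ascent on a sigmoid-smoothed AUC for a single concept. It is not the procedure from the paper, whose details are not given in this abstract. The function names `auc` and `train_linear_auc`, the sigmoid surrogate, and the parameters `lr`, `epochs`, and `beta` are all assumptions made for this example.

```python
import numpy as np


def auc(scores, labels):
    """Exact AUC: share of (positive, negative) pairs ranked correctly.

    Tied pairs count as half a correct pair.
    """
    pos = scores[labels == 1]
    neg = scores[labels == 0]
    diff = pos[:, None] - neg[None, :]
    return float(np.mean((diff > 0) + 0.5 * (diff == 0)))


def train_linear_auc(X, y, lr=0.1, epochs=200, beta=1.0):
    """Gradient ascent on a sigmoid-smoothed AUC for a linear scorer w.x.

    Hypothetical sketch: the step function in the exact AUC is replaced
    by sigmoid(beta * margin) so that the objective is differentiable.
    """
    pos, neg = X[y == 1], X[y == 0]
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        # Score margin of every positive over every negative, shape (P, N).
        d = (pos @ w)[:, None] - (neg @ w)[None, :]
        s = 1.0 / (1.0 + np.exp(-beta * d))
        g = beta * s * (1.0 - s)  # derivative of the sigmoid w.r.t. the margin
        # Gradient of the mean over pairs of sigmoid(beta * w.(x_pos - x_neg)).
        grad = (g.sum(axis=1) @ pos - g.sum(axis=0) @ neg) / g.size
        w += lr * grad
    return w
```

With a weight vector `w` from this sketch, scoring a new video for one concept is a single dot product, `x @ w`, whereas a kernel SVM evaluates one kernel term per support vector. The sketch compares all positive-negative pairs in every step, so it suits only small training sets.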



Deutsches Forschungszentrum für Künstliche Intelligenz
German Research Center for Artificial Intelligence