Skip to main content Skip to main navigation


Text Mining Support for Semantic Indexing and Analysis of A/V Streams

Jan Nemrava; Paul Buitelaar; Vojtech Svatek; Thierry Declerck
In: OntoImage 2008. International Workshop on Language Resources for Content-Based Image Retrieval (OntoImage-08), 2nd, located at LREC08, ELDA, 2008.


The work described here concerns the use of complementary resources in sports video analysis; soccer in our case. Structured web data such as match tables with teams, player names, score goals, substitutions, etc. and multiple, unstructured, textual web data sources (minute-by-minute match reports) are processed with an ontology-based information extraction tool to extract and annotate events and entities according to the SmartWeb soccer ontology. Through the temporal alignment of the primary A/V data (soccer videos) with the textual and structured complementary resources, these extracted and semantically organized events can be used as indicators for video segment extraction and semantic classification, i.e. occurrences of particular events in the complementary resources can be used to classify the corresponding video segment, enabling semantic indexing and retrieval of soccer videos.