Extracting Glossary Sentences from Scholarly Articles: A Comparative Evaluation of Pattern Bootstrapping and Deep Analysis

Melanie Reiplinger; Ulrich Schäfer; Magdalena Wolska
In: Proceedings of the ACL-2012 Main Conference Workshop on Rediscovering 50 Years of Discoveries. Annual Meeting of the Association for Computational Linguistics (ACL-2012), July 10, Jeju Island, Republic of Korea, Pages 55-65, ISBN 978-1-937284-29-9, Association for Computational Linguistics, 7/2012.


The paper reports on a comparative study of two approaches to extracting definitional sentences from a corpus of scholarly discourse: one based on bootstrapping lexico-syntactic patterns and another based on deep analysis. Computational Linguistics was used as the target domain and the ACL Anthology as the corpus. Definitional sentences extracted for a set of well-defined concepts were rated by domain experts. Results show that the methods extract high-quality definition sentences intended for automated glossary construction.



