DFKI-LT - Extracting and Querying Relations in Scientific Papers on Language Technology

Ulrich Schäfer, Hans Uszkoreit, Christian Federmann, Torsten Marek, Yajing Zhang
Extracting and Querying Relations in Scientific Papers on Language Technology
5 Proceedings of the 6th International Conference on Language Resources and Evaluation, Pages 3040-3046, Marrakesh, Morocco, ELRA, 5/2008
 
We describe methods for extracting interesting factual relations from scientific texts in computational linguistics and language technology taken from the ACL Anthology. We use a hybrid NLP architecture with shallow preprocessing for increased robustness and domain-specific, ontology-based named entity recognition, followed by a deep HPSG parser running the English Resource Grammar (ERG). The extracted relations in the MRS (minimal recursion semantics) format are simplified and generalized using WordNet. The resulting `quriples' are stored in a database from where they can be retrieved (again using abstraction methods) by relation-based search. The query interface is embedded in a web browser-based application we call the Scientist's Workbench. It supports researchers in editing and online-searching scientific papers.
 
Files: BibTeX, hylap-aiama-lrec08.pdf, L08-1251, 773_paper.pdf