DFKI-LT - Unsupervised and domain-independent extraction of technical terms from scientifc articles in digital libraries
Unsupervised and domain-independent extraction of technical terms from scientifc articles in digital libraries
3 Proceedings of the Workshop "Information Retrieval", Darmstadt, Germany, TU Darmstadt, TU Darmstadt, Karolinenplatz 5 64289 Darmstadt, 2009
A central issue for making the contents of documents in a digital library accessible to the user is the identification and extraction of technical terms. We propose a method to approach this task in an unsupervised, domain-independent way: We use a nominal group chunker to extract term candidates and select the technical terms from these candidates based on string frequencies retrieved using the MSN search engine.
Files: BibTeX, LWA09_Proceedings.pdf, lwa-submit_dilia_eichler_etal.pdf