Mapping between Dependency Structures and Compositional Semantic Representations

Max Jakob, Markéta Lopatková, Valia Kordoni

In: Nicoletta Calzolari (Conference Chair) , Khalid Choukri , Bente Maegaard , Joseph Mariani , Jan Odijk , Stelios Piperidis , Mike Rosner , Daniel Tapias (editor). Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10). International Conference on Language Resources and Evaluation (LREC-2010) May 17-23 Valletta Malta ISBN 2-9517408-6-7 European Language Resources Association (ELRA) 5/2010.


This paper investigates the mapping between two semantic formalisms, namely the tectogrammatical layer of the Prague Dependency Treebank 2.0 (PDT) and (Robust) Minimal Recursion Semantics ((R)MRS). It is a first attempt to relate the dependency-based annotation scheme of PDT to a compositional semantics approach like (R)MRS. A mapping algorithm that converts PDT trees to (R)MRS structures is developed, associating (R)MRSs to each node on the dependency tree. Furthermore, composition rules are formulated and the relation between dependency in PDT and semantic heads in (R)MRS is analyzed. It turns out that structure and dependencies, morphological categories and some coreferences can be preserved in the target structures. Moreover, valency and free modifications are distinguished using the valency dictionary of PDT as an additional resource. The validation results show that systematically correct underspecified target representations can be obtained by a rule-based mapping approach, which is an indicator that (R)MRS is indeed robust in relation to the formal representation of Czech data. This finding is novel, for Czech, with its free word order and rich morphology, is typologically different than languages analyzed with (R)MRS to date.

German Research Center for Artificial Intelligence
Deutsches Forschungszentrum für Künstliche Intelligenz