DFKI-LT - Publications

2018
 
Gennady Agre, Josef van Genabith, Thierry Declerck (eds.)
18th International Conference, AIMSA 2018, Varna, Bulgaria, September 12-14, 2018, Proceedings
Lecture Notes in Artificial Intelligence number 11089, Varna, Bulgaria, Springer International Publishing, Bulgarian Artificial Intelligence Association & Institute of Information and Communication Technologies at the Bulgarian Academy of Sciences (IICT-BAS), Acad. G. Bonchev 2, 1113 Sofia, Bulgaria, 9/2018
 
Andrea Lösch, Valérie Mapelli, Stelios Piperidis, Andrejs Vasiļjevs, Lilli Smal, Thierry Declerck, Eileen Schnur, Khalid Choukri, Josef van Genabith
European Language Resource Coordination: Collecting Language Resources for Public Sector Multilingual Information Management
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan, ELRA, European Language Resources Association, 9, rue des Cordelières, 75013 Paris, France, Paris, 5/2018
 
Georg Heigold, Stalin Varanasi, Günter Neumann, Josef van Genabith
How Robust Are Character-BasedWord Embeddings in Tagging and MT Against Wrod Scramlbing or Randdm Nouse?
Proceedings of the Conference of the Association for Machine Translation in the Americas, Boston, MA, USA, o.A., 2018
 
Khyathi Raghavi Chandu, Ekaterina Loginova, Vishal Gupta, Josef van Genabith, Günter Neumann, Manoj Chinnakotla, Eric Nyberg, Alan Black
Code-Mixed Question Answering Challenge: Crowd-sourcing Data and Techniques
Third Workshop on Computational Approaches to Linguistic Code-switching, Melbourne, Australia, ACL, 2018
 
Cristina España i Bonet, Josef van Genabith
Multilingual Semantic Networks for Data-driven Interlingua Seq2Seq Systems
in: Jinhua Du, Mihael Arcan, Qun Liu, Hitoshi Isahara (eds.):
Proceedings of the LREC 2018 Workshop “MLP-MomenT”, Pages 8-13, Miyazaki, Japan, o.A., 2018
 
Alina Karakanta, Jonathan Dehdari, Josef van Genabith
Neural machine translation for low-resource languages without parallel corpora.
in: Journal of Machine Translation. 2018. 32167–189. Springer. (ed.):
Machine Translation volume o.A., Pages 167-189, Springer, 2018
 
2017
 
Cristina España i Bonet, Josef van Genabith
Going beyond zero-shot MT: combining phonological, morphological and semantic factors. The UdS-DFKI System at IWSLT 2017
in: Sakriani Sakti, Masao Utiyama (eds.):
International Workshop on Spoken Language Translation, Pages 15-22, Tokyo, Japan, o.A., 12/2017
 
Cristina España i Bonet, Ádám Csaba Varga, Alberto Barrón-Cedeño, Josef van Genabith
An Empirical Analysis of NMT-Derived Interlingual Embeddings and their Use in Parallel Sentence Identification
IEEE Journal of Selected Topics in Signal Processing volume 11 number 8, Pages 1340-1350, IEEE, 12/2017
 
Octavia-Maria Şulea, Marcos Zampieri, Shervin Malmasi, Mihaela Vela, Liviu P. Dinu, Josef van Genabith
Exploring the Use of Text Classification in the Legal Domain.
Proceedings of the Second Workshop on Automated Semantic Analysis of Information in Legal Text, London, United Kingdom, http://ceur-ws.org, 10/2017
 
Octavia-Maria Sulea, Marcos Zampieri, Mihaela Vela, Josef van Genabith
Predicting the Law Area and Decisions of French Supreme Court Cases
Proceedings of Recent Advances in Natural Language Processing Conference, Varna, Bulgaria, o.A., 4/2017
 
Hans Uszkoreit, Aleksandra Gabryszak, Leonhard Hennig, Jörg Steffen, Renlong Ai, Stephan Busemann, Jonathan Dehdari, Josef van Genabith, Georg Heigold, Nils Rethmeier, Raphael Rubino, Sven Schmeier, Philippe Thomas, He Wang, Feiyu Xu
Common Round: Application of Language Technologies to Large-Scale Web Debates
Proceedings of the Software Demonstrations of the 15th Conference of the European Chapter of the Association for Computational Linguistics, Pages 5-8, Valencia, Spain, Association for Computational Linguistics, 4/2017
 
Georg Heigold, Günter Neumann, Josef van Genabith
An Extensive Empirical Evaluation of Character-Based Morphological Tagging for 14 Languages
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics 15th Conference of the European Chapter of the Association for Computational Linguistics- Proceedings of Conference, Volume 1: Long Papers volume 1, Long Papers, Pages 505-5013, Valencia, Spain, Association for Computational Linguistics (ACL), 4/2017
 
Ben Peters, Jonathan Dehdari, Josef van Genabith
Massively multilingual neural grapheme-to-phoneme conversion
Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark, o.A., 2017
 
Santanu Pal, Sudip Kumar Naskar, Mihaela Vela, Qun Liu, Josef van Genabith
Neural automatic post-editing using prior alignment and reranking
15th EACL Conference, Valencia, Spain, o.A., 2017
 
2016
 
Li Ling Tan, Jonathan Dehdari, Josef van Genabith
Faster and lighter phrase-based machine translation baseline
The Third Workshop on Asian Translation (WAT2016), Pages 184-193, Osaka, Japan, Association for Computational Linguistics, 12/2016
 
Georg Heigold, Günter Neumann, Josef van Genabith
Scaling Character-Based Morphological Tagging to Fourteen Languages
Proceedings of IEEE International Conference on Big Data, Washinton, DC, DC, USA, IEEE, IEEE, 12/2016
 
Mohammed Attia, Pavel Pecina, Younes Samih, Khaled Shaalan, Josef van Genabith
Arabic spelling error detection and correction
Natural Language Engineering volume 22 number 5, Pages 751-773, Cambridge University Press, 9/2016
 
Georg Heigold, Josef van Genabith
Character-based Neural Machine Translation
volume 16-01,
DFKI Research Reports, Pages 20Technical Report, DFKI GmbH, 9/2016
 
Aljoscha Burchardt, Kimberley Harris, Georg Rehm, Hans Uszkoreit
Towards a Systematic and Human-Informed Paradigm for High-Quality Machine Translation
in: Georg Rehm, Aljoscha Burchardt, Ondrej Bojar, Christian Dugast, Marcello Federico, Josef van Genabith, Barry Haddow, Jan Hajic, Kimberley Harris, Philipp Koehn, Matteo Negri, Martin Popel, Lucia Specia, Marco Turchi, Hans Uszkoreit (eds.):
Proceedings of the LREC 2016 Workshop "Translation Evaluation: From Fragmented Tools and Data Sets to an Integrated Ecosystem", Portorož, Slovenia, o.A., 5/2016
 
Kimberley Harris, Aljoscha Burchardt, Georg Rehm, Lucia Specia
Technology Landscape for Quality Evaluation: Combining the Needs of Research and Industry
in: Georg Rehm, Aljoscha Burchardt, Ondrej Bojar, Christian Dugast, Marcello Federico, Josef van Genabith, Barry Haddow, Jan Hajic, Kimberley Harris, Philipp Koehn, Matteo Negri, Martin Popel, Lucia Specia, Marco Turchi, Hans Uszkoreit (eds.):
Proceedings of the LREC 2016 Workshop "Translation Evaluation: From Fragmented Tools and Data Sets to an Integrated Ecosystem", Portorož, Slovenia, o.A., 5/2016
 
Georg Rehm, Aljoscha Burchardt, Ondrej Bojar, Christian Dugast, Marcello Federico, Josef van Genabith, Barry Haddow, Jan Hajic, Kimberley Harris, Philipp Koehn, Matteo Negri, Martin Popel, Lucia Specia, Marco Turchi, Hans Uszkoreit (eds.)
Proceedings of the LREC 2016 Workshop "Translation Evaluation: From Fragmented Tools and Data Sets to an Integrated Ecosystem"
Portorož, Slovenia, o.A., 5/2016
 
Georg Rehm, Jan Hajic, Josef van Genabith, Andrejs Vasiljevs
Fostering the Next Generation of European Language Technology: Recent Developments ― Emerging Initiatives ― Challenges and Opportunities
in: Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis (eds.):
Proceedings of the Tenth International Conference on Language Resources and Evaluation, Portoro¸, Slovenia, European Language Resources Association (ELRA), Paris, France, 5/2016
 
Artuur Leeuwenberg, Mihaela Vela, Jonathan Dehdari, Josef van Genabith
A Minimally Supervised Approach for Synonym Extraction with Word Embeddings
The Prague Bulletin of Mathematical Linguistics volume 105, Pages 111-142, De Gruyter, Berlin, Germany, 4/2016
 
Santanu Pal, Sudip Kumar Naskar, Mihaela Vela, Josef van Genabith
A Neural Network based Approach to Automated Post-Editing
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Berlin, Germany, ACL, ACL, 2016
 
Jonathan Dehdari, Li Ling Tan, Josef van Genabith
BIRA: Improved Predictive Exchange Word Clustering
Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), San Diego, California, USA, Association for Computational Linguistics, Association for Computational Linguistics, 2016
 
Raphael Rubino, Ekaterina Lapshinova-Koltunski, Josef van Genabith
Information Density and Quality Estimation Features as Translationese Indicators for Human Translation Classification
Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego, CA, USA, o.A., 2016
 
Santanu Pal, Marcos Zampieri, Sudip Kumar Naskar, Tapas Nayak, Mihaela Vela, Josef van Genabith
CATaLog Online: Porting a Post-editing Tool to the Web
10th edition of the Language Resources and Evaluation Conference, Portoro, Zambia, o.A., 2016
 
Jonathan Dehdari, Li Ling Tan, Josef van Genabith
Scaling Up Word Clustering
Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), Pages 42-46, San Diego, CA, USA, ACL, Association for Computational Linguistics, 2016
 
Tapas Nayak, Santanu Pal, Sudip Kumar Naskar, Sivaji Bandyopadhyay, Josef van Genabith
Beyond Translation Memories: Generating Translation Suggestions based on Parsing and POS Tagging
In the Proceedings of the 2nd Workshop on Natural Language Processing for Translation Memories, Portoroz, Slovenia, o.A., 2016
 
Hanna Bechara, Rohit Gupta, Li Ling Tan, Constantin Orasan, Ruslan Mitkov, Josef van Genabith
WOLVESAAR at SemEval-2016 Task 1: Replicating the Success of Monolingual Word Alignment and Neural Embeddings for Semantic Textual Similarity
Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), San Diego, CA, USA, ACL, 2016
 
Santanu Pal, Sudip Kumar Naskar, Josef van Genabith
Multi-Engine and Multi-Alignment Based Automatic Post-Editing and its Impact on Translation Productivity
26th International Conference on Computational Linguistics volume Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, Osak, Japan, o.A., 2016
 
Rohit Gupta, Constantin Orăsan, Marcos Zampieri, Mihaela Vela, Josef van Genabith, Ruslan Mitkov
Improving Translation Memory Matching and Retrieval Using Paraphrases.
Machine Translation volume 30 number 1-2, Pages 19-40, Springer, 2016
 
Raphael Rubino, Ekaterina Lapshinova-Koltunski, Josef van Genabith
Information Density and Quality Estimation Features as Translationese Indicators for Human Translation Classification
Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), San Diego, CA, USA, ACL, Association for Computational Linguistics, 2016
 
Santanu Pal, Sudip Kumar Naskar, Marcos Zampieri, Tapas Nayak, Josef van Genabith
CATaLog Online: A Web-based CAT Tool for Distributed Translation with Data Capture for APE and Translation Process Research
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: System Demonstrations, Pages 98-102, Osaka, Japan, o.A., 2016
 
Santanu Pal, Sudip Kumar Naskar, Josef van Genabith
Forest to string based statistical machine translation with hybrid word alignments
Computational Linguistics and Intelligent Text Processing Lecture Notes in Computer Science volume 9624, Pages 38-50, Konya, Turkey, Springer, 2016
 
Li Ling Tan, Francis Bond, Josef van Genabith
USAAR at SemEval-2016 Task 13: Hyponym Endocentricity
Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL), San Diego, CA, USA, ACL, ACL, 2016
 
Georg Rehm, Hans Uszkoreit, Sophia Ananiadou, Núria Bel, Audronė Bielevičienė, Lars Borin, António Branco, Gerhard Budin, Nicoletta Calzolari, Walter Daelemans, Radovan Garabík, Marko Grobelnik, Carmen García-Mateo, Josef van Genabith, Jan Hajič, Inma Hernáez, John Judge, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Lindén, Bernardo Magnini, Joseph Mariani, John McNaught, Maite Melero, Monica Monachini, Asunción Moreno, Maciej Ogrodniczuk Jan Odjik, Piotr Pęzik, Stelios Piperidis, Adam Przepiórkowski, Eiríkur Rögnvaldsson, Mike Rosner, Bolette Sandford Pedersen, Inguna Skadiņa, Koenraad De Smedt, Marko Tadić, Paul Thompson, Dan Tufiş, Tamás Váradi, Andrejs Vasiļjevs, Kadri Vider, Jolanta Zabarskaite
The Strategic Impact of META-NET on the Regional, National and International Level
Language Resources and Evaluation volume 50 number 2, Pages 351-374, Springer, 2016
 
Raphael Rubino, Stefania Degaetano-Ortlieb, Elke Teich, Josef van Genabith
Modeling Diachronic Change in Scientific Writing with Information Density
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, Osaka, Japan, o.A., 2016
 
2015
 
Constantin Orasan, Alessandro Cattelan, Gloria Corpas Pastor, Josef van Genabith, Manuel Herranz, Juan José Arevalillo, Qun Liu, Khalil Sima’an, Lucia Specia
The EXPERT Project: Advancing the State of the Art in Hybrid Translation Technologies
10/2015
 
Antonio Toral, Pavel Pecina, Longyue Wan, Josef van Genabith
Linguistically-augmented Perplexity-based Data Selection for Language Models
Computer Speech & Language volume 32 number 1, Pages 11-26, Academic Press, 7/2015
 
Santanu Pal, Partha Pakray, Alexander Gelbukh, Josef van Genabith
Mining parallel resources for machine translation from comparable corpora
in: Alexander Gelbukh (ed.):
Computational Linguistics and Intelligent Text Processing (Lecture Notes in Computer Science) volume 9041,
Lecture Notes in Computer Science, Pages 534-544, Springer, 4/2015
 
Pavel Pecina, Antonio Toral, Vassilis Papavassiliou, Prokopis Prokopidis, Ale¨ Tamchyna, Andy Way, Josef van Genabith
Domain adaptation of statistical machine translation with domain-focused web crawling
Journal on Language Resources and Evaluation volume 49 number 1, Pages 147-193, Springer, Netherlands, 3/2015
 
Li Ling Tan, Rohit Gupta, Josef van Genabith
USAAR-WLV: Hypernym Generation with Deep Neural Nets
Proceedings of the 9th International Workshop on Semantic Evaluation , Pages 932-937, Denver, Colorado, USA, o.A., Association for Computational Linguistics, 2015
 
Li Ling Tan, Jonathan Dehdari, Josef van Genabith
An Awkward Disparity between BLEU / RIBES Scores and Human Judgements in Machine Translation
Proceedings of the Workshop on Asian Translation (WAT-2015), Pages 74-81, Kyoto, Japan, Association for Computational Linguistics, 2015
 
Tapas Nayek, Sudip Kumar Naskar, Santanu Pal, Marcos Zampieri, Mihaela Vela, Josef van Genabith
CATaLog: New Approaches to TM and Post Editing Interfaces
Proceedings of the 1st Workshop on Natural Language Processing for Translation Memories, Pages 36-43, Hissar, Bulgaria, o.A., 2015
 
Carolina Scarton, Marcos Zampieri, Mihaela Vela, Josef van Genabith, Lucia Specia
Searching for Context: a Study on Document-Level Labels for Translation Quality Estimation
Proceedings of the 18th Annual Conference of the European Association for Machine Translation, Pages 35-42, Antalya, Turkey, o.A., 2015
 
Santanu Pal, Partha Pakray, Alexander Gelbukh, Josef van Genabith
Mining Parallel Resources for Machine Translation from Comparable Corpora
in: Alexander Gelbukh (ed.):
Lecture Notes in Computer Science Computational Linguistics and Intelligent Text Processing, 16th International Conference, CICLing 2015, Proceedings volume 9041,
Lecture Notes in Computer Science, Pages 534-544, Cairo, Egypt, Springer, 2015
 
Marcos Zampieri, Binyam Gebrekidan Gebre, Hernani Costa, Josef van Genabith
Comparing Approaches to the Identification of Similar Languages
Proceedings of the Joint Workshop on Language Technology for Closely Related Languages, Varieties and Dialects, Pages 66-73, Hissar, Bulgaria, o.A., 2015
 
Rohit Gupta, Constantin Orasan, Marcos Zampieri, Mihaela Vela, Josef van Genabith
Can Translation Memories afford not to use paraphrasing?
Proceedings of the 18th Annual Conference of the European Association for Machine Translation, Pages 35-42, Antalya, Turkey, o.A., 2015
 
Rohit Gupta, Constantin Orasan, Josef van Genabith
ReVal: A Simple and Effective Machine Translation Evaluation Metric Based on Recurrent Neural Networks
Proceedings of the 2015 Conference on Emperical Methods in Natural Language Processing, Pages 1066-1072, Lisbon, Portugal, o.A., Association for Computational Linguistics, 2015
 
Li Ling Tan, Jonathan Dehdari, Josef van Genabith
An Awkward Disparity between BLEU / RIBES Scores and Human Judgements in Machine Translation
Proceedings of the 2nd Workshop on Asian Translation, Pages 74-81, Kyoto, Japan, o.A., 2015
 
Li Ling Tan, Josef van Genabith, Francis Bond
Passive and Pervasive Use of a Bilingual Dictionary in Statistical Machine Translation
Proceedings of the ACL 2015 Fourth Workshop on Hybrid Approaches to Translation, Pages 30-34, Beijing, China, o.A., Association for Computational Linguistics, 2015
 
Li Ling Tan, Carolina Scarton, Lucia Specia, Josef van Genabith
Usaar-sheffield: Semantic textual similarity with deep regression and machine translation evaluation metrics
Proceedings of the 9th international workshop on semantic evaluation (SemEval 2015), Pages 85-89, Denver, Colorade, USA, o.A., 2015
 
Mihaela Vela, Josef van Genabith
Re-assessing the WMT2013 Human Evaluation with Professional Translators Trainees
Proceedings of the 18th Annual Conference of the European Association for Machine Translation, Pages 161-169, Antalya, Turkey, o.A., European Association for Machine Translation, 2015
 
Santanu Pal, Mihaela Vela, Sudip Kumar Naskar, Josef van Genabith
USAAR-SAPE: An English–Spanish Statistical Automatic Post-EditingSystem
Proceedings of the Tenth Workshop on Statistical Machine Translation, Pages 380-384, Lissabon, Portugal, o.A., Association for Computational Linguistics, 2015
 
Li Ling Tan, Carolina Scarton, Lucia Specia, Josef van Genabith
USAAR-SHEFFIELD: Semantic Textual Similarity with Deep Regression and Machine Translation Evaluation Metrics
Proceedings of the 9th International Workshop on Semantic Evaluation , Pages 85-89, Denver, Colorado, USA, o.A., 2015
 
Li Ling Tan, Rohit Gupta, Josef van Genabith
Usaar-wlv: Hypernym generation with deep neural nets
Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), Denver, Colorado, USA, o.A., 2015
 
Santanu Pal, Sudip Kumar Naskar, Josef van Genabith
UdS-Sant: English–German Hybrid Machine Translation System
Proceedings of the Tenth Workshop on Statistical Machine Translation, Pages 152-157, Lisbon, Portugal, o.A., Association for Computational Linguistics, 2015
 
2014
 
Georg Rehm, Hans Uszkoreit, Sophia Ananiadou, Núria Bel, Audronė Bielevičienė, Lars Borin, António Branco, Gerhard Budin, Nicoletta Calzolari, Walter Daelemans, Radovan Garabík, Marko Grobelnik, Carmen García-Mateo, Josef van Genabith, Jan Hajič, Inma Hernáez, John Judge, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Lindén, Bernardo Magnini, Joseph Mariani, John McNaught, Maite Melero, Monica Monachini, Asunción Moreno, Jan Odjik, Maciej Ogrodniczuk, Piotr Pęzik, Stelios Piperidis, Adam Przepiórkowski, Eiríkur Rögnvaldsson, Mike Rosner, Bolette Sandford Pedersen, Inguna Skadiņa, Koenraad De Smedt, Marko Tadić, Paul Thompson, Dan Tufiş, Tamás Váradi, Andrejs Vasiļjevs, Kadri Vider, Jolanta Zabarskaite
The Strategic Impact of META-NET on the Regional, National and International Level
Proceedings of the 9th Language Resources and Evaluation Conference, Reykjavik, Iceland, o.A., Reykjavik, 5/2014
 
Aljoscha Burchardt, Arle Richard Lommel, Georg Rehm, Felix Sasaki, Josef van Genabith, Hans Uszkoreit
Language Technology Drives Quality Translation
MultiLingual volume 143, Pages 33-39, MultiLingual Computing, Inc., 4/2014
 
Pratyush Banerjee, Raphael Rubino, Johann Roturier, Josef van Genabith
Quality estimation-guided data selection for domain adaptation of smt
Machine Translation volume o.A., Pages 101-108, Springer, 2014
 
2012
 
Christian Federmann, Eleftherios Avramidis, Marta R. Costa-jussa, Josef van Genabith, Maite Melero, Pavel Pecina
The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation
Proceedings of the 8th ELRA Conference on Language Resources and Evaluation, Istanbul, Turkey, European Language Resources Association (ELRA), 5/2012