Search CORE

1,720,992 research outputs found

Proceedings of The 3rd Workshop on Multi-word Units in Machine Translation and Translation Technology (MUMTTT 2017)

Author: Seretan Violeta
Johanna Monti Ruslan Mitkov, Violeta Seretan, Gloria Corpas Pastor
AA VV,
Corpas Pastor Gloria
Mitkov Ruslan
Monti Johanna
Publication venue
Publication date: 01/01/2018
Field of study

This volume documents the proceedings of the 3rd Workshop on Multi-word Units in Machine Translation and Translation Technology (MUMTTT 2017), held on 4 November 2017 as part of the EUROPHRAS 2017 conference: "Computational and Corpus-based Approaches to Phraseology: Recent advances and interdisciplinary approaches" (London, 13-14 November 2015), jointly organised by the European Association for Phraseology (EUROPHRAS), the University of Wolverhampton (Research Institute of Information and Language Processing) and the Association for Computational Linguistics – Bulgaria. The workshop was held under the auspices of the European Society of Phraseology (EUROPHRAS), the Special Interest Group on the Lexicon of the Association for Computational Linguistics (SIGLEX), and SIGLEX's Multiword Expressions Section (SIGLEX-MWE). The workshop was co-chaired by Ruslan Mitkov (University of Wolverhampton), Johanna Monti (Università degli Studi di Sassari), Gloria Corpas Pastor (Universidad de Málaga) and Violeta Seretan (Université de Genève). The topic of the workshop was the integration of multi-word units in machine translation and translation technology tools. In spite of the relative progress achieved for particular types of units such as verb-particle constructions, the identification, interpretation and translation of multi-word units in general still represent open challenges, both from a theoretical and a practical point of view. The idiosyncratic morpho-syntactic, semantic and translational properties of multi-word units pose many obstacles even to human translators, mainly because of intrinsic ambiguities, structural and lexical asymmetries between languages, and, finally, cultural differences. The aim of the workshop was to bring together researchers and practitioners working on MWU processing from various perspectives, in order to enable cross fertilisation and foster the creation of innovative solutions that can only arise from interdisciplinary collaborations. The present edition of the workshop provided a forum for researchers and practitioners in the fields of (Computational) Linguistics, (Computational) Phraseology, Translation Studies and Translation Technology to discuss recent advances in the area of multi-word unit processing and to coordinate research efforts across disciplines in order to improve the integration of multi-word units in machine translation and translation technology tools. The programme included 5 oral presentations, and featured an invited talk by Carlos Ramisch, Aix-Marseille University, France. The papers accepted are indicative of the current efforts of researchers and developers who are actively engaged in improving the state of the art of multi-word unit translation. We would like to thank all authors who contributed papers to this workshop edition and the Programme Committee members who provided valuable feedback during the review process

ARCHIVIO ISTITUZIONALE DELLA RICERCA-UNIVERSITA' DEGLI STUDI DI NAPOLI "L'ORIENTALE"

Multi-word Units in Machine Translation and Translation Technology

Author: Seretan Violeta
Ruslan Mitkov Johanna Monti, Gloria Corpas Pastor, Violeta Seretan, Uxoa Iñurrieta, Itziar Aduriz, Arantza Díaz de Ilarraza, Gorka Labaka and Kepa Sarasola, Joke Daems, Michael Carl, Sonia Vandepitte, Robert J. Hartsuiker and Lieve Macken, Amalia Todiraşcu and Mirabela Navlea, Gregor Thurmair, Simon Clematide, Stéphanie Lehner, Johannes Graën and Martin Volk, Lieve Macken and Arda Tezcan, Oscar Mendoza Rivera, Lukasz Grabowski, Kristina Kocijan and Sara Librenjak, Goranka Blagus Bartolec and Ivana Matas Ivanković,Eric Wehrli and Luka Nerima
Corpas Pastor Gloria
Ruslan Mitkov Johanna Monti, Gloria Corpas Pastor, Violeta Seretan
Mitkov Ruslan
Monti Johanna
Publication venue
Publication date: 01/01/2018
Field of study

The correct interpretation of Multiword Units (MWUs) is crucial to many applications in Natural Language Processing but is a challenging and complex task. In recent years, the computational treatment of MWUs has received considerable attention but there is much more to be done before we can claim that NLP and Machine Translation (MT) systems process MWUs successfully. This volume provides a general overview of the field with particular reference to Machine Translation and Translation Technology and focuses on languages such as English, Basque, French Romanian, German, Dutch and Croatian among others. The chapters of the volume illustrate a variety of topics that address this challenge, such as the use of rule-based approaches, compound splitting techniques, MWU identification methodologies in multilingual applications, and MWU alignment issues

ARCHIVIO ISTITUZIONALE DELLA RICERCA-UNIVERSITA' DEGLI STUDI DI NAPOLI "L'ORIENTALE"

Multiword units in machine translation and translation technology

Author: Johanna Monti
Seretan Violeta
Monti Johanna (Ed.)
Violeta Seretan
Gloria Corpas Pastor
Corpas Pastor Gloria
Seretan Violeta (Ed.)
Mitkov Ruslan (Ed.)
Ruslan Mitkov
Mitkov Ruslan
Monti Johanna
Corpas Pastor Gloria (Ed.)
Publication venue
Publication date: 01/01/2018
Field of study

Crossref

ARCHIVIO ISTITUZIONALE DELLA RICERCA-UNIVERSITA' DEGLI STUDI DI NAPOLI "L'ORIENTALE"

Università degli Studi di Napoli L'Orientale: CINECA IRIS

Archive ouverte UNIGE

Multi-word unit processing in Machine Translation

Author: Seretan Violeta
Monti J Mitkov R, Corpas Pastor G, Seretan V
Corpas Pastor Gloria
Ruslan Mitkov Johanna Monti, Gloria Corpas Pastor, Violeta Seretan
Mitkov Ruslan
Monti Johanna
Publication venue
Publication date: 01/01/2018
Field of study

The correct interpretation of Multiword Units (MWUs) is crucial to many applications in Natural Language Processing but is a challenging and complex task. In recent years, the computational treatment of MWUs has received considerable attention but there is much more to be done before we can claim that NLP and Machine Translation (MT) systems process MWUs successfully. This volume provides a general overview of the field with particular reference to Machine Translation and Translation Technology and focuses on languages such as English, Basque, French, Romanian, German, Dutch and Croatian, among others. The chapters of the volume illustrate a variety of topics that address this challenge, such as the use of rule-based approaches, compound splitting techniques, MWU identification methodologies in multilingual applications, and MWU alignment issues

ARCHIVIO ISTITUZIONALE DELLA RICERCA-UNIVERSITA' DEGLI STUDI DI NAPOLI "L'ORIENTALE"

Conclusion

Author: Violeta Seretan
Publication venue
Publication date: 20/11/2010
Field of study

Crossref

Syntax-Based Extraction

Author: Violeta Seretan
Publication venue
Publication date: 20/11/2010
Field of study

Crossref

Introduction

Author: Violeta Seretan
Publication venue
Publication date: 20/11/2010
Field of study

Crossref

Extensions

Author: Violeta Seretan
Publication venue
Publication date: 20/11/2010
Field of study

Crossref

Accurate Collocation Extraction Using a Multilingual Parser

Author: Violeta Seretan
Publication venue
Publication date: 01/04/2008
Field of study

This paper focuses on the use of advanced techniques of text analysis as support for collocation extraction. A hybrid system is presented that combines statistical methods and multilingual parsing for detecting accurate collocational information from English, French, Spanish and Italian corpora. The advantage of relying on full parsing over using a traditional window method (which ignores the syntactic information) is first theoretically motivated, then empirically validated by a comparative evaluation experiment.

CiteSeerX

Survey of Extraction Methods

Author: Violeta Seretan
Publication venue
Publication date: 20/11/2010
Field of study

Crossref