Charles University

Biblio at Institute of Formal and Applied Linguistics

Not a member yet

539 research outputs found

Sort by

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

Author: Gehrmann Sebastian
Eddine Moussa
White Michael
Dušek Ondřej
Shvets Anna
Jolly Shailza
Mahamood Saad
Thomson Craig
Dhole Kaustubh
Subramani Nishant
Jernite Yacine
Puduppully Ratish
Strobelt Hendrik
Hou Yufang
Cardenas Ronald
Deutsch Daniel
Durmus Esin
Kale Mihir
Clark Elizabeth
Zhang Li
Xu Ying
Osei Salomey
Creutz Mathias
Bhattacharjee Abhik
Zhou Jiawei
Shen Tianhao
Wang Dakuo
You Chaobin
Ammanamanchi Pawan
Liang Paul
Wang Alex
Štajner Sanja
Juraska Juraj
Shahriyar Rifat
Perez-Beltrachini Laura
Ribeiro Leonardo
Daheim Nico
Tunstall Lewis
Novikova Jekaterina
Sedoc Joao
Chim Jenny
Liu Yixin
Maynez Joshua
Mille Simon
Ladhak Faisal
Papangelis Alexandros
Upadhay Ashish
Zhu Qi
Ginter Filip
Yao Bingsheng
Raunak Vikas
Kanerva Jenna
Bhagavatula Chandra
Sang Yisi
Tsai Vivian
Clive Jordan
Madaan Aman
Adewumi Tosin
Gkatzia Dimitra
Garbacea Cristina
Cahyawijaya Samuel
Pushkarna Mahima
Radev Dragomir
Jin Di
Mahendiran Abinaya
Xiong Deyi
Winata Genta
Kriz Reno
Wilie Bryan
Raheja Vipul
Hayashi Hiroaki
McMillan-Major Angelina
Nikolaev Vitaly
Montella Sebastien
Hasan Tahmid
Chandu Khyati
Publication venue
Publication date: 01/01/2022
Field of study

Evaluation in machine learning is usually informed by past choices, for example which datasets or metrics to use. This standardization enables the comparison on equal footing using leaderboards, but the evaluation choices become sub-optimal as better alternatives arise. This problem is especially pertinent in natural language generation which requires ever-improving suites of datasets, metrics, and human evaluation to make definitive claims. To make following best model evaluation practices easier, we introduce GEMv2. The new version of the Generation, Evaluation, and Metrics Benchmark introduces a modular infrastructure for dataset, model, and metric developers to benefit from each others work. GEMv2 supports 40 documented datasets in 51 languages. Models for all datasets can be evaluated online and our interactive data card creation and rendering tools make it easier to add new datasets to the living benchmark

CLS Infra Training School on Data and Annotation

Author: Janssen Maarten
Cvrček Václav
Křen Michal
Cinková Silvie
Publication venue
Publication date: 01/01/2022
Field of study

Keen to get a flying start in Digital Humanities? You can build, edit, annotate, and query a text corpus without a single line of code! Structure texts with the XML-TEI. Run an NLP tool to add linguistic information. Tackle real research questions concerning Shakespeare’s dramas or your own data by mastering CQL and Universal Dependencies

The Seventh Workshop on Search-Oriented Conversational Artificial Intelligence (SCAI'22)

Author: Clark Leigh
Adlakha Vaibhav
Vakulenko Svitlana
Penha Gustavo
Pal Vaishali
Dušek Ondřej
Publication venue
Publication date: 01/01/2022
Field of study

The goal of the seventh edition of SCAI (https://scai.info) is to bring together and further grow a community of researchers and practitioners interested in conversational systems for information access. The previous iterations of the workshop already demonstrated the breadth and multidisciplinarity inherent in the design and development of conversational search agents. The proposed shift from traditional web search to search interfaces enabled via human-like dialogue leads to a number of challenges, and although such challenges have received more attention in the recent years, there are many pending research questions that should be addressed by the information retrieval community and can largely benefit from a collaboration with other research fields, such as natural language processing, machine learning, human-computer interaction and dialogue systems. This workshop is intended as a platform enabling a continuous discussion of the major research challenges that surround the design of search-oriented conversational systems. This year, participants have the opportunity to meet in person and have more in-depth interactive discussions with a full-day onsite workshop

Simultaneous Translation for Unsegmented Input: A Sliding Window Approach

Author: Sen Sukanta
Haddow Barry
Bojar Ondřej
Publication venue: arxiv
Publication date: 01/01/2022
Field of study

In the cascaded approach to spoken language translation (SLT), the ASR output is typically punctuated and segmented into sentences before being passed to MT, since the latter is typically trained on written text. However, erroneous segmentation, due to poor sentence-final punctuation by the ASR system, leads to degradation in translation quality, especially in the simultaneous (online) setting where the input is continuously updated. To reduce the influence of automatic segmentation, we present a sliding window approach to translate raw ASR outputs (online or offline) without needing to rely on an automatic segmenter. We train translation models using parallel windows (instead of parallel sentences) extracted from the original training data. At test time, we translate at the window level and join the translated windows using a simple approach to generate the final translation. Experiments on English-to-German and English-to-Czech show that our approach improves 1.3--2.0 BLEU points over the usual ASR-segmenter pipeline, and the fixed-length window considerably reduces flicker compared to a baseline retranslation-based online SLT system

CorefUD 1.0: Coreference Meets Universal Dependencies

Author: Nedoluzhko Anna
Novák Michal
Žabokrtský Zdeněk
Zeldes Amir
Popel Martin
Zeman Daniel
Publication venue
Publication date: 01/01/2022
Field of study

Recent advances in standardization for annotated language resources have led to successful large scale efforts, such as the Universal Dependencies (UD) project for multilingual syntactically annotated data. By comparison, the important task of coreference resolution, which clusters multiple mentions of entities in a text, has yet to be standardized in terms of data formats or annotation guidelines. In this paper we present CorefUD, a multilingual collection of corpora and a standardized format for coreference resolution, compatible with morphosyntactic annotations in the UD framework and including facilities for related tasks such as named entity recognition, which forms a first step in the direction of convergence for coreference resolution across languages

Why don’t people use character-level machine translation?

Author: Libovický Jindřich
Schmid Helmut
Fraser Alexander
Publication venue
Publication date: 01/01/2022
Field of study

We present a literature and empirical survey that critically assesses the state of the art in character-level modeling for machine translation (MT). Despite evidence in the literature that character-level systems are comparable with subword systems, they are virtually never used in competitive setups in WMT competitions. We empirically show that even with recent modeling innovations in character-level natural language processing, character-level MT systems still struggle to match their subword-based counterparts. Character-level MT systems show neither better domain robustness, nor better morphological generalization, despite being often so motivated. However, we are able to show robustness towards source side noise and that translation quality does not degrade with increasing beam size at decoding time

DIASER: A Unifying View On Task-oriented Dialogue Annotation

Author: Hudeček Vojtěch
Štancl Daniel
Dušek Ondřej
Schaub Léon-Paul
Paroubek Patrick
Publication venue
Publication date: 01/01/2022
Field of study

Every model is only as strong as the data that it is trained on. In this paper, we present a new dataset, obtained by merging four publicly available annotated corpora for task-oriented dialogues in several domains (MultiWOZ 2.2, CamRest676, DSTC2 and Schema-Guided Dialogue Dataset). This way, we assess the feasibility of providing a unified ontology and annotation schema covering several domains with a relatively limited effort. We analyze the characteristics of the resulting dataset along three main dimensions: language, information content and performance. We focus on aspects likely to be pertinent for improving dialogue success, e.g. dialogue consistency. Furthermore, to assess the usability of this new corpus, we thoroughly evaluate dialogue generation performance under various conditions with the help of two prominent recent end-to-end dialogue models: MarCo and GPT-2. These models were selected as popular open implementations representative of the two main dimensions of dialogue modelling. While we did not observe a significant gain for dialogue state tracking performance, we show that using more training data from different sources can improve language modelling capabilities and positively impact dialogue flow (consistency). In addition, we provide the community with one of the largest open dataset for machine learning experiments

Overview of the ELE Project

Author: Gaspari Federico
Piperidis Stelios
Rehm Georg
Dunne Jane
Way Andy
Aldabe Itziar
Rigau German
Gallagher Owen
Hajič Jan
Lynn Teresa
Marheinecke Katrin
Kückens Jens
Farwell Aritz
Vojtěchová Tereza
Giagkou Maria
Resende Natalia
Publication venue
Publication date: 01/01/2022
Field of study

This paper presents the ongoing European Language Equality (ELE) project, an 18-month action funded by the European Commission. The primary goal of the ELE project is to prepare the ELE programme, in the form of a strategic research, innovation and implementation agenda and roadmap for achieving full digital language equality in Europe by 2030

Non-Autoregressive Machine Translation: It's Not as Fast as it Seems

Author: Haddow Barry
Helcl Jindřich
Birch Alexandra
Publication venue
Publication date: 01/01/2022
Field of study

Efficient machine translation models are commercially important as they can increase inference speeds, and reduce costs and carbon emissions. Recently, there has been much interest in non-autoregressive (NAR) models, which promise faster translation. In parallel to the research on NAR models, there have been successful attempts to create optimized autoregressive models as part of the WMT shared task on efficient translation. In this paper, we point out flaws in the evaluation methodology present in the literature on NAR models and we provide a fair comparison between a state-of-the-art NAR model and the autoregressive submissions to the shared task. We make the case for consistent evaluation of NAR models, and also for the importance of comparing NAR models with other widely used methods for improving efficiency. We run experiments with a connectionist-temporal-classification-based (CTC) NAR model implemented in C++ and compare it with AR models using wall clock times. Our results show that, although NAR models are faster on GPUs, with small batch sizes, they are almost always slower under more realistic usage conditions. We call for more realistic and extensive evaluation of NAR models in future work

Neural String Edit Distance

Author: Libovický Jindřich
Fraser Alexander
Publication venue
Publication date: 01/01/2022
Field of study

We propose the neural string edit distance model for string-pair matching and string transduction based on learnable string edit distance. We modify the original expectation-maximization learned edit distance algorithm into a differentiable loss function, allowing us to integrate it into a neural network providing a contextual representation of the input. We evaluate on cognate detection, transliteration, and grapheme-to-phoneme conversion, and show that we can trade off between performance and interpretability in a single framework. Using contextual representations, which are difficult to interpret, we match the performance of state-of-the-art string-pair matching models. Using static embeddings and a slightly different loss function, we force interpretability, at the expense of an accuracy drop

58

full texts

539

metadata records

Updated in last 30 days.

Biblio at Institute of Formal and Applied Linguistics

Access Repository Dashboard

Do you manage Open Research Online? Become a CORE Member to access insider analytics, issue reports and manage access to outputs from your repository in the CORE Repository Dashboard! 👇