AI-Linguistica

Not a member yet

24 research outputs found

Sort by

‘Nous fêterons’ or ‘On va fêter’? Mimicking Age-Sensitive Variation with ChatGPT

Author: Hekkel Valerie
Schulz Friederike
Lupica Spagnolo Marta
Publication venue: PUBLIA – SLUB Open Publishing
Publication date: 09/07/2024
Field of study

This study explores ChatGPT’s capability to mimic age-sensitive linguistic variation in contemporary French, particularly focusing on older adult speech. Our investigation aimed to assess whether ChatGPT could (1) align its naive responses with age-related language use, (2) demonstrate explicit knowledge of age-related linguistic variation, and (3) modify responses based on such knowledge. Using contexts from the LangAge corpus, ChatGPT was prompted to answer questions from the perspective of speakers of different ages (30– 90) in different interview years (1980–2020), with a specific focus on the use of first-person plural subject clitics (nous/on) and future tenses (futur simple/proche). The results revealed that ChatGPT’s responses predominantly favored formal linguistic variants across all ages. While expert-knowledge injection significantly increased the usage of formal variants, there was no systematic influence of age, birth year, or interview year on variant selection. A partial exception is represented by speakers aged 70 for whom ChatGPT displayed heightened linguistic uncertainty in the naive answer. By contrast, the variant distribution in (3) is mainly motivated by ChatGPT’s expert knowledge generated in (2). These findings highlight the potential and limitations of current LLMs in capturing age-specific variation while encouraging further integration of sociolinguistic methods into LLM research

Dealing with common ground in Human Translation and Neural Machine Translation: A case study on Italian equivalents of German Modal Particles

Author: Meier Franz
Publication venue: PUBLIA – SLUB Open Publishing
Publication date: 12/07/2024
Field of study

The purpose of this chapter is to examine the neural machine translation of modal particles and to compare it to human translation. The quantitatively-oriented study focuses on Italian lexical translation equivalents of German eben and einfach. The two modal particles have similar meaning, as the speaker uses both to underline the obvious character of their utterance. The study is based on a sample of human translations of literary texts as well as on the neural machine translations of these texts generated by Google Translate and DeepL. It will be analyzed to what extent the lexical translation equivalents proposed by the human translators and the NMT tools reflect the modal meaning of eben and einfach and provide information on the existence of modal particles in Italian

DeepL traduce Maraini: le poesie metasemantiche "sotto gli occhi" della traduzione automatica

Author: Saccà Francesco
Publication venue: PUBLIA – SLUB Open Publishing
Publication date: 04/07/2024
Field of study

oai:ai-ling.publia.org:article/4This study reflects on the translation of poetic texts characterised by literary occasionalisms by two MT systems: DeepL and Google Translate. A clear example of this is the translation process carried out in the processing of the metasemantic poems in the collection entitled Gnòsi delle fànfole by Fosco Maraini, whose invented words refer the reader to morphological and phonetic components that are known, but often not identifiable as such. By inserting, e.g., the entire text of the famous poem Il lonfo into the translators, these recognise cognitive patterns probably akin to the symbolism evoked by the morphological and phonological characters of the non-words used by the author and transpose these same patterns into non-romantic languages. Even though the final text in the target language is clearly far from a possible literary translation and the mechanism that leads one to pay attention to such textual components is not entirely transparent, the results seem to be in line with possible textual interpretations provided by a human translator. The research is thus aimed at demonstrating how machine translators, when attempting to translate similar texts, can highlight hidden text properties, such as iconic syntactic structures and phonosymbolic components, through statistical text processing.Il presente studio riflette sull’interpretazione dei testi poetici caratterizzati da occasionalismi letterari, da parte di due sistemi di traduzione automatica: DeepL e Google Translate. Ne è un chiaro esempio il processo traduttivo compiuto nell’elaborazione delle poesie metasemantiche presenti nella raccolta dal titolo Gnòsi delle fànfole di Fosco Maraini, le cui parole inventate rimandano il lettore a componenti morfologiche e fonetiche conosciute, ma spesso non identificabili come tali. Inserendo ad esempio l’intero testo della famosa poesia Il lonfo all’interno dei traduttori questi riconoscono modelli cognitivi probabilmente affini al simbolismo evocato dai caratteri morfologici e fonologici delle non-parole utilizzate dall’autore, e traspongono questi stessi schemi in lingue non-romanze. Nonostante il testo finale nella lingua d’arrivo sia chiaramente lontano da una possibile traduzione letteraria e nonostante il meccanismo che porta a prestare attenzione a tali componenti testuali non sia del tutto trasparente, i risultati sono a primo impatto conformi a possibli interpretazioni testuali fornite da un traduttore umano madrelingua. La ricerca si ripropone pertanto di dimostrare come i traduttori automatici, nel processo di traduzione di simili testi, possano mettere in evidenza proprietà del testo più nascoste, quali ad esempio strutture sintattiche iconiche e componenti fonosimboliche, attraverso un’elaborazione del testo di tipo statistico

Valutare la qualità dei testi generati in lingua italiana

Author: Tavosanis Mirko
Publication venue: PUBLIA – SLUB Open Publishing
Publication date: 08/08/2024
Field of study

Il contributo presenta il problema della valutazione dei testi generati in lingua italiana da LLM collocandolo nel contesto delle pratiche di valutazione esistenti in diversi settori connessi. Nella prima parte viene mostrato il posizionamento della valutazione degli LLM in rapporto ad alcune pratiche ampiamente diffuse, dalla valutazione semiautomatica delle traduzioni automatiche alla valutazione umana di testi di vario genere. Nella seconda parte vengono presentate le condizioni della valutazione in rapporto alle capacità effettive degli LLM, che sono in grado di generare testi di tipo molto diverso. Viene poi presentato l’esame di un campione di testi generati da ChatGPT, eseguito appunto nella prospettiva dell’identificazione delle pratiche di valutazione più adatte. Le produzioni testuali oggetto dell’esame, ricavate da conversazioni condotte con ChatGPT-3.5 e ChatGPT-4, mostrano infatti un avvicinamento notevole alle produzioni professionali di esseri umani, ma anche alcuni punti di divergenza. Complessivamente, il contributo argomenta quindi in favore dell’importanza e dell’utilità della valutazione umana applicata ai testi di buon livello linguistico prodotti dai sistemi attuali basati sull’intelligenza artificiale, purché in questa applicazione venga evitata l’adozione passiva di criteri sviluppati per altri tipi di attività.Il contributo presenta il problema della valutazione dei testi generati in lingua italiana da LLM collocandolo nel contesto delle pratiche di valutazione esistenti in diversi settori connessi. Nella prima parte viene mostrato il posizionamento della valutazione degli LLM in rapporto ad alcune pratiche ampiamente diffuse, dalla valutazione semiautomatica delle traduzioni automatiche alla valutazione umana di testi di vario genere. Nella seconda parte vengono presentate le condizioni della valutazione in rapporto alle capacità effettive degli LLM, che sono in grado di generare testi di tipo molto diverso. Viene poi presentato l’esame di un campione di testi generati da ChatGPT, eseguito appunto nella prospettiva dell’identificazione delle pratiche di valutazione più adatte. Le produzioni testuali oggetto dell’esame, ricavate da conversazioni condotte con ChatGPT-3.5 e ChatGPT-4, mostrano infatti un avvicinamento notevole alle produzioni professionali di esseri umani, ma anche alcuni punti di divergenza. Complessivamente, il contributo argomenta quindi in favore dell’importanza e dell’utilità della valutazione umana applicata ai testi di buon livello linguistico prodotti dai sistemi attuali basati sull’intelligenza artificiale, purché in questa applicazione venga evitata l’adozione passiva di criteri sviluppati per altri tipi di attività

13

full texts

24

metadata records

Updated in last 30 days.

AI-Linguistica

Access Repository Dashboard

Do you manage Open Research Online? Become a CORE Member to access insider analytics, issue reports and manage access to outputs from your repository in the CORE Repository Dashboard! 👇