2 |
How much context span is enough? Examining context-related issues for document-level MT
|
|
|
|
In: Castilho, Sheila orcid:0000-0002-8416-6555 (2022) How much context span is enough? Examining context-related issues for document-level MT. In: 13th Language Resources and Evaluation Conference, 21-23 June 2022, Marseille, France. (In Press) (2022)
|
|
Abstract:
This paper analyses how much context span is necessary to solve different context-related issues, namely, reference, ellipsis, gender, number, lexical ambiguity, and terminology when translating from English into Portuguese. We use the DELA corpus, which consists of 60 documents and six different domains (subtitles, literary, news, reviews, medical, and legislation). We find that the shortest context span to disambiguate issues can appear in different positions in the document including preceding, following, global, world knowledge; and that the average length depends on the issue types as well as the domain. Additionally, we show that the standard approach of relying on only two preceding sentences as context might not be enough depending on the domain and issue types.
|
|
Keyword:
Computational linguistics; context span; document-level; Language; Linguistics; Machine translating; Translating and interpreting
|
|
URL: http://doras.dcu.ie/27009/
|
|
BASE
|
|
Hide details
|
|
3 |
An investigation of English-Irish machine translation and associated resources
|
|
Dowling, Meghan. - : Dublin City University. School of Computing, 2022. : Dublin City University. ADAPT, 2022
|
|
In: Dowling, Meghan orcid:0000-0003-1637-4923 (2022) An investigation of English-Irish machine translation and associated resources. PhD thesis, Dublin City University. (2022)
|
|
BASE
|
|
Show details
|
|
4 |
One model for the learning of language.
|
|
|
|
In: Proceedings of the National Academy of Sciences of the United States of America, vol 119, iss 5 (2022)
|
|
BASE
|
|
Show details
|
|
5 |
Computational Measures of Deceptive Language: Prospects and Issues
|
|
|
|
In: ISSN: 2297-900X ; EISSN: 2297-900X ; Frontiers in Communication ; https://hal.archives-ouvertes.fr/hal-03629780 ; Frontiers in Communication, Frontiers, 2022, 7, pp.792378. ⟨10.3389/fcomm.2022.792378⟩ (2022)
|
|
BASE
|
|
Show details
|
|
6 |
Animal linguistics in the making: the Urgency Principle and titi monkeys’ alarm system
|
|
|
|
In: ISSN: 0394-9370 ; Ethology Ecology and Evolution ; https://hal.inrae.fr/hal-03518874 ; Ethology Ecology and Evolution, Taylor & Francis, 2022, pp.1-17. ⟨10.1080/03949370.2021.2015452⟩ (2022)
|
|
BASE
|
|
Show details
|
|
7 |
Finding the best way to put media bias research into practice via an annotation app ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Labour market discrimination and biases in human judgement and Artificial Intelligence ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Movies with imaginary worlds cluster together because of exploration-related terms in plot summaries ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Finding the best way to put media bias research into practice through an annotation app ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Maschinelle Übersetzung (MT) für den Notfall : Ratgeber zum Einsatz von MT Tools für die Kommunikation mit Flüchtlingen aus der Ukraine ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Neural machine translation and language teaching : possible implications for the CEFR ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
From bag-of-words towards natural language: adapting topic models to avoid stop word removal ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Linked Open Tafsir - Rekonstruktion der Entstehungsdynamik(en) des Korans mithilfe der Netzwerkmodellierung früher islamischer Überlieferungen ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
DiaCollo für GEI-Digital - Ein experimentelles Projekt zur weiteren Erschließung digitalisierter historischer Schulbuchbestände ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
DiaCollo für GEI-Digital - Ein experimentelles Projekt zur weiteren Erschließung digitalisierter historischer Schulbuchbestände ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
DiaCollo für GEI-Digital - Ein experimentelles Projekt zur weiteren Erschließung digitalisierter historischer Schulbuchbestände ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Linked Open Tafsir - Rekonstruktion der Entstehungsdynamik(en) des Korans mithilfe der Netzwerkmodellierung früher islamischer Überlieferungen ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|