1 |
Source language difficulties in learner translation: Evidence from an error-annotated corpus
|
|
|
|
BASE
|
|
Show details
|
|
2 |
An Exploratory Analysis of Multilingual Word-Level Quality Estimation with Cross-Lingual Transformers ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
An exploratory analysis of multilingual word-level quality estimation with cross-lingual transformers
|
|
|
|
BASE
|
|
Show details
|
|
4 |
A sequence labelling approach for automatic analysis of ello: tagging pronouns, antecedents, and connective phrases
|
|
|
|
BASE
|
|
Show details
|
|
5 |
TransQuest at WMT2020: Sentence-Level direct assessment
|
|
|
|
In: 1049 ; 1055 (2020)
|
|
BASE
|
|
Show details
|
|
6 |
TransQuest: Translation quality estimation with cross-lingual transformers
|
|
|
|
In: 5070 ; 5081 (2020)
|
|
BASE
|
|
Show details
|
|
7 |
Intelligent translation memory matching and retrieval with sentence encoders
|
|
|
|
In: 175 ; 184 (2020)
|
|
BASE
|
|
Show details
|
|
8 |
Contributions to the Computational Treatment of Non-literal Language
|
|
|
|
Abstract:
A thesis submitted in partial ful lment of the requirements of the University of Wolverhampton for the degree of Doctor of Philosophy. ; Non-literal language concerns the deliberate use of language in such a way that meaning cannot be inferred through a mere literal interpretation. In this thesis, three different forms of this phenomenon are studied; namely, irony, non-compositional Multiword Expressions (MWEs), and metaphor. We start by developing models to identify ironic comments in the context of the social micro-blogging website Twitter. In these experiments, we proposed a new way to extract features based on a study of their spatial structure. The proposed model is shown to perform competitively on a standard Twitter dataset. Next, we extensively study MWEs, which are the central point of focus in this work. We start by framing the task of MWE identi fication as sequence labelling and devise experiments to see the effect of eye-tracking data in capturing formulaic MWEs using structured prediction. We also develop a novel neural architecture to speci fically address the issue of discontinuous MWEs using a combination of Graph Convolutional Neural Networks (GCNs) and self-attention. The proposed model is subsequently tested on several languages where it is shown to outperform the state-of-the-art in overall criteria and also in capturing gappy MWEs. In the final part of the thesis, we look at metaphor and its interaction with verbal MWEs. In a series of experiments, we propose a hybrid BERT-based model augmented with a novel variation of GCN where we perform classifi cation on two standard metaphor datasets using information from MWEs. This model which performs at the same level with state-of-the-art is, to the best of our knowledge, the first MWE-aware metaphor identifi cation system paving the way for further experimentation on the interaction of different types of fi gurative language. ; Research Group in Computational Linguistics.
|
|
Keyword:
deep learning; eye tracking; graph convolutional networks; idioms; irony detection; metaphor processing; multiword expressions; neural networks; sarcasm
|
|
URL: http://hdl.handle.net/2436/623843
|
|
BASE
|
|
Hide details
|
|
9 |
What matters more: the size of the corpora or their quality? The case of automatic translation of multiword expressions using comparable corpora.
|
|
|
|
BASE
|
|
Show details
|
|
11 |
RGCL at SemEval-2020 task 6: Neural approaches to definition extraction
|
|
|
|
In: 717 ; 723 (2020)
|
|
BASE
|
|
Show details
|
|
12 |
Automated text simplification as a preprocessing step for machine translation into an under-resourced language
|
|
|
|
In: Štajner, Sanja orcid:0000-0002-7780-7035 and Popović, Maja orcid:0000-0001-8234-8745 (2019) Automated text simplification as a preprocessing step for machine translation into an under-resourced language. In: Recent Advances in Natural Language Processing (RANLP 2019), 2-4 Sept 2019, Varna, Bulgaria. (2019)
|
|
BASE
|
|
Show details
|
|
13 |
Are ambiguous conjunctions problematic for machine translation?
|
|
|
|
In: Popović, Maja orcid:0000-0001-8234-8745 and Castilho, Sheila orcid:0000-0002-8416-6555 (2019) Are ambiguous conjunctions problematic for machine translation? In: Recent Advances in Natural Language Processing (RANLP 2019), 2 - 4 Sept 2019, Varna, Bulgaria. (2019)
|
|
BASE
|
|
Show details
|
|
14 |
Natural Language Generation
|
|
|
|
In: Handbook of Computational Linguistics (2nd edition) ; https://hal.archives-ouvertes.fr/hal-02079245 ; Mitkov, Ruslan. Handbook of Computational Linguistics (2nd edition), In press (2019)
|
|
BASE
|
|
Show details
|
|
15 |
Bridging the Gap: Attending to Discontinuity in Identification of Multiword Expressions ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Summary Refinement through Denoising
|
|
|
|
In: Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019) (2019)
|
|
BASE
|
|
Show details
|
|
17 |
Large-Scale Hierarchical Alignment for Data-driven Text Rewriting
|
|
|
|
In: Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019) (2019)
|
|
BASE
|
|
Show details
|
|
18 |
Do Online Resources Give Satisfactory Answers to Questions about Meaning and Phraseology?
|
|
|
|
BASE
|
|
Show details
|
|
19 |
RGCL at IDAT: deep learning models for irony detection in Arabic language
|
|
|
|
In: 2517 ; 416 ; 425 (2019)
|
|
BASE
|
|
Show details
|
|
20 |
Bridging the gap: attending to discontinuity in identification of multiword expressions
|
|
|
|
BASE
|
|
Show details
|
|
|
|