1 |
Determining Tone of a Body of Text
|
|
|
|
In: Senior Projects Spring 2020 (2020)
|
|
BASE
|
|
Show details
|
|
3 |
Modelling source- and target-language syntactic Information as conditional context in interactive neural machine translation
|
|
|
|
In: Gupta, Kamal Kumar, Haque, Rejwanul orcid:0000-0003-1680-0099 , Ekbal, Asif, Bhattacharyya, Pushpak and Way, Andy orcid:0000-0001-5736-5930 (2020) Modelling source- and target-language syntactic Information as conditional context in interactive neural machine translation. In: Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2-6 Nov 2020, Lisboa, Portugal. (2020)
|
|
BASE
|
|
Show details
|
|
4 |
AlphaMWE: construction of multilingual parallel corpora with MWE annotations
|
|
|
|
In: Han, Lifeng orcid:0000-0002-3221-2185 , Jones, Gareth J.F. orcid:0000-0003-2923-8365 and Smeaton, Alan F. orcid:0000-0003-1028-8389 (2020) AlphaMWE: construction of multilingual parallel corpora with MWE annotations. In: Joint Workshop on Multiword Expressions and Electronic Lexicons (MWE-LEX 2020), 13 Dec 2020, Barcelona, Spain (Online). (2020)
|
|
Abstract:
In this work, we present the construction of multilingual parallel corpora with annotation of multiword expressions (MWEs). MWEs include verbal MWEs (vMWEs) defined in the PARSEME shared task that have a verb as the head of the studied terms. The annotated vMWEs are also bilingually and multilingually aligned manually. The languages covered include English, Chinese, Polish, and German. Our original English corpus is taken from the PARSEME shared task in 2018. We performed machine translation of this source corpus followed by human post editing and annotation of target MWEs. Strict quality control was applied for error limitation, i.e., each MT output sentence received first manual post editing and annotation plus second manual quality rechecking. One of our findings during corpora preparation is that accurate translation of MWEs presents challenges to MT systems. To facilitate further MT research, we present a categorisation of the error types encountered by MT systems in performing MWE related translation. To acquire a broader view of MT issues, we selected four popular state-of-the-art MT models for comparisons namely: Microsoft Bing Translator, GoogleMT, Baidu Fanyi and DeepL MT. Because of the noise removal, translation post editing and MWE annotation by human professionals, we believe our AlphaMWE dataset will be an asset for cross-lingual and multilingual research, such as MT and information extraction. Our multilingual corpora are available as open access at github.com/poethan/AlphaMWE.
|
|
Keyword:
Artificial intelligence; Computational linguistics; Language; Machine translating; Multi-word Expressions; Multilingual corpora
|
|
URL: http://doras.dcu.ie/25153/
|
|
BASE
|
|
Hide details
|
|
5 |
DIACR-Ita @ EVALITA2020: overview of the EVALITA2020 DiachronicLexical semantics (DIACR-Ita) task
|
|
|
|
In: Basile, Pierpaolo, Caputo, Annalina orcid:0000-0002-7144-8545 , Caselli, Tommaso orcid:0000-0003-2936-0256 , Cassotti, Pierluigi and Varvara, Rossella orcid:0000-0001-9957-2807 (2020) DIACR-Ita @ EVALITA2020: overview of the EVALITA2020 DiachronicLexical semantics (DIACR-Ita) task. In: Seventh Evaluation Campaign of Natural Language Processing and Speech Tools for Italian, 17 Dec 2020, Online. (2020)
|
|
BASE
|
|
Show details
|
|
6 |
On the differences between human translations
|
|
|
|
In: Popović, Maja orcid:0000-0001-8234-8745 (2020) On the differences between human translations. In: 22nd Annual Conference of the European Association for Machine Translation (EAMT 2020), 3 -5 Nov 2020, Lisbon, Portugal (Online). (2020)
|
|
BASE
|
|
Show details
|
|
7 |
A diachronic Italian corpus based on “L’Unit`a”
|
|
|
|
In: Basile, Pierpaolo, Caputo, Annalina orcid:0000-0002-7144-8545 , Caselli, Tommaso orcid:0000-0003-2936-0256 , Cassotti, Pierluigi and Varvara, Rossella orcid:0000-0001-9957-2807 (2020) A diachronic Italian corpus based on “L’Unit`a”. In: Seventh Italian Conference on Computational Linguistics, 1-3 Mar 2021, Bologna (Online). (2020)
|
|
BASE
|
|
Show details
|
|
8 |
Neural machine translation between similar south-Slavic languages
|
|
|
|
In: Popović, Maja orcid:0000-0001-8234-8745 and Poncelas, Alberto orcid:0000-0002-5089-1687 (2020) Neural machine translation between similar south-Slavic languages. In: 2020 Fifth Conference on Machine Translation (WMT20), 19-20 Nov 2020, Dominican Republic (Online). (2020)
|
|
BASE
|
|
Show details
|
|
9 |
Annotating verbal MWEs in Irish for the PARSEME Shared Task 1.2
|
|
|
|
In: Walsh, Abigail, Lynn, Teresa and Foster, Jennifer orcid:0000-0002-7789-4853 (2020) Annotating verbal MWEs in Irish for the PARSEME Shared Task 1.2. In: Joint Workshop on Multiword Expressions and Electronic Lexicons, 13 Dec 2020, Barcelona, Spain (Online). (2020)
|
|
BASE
|
|
Show details
|
|
10 |
GM-CTSC at SemEval-2020 Task 1: Gaussian mixtures cross temporal similarity clustering
|
|
|
|
In: Cassotti, Pierluigi, Caputo, Annalina orcid:0000-0002-7144-8545 , Polignano, Marco orcid:0000-0002-3939-0136 and Basile, Pierpaolo orcid:0000-0002-0545-1105 (2020) GM-CTSC at SemEval-2020 Task 1: Gaussian mixtures cross temporal similarity clustering. In: Fourteenth Workshop on Semantic Evaluation, Dec 2020, Barcelona (Online). (2020)
|
|
BASE
|
|
Show details
|
|
11 |
Syntax-informed interactive neural machine translation
|
|
|
|
In: Gupta, Kamal Kumar, Haque, Rejwanul orcid:0000-0003-1680-0099 , Ekbal, Asif, Bhattacharyya, Pushpak and Way, Andy orcid:0000-0001-5736-5930 (2020) Syntax-informed interactive neural machine translation. In: The International Joint Conference on Neural Networks (IJCNN), 19-24 July 2020, Glasgow, UK (Online). (2020)
|
|
BASE
|
|
Show details
|
|
12 |
Bilingual lexicon induction across orthographically-distinct under-resourced Dravidian languages
|
|
|
|
In: Chakravarthi, Bharathi Raja orcid:0000-0002-4575-7934 , Rajasekaran, Navaneethan, Arcan, Mihael orcid:0000-0002-3116-621X , McGuinness, Kevin orcid:0000-0003-1336-6477 , O'Connor, Noel E. orcid:0000-0002-4033-9135 and McCrae, John P. orcid:0000-0002-7227-1331 (2020) Bilingual lexicon induction across orthographically-distinct under-resourced Dravidian languages. In: 7th Workshop on NLP for Similar Languages, Varieties and Dialects, 13 Dec 2020, Barcelona, Spain (Online). (2020)
|
|
BASE
|
|
Show details
|
|
13 |
MultiMWE: building a multi-lingual multi-word expression (MWE) parallel corpora
|
|
|
|
In: Han, Lifeng orcid:0000-0002-3221-2185 , Jones, Gareth J.F. orcid:0000-0003-2923-8365 and Smeaton, Alan F. orcid:0000-0003-1028-8389 (2020) MultiMWE: building a multi-lingual multi-word expression (MWE) parallel corpora. In: 12th International Conference on Language Resources and Evaluation (LREC), 11-16 May, 2020, Marseille, France. (Virtual). (2020)
|
|
BASE
|
|
Show details
|
|
14 |
MultiMWE: building a multi-lingual multi-word expression (MWE) Pparallel corpora
|
|
|
|
In: Han, Lifeng, Gareth, Jones orcid:0000-0003-2923-8365 and Alan, Smeaton orcid:0000-0003-1028-8389 (2020) MultiMWE: building a multi-lingual multi-word expression (MWE) Pparallel corpora. In: International Conference on Language Resources and Evaluation (LREC), 11-16 May, 2020, Marseille, France. (2020)
|
|
BASE
|
|
Show details
|
|
15 |
Rapid development of competitive translation engines for access to multilingual COVID-19 information
|
|
|
|
In: Way, Andy orcid:0000-0001-5736-5930 , Haque, Rejwanul orcid:0000-0003-1680-0099 , Xie, Guodong, Gaspari, Federico orcid:0000-0003-3808-8418 , Popović, Maja orcid:0000-0001-8234-8745 and Poncelas, Alberto orcid:0000-0002-5089-1687 (2020) Rapid development of competitive translation engines for access to multilingual COVID-19 information. Informatics . ISSN 2227-9709 (2020)
|
|
BASE
|
|
Show details
|
|
16 |
Improving document-level sentiment analysis with user and product context
|
|
|
|
In: Lyu, Chenyang, Foster, Jennifer orcid:0000-0002-7789-4853 and Graham, Yvette (2020) Improving document-level sentiment analysis with user and product context. In: Proceedings of the 28th International Conference on Computational Linguistics, 8-13 Dec 20, Barcelona, Spain (Online). (2020)
|
|
BASE
|
|
Show details
|
|
17 |
Machine translation of user-generated content
|
|
Lohar, Pintu. - : Dublin City University. School of Computing, 2020. : Dublin City University. ADAPT, 2020
|
|
In: Lohar, Pintu (2020) Machine translation of user-generated content. PhD thesis, Dublin City University. (2020)
|
|
BASE
|
|
Show details
|
|
18 |
Explainable sentiment analysis application for social media crisis management in retail
|
|
|
|
In: Cirqueira, Douglas orcid:0000-0002-1283-0453 , Almeida, Fernando, Cakir, Gültekin orcid:0000-0001-9715-7167 , Jacob, Antonio orcid:0000-0002-9415-7265 , Lobato, Fabio orcid:0000-0002-6282-0368 , Bezbradica, Marija orcid:0000-0001-9366-5113 and Helfert, Markus orcid:0000-0001-6546-6408 (2020) Explainable sentiment analysis application for social media crisis management in retail. In: 4th International Conference on Computer-Human Interaction Research and Applications - Volume 1: WUDESHI-DR, 5-6 Nov 2020, Budapest, Hungry (Online). ISBN 978-989-758-480-0 (2020)
|
|
BASE
|
|
Show details
|
|
19 |
The ARK platform: enabling risk management through semantic web technologies
|
|
|
|
In: Crotti Junior, Ademar orcid:0000-0003-1025-9262 , Basereh, Maryam, Abgaz, Yalemisew orcid:0000-0002-3887-5342 , Liang, Junli, Duda, Natalia, McDonald, Nick and Brennan, Rob orcid:0000-0001-8236-362X (2020) The ARK platform: enabling risk management through semantic web technologies. In: 11th International Conference on Biomedical Ontologies (ICBO 2020), 17 Sept 2020, Bolzano, Italy (Online). (2020)
|
|
BASE
|
|
Show details
|
|
20 |
How to make neural natural language generation as reliable as templates in task-oriented dialogue
|
|
|
|
In: Elder, Henry, O'Connor, Alexander orcid:0000-0003-0301-999X and Foster, Jennifer orcid:0000-0002-7789-4853 (2020) How to make neural natural language generation as reliable as templates in task-oriented dialogue. In: 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 16-20 Nov 2020, Online. (2020)
|
|
BASE
|
|
Show details
|
|
|
|