1 |
An investigation into multi-word expressions in machine translation
|
|
Han, Lifeng. - : Dublin City University. School of Computing, 2022. : Dublin City University. ADAPT, 2022
|
|
In: Han, Lifeng orcid:0000-0002-3221-2185 (2022) An investigation into multi-word expressions in machine translation. PhD thesis, Dublin City University. (2022)
|
|
Abstract:
Multi-word Expressions (MWEs) present challenges in natural language processing and computational linguistics due to their popular usage, richness in variety, idiomaticity, and non-decompositionality, which are present in the text content in which they are used. This is a typical level of expectation in the machine translation (MT) field where we require algorithms to perform a translation from one human language to another automatically while requiring high-quality output including features such as adequacy, fluency, and keeping the same or making creative and correct style decisions in that output. In this thesis, we carry out an extensive investigation into MWEs in Neural MT. Firstly, we carry out a review of relevant literature which includes experimental work on re-examining state-of-the-art models that combine knowledge of MWEs into MT systems, but with new language pairs setting to see what gaps might exist in the published literature. Secondly, we propose our new models on how to address MWE translations. This includes a design where we treat MWEs as low-frequency words and phrases translation issues, by integrating language-specific features such as strokes and radicals representation of Chinese characters into the learning model, expecting that this will facilitate improved accuracy. Thirdly, to properly examine different MT models' performances in the context of MWEs, we need to carry out a new evaluation methodology, and in light of this, we create a multilingual parallel corpus with MWE annotations (AlphaMWE). During the creation of this corpus, we classify the MT issues on MWE-related content into several categories with the expectation that this will help future MT researchers to focus on one or some of these in order to achieve a new state of the art in MT performance, ultimately moving towards human parity. Finally, we propose a new methodology for human in the loop MT evaluation with MWE considerations (HiLMeMe).
|
|
Keyword:
Machine translating
|
|
URL: http://doras.dcu.ie/26559/
|
|
BASE
|
|
Hide details
|
|
2 |
The L2L system for second language learning using visualised zoom calls among students
|
|
|
|
In: Dey-Plissonneau, Aparajita, Lee, Hyowon orcid:0000-0003-4395-7702 , Pradier, Vincent orcid:0000-0002-7050-6408 , Scriney, Michael orcid:0000-0001-6813-2630 and Smeaton, Alan F. orcid:0000-0003-1028-8389 (2021) The L2L system for second language learning using visualised zoom calls among students. In: 16th European Conference on Technology-Enhanced Learning EC-TEL 2021, 20-24 Sept 2021, Bozen-Bolzano, Italy (Online). ISBN 978-3-030-86435-4 (2021)
|
|
BASE
|
|
Show details
|
|
3 |
Utilising visual attention cues for vehicle detection and tracking
|
|
|
|
In: Hu, Feiyan orcid:0000-0001-7451-6438 , Gurram Munirathnam, Venkatesh orcid:0000-0002-4393-9267 , O'Connor, Noel E. orcid:0000-0002-4033-9135 , Smeaton, Alan F. orcid:0000-0003-1028-8389 and Little, Suzanne orcid:0000-0003-3281-3471 (2021) Utilising visual attention cues for vehicle detection and tracking. In: 25th International Conference on Pattern Recognition (ICPR2020), 10-15 Jan 2021, Milan, Italy (Online). (2021)
|
|
BASE
|
|
Show details
|
|
4 |
Facilitating reflection in teletandem through automatically generated conversation metrics and playback video
|
|
|
|
In: Dey-Plissonneau, Aparajita, Lee, Hyowon orcid:0000-0001-7628-1441 , Scriney, Michael orcid:0000-0001-6813-2630 , Smeaton, Alan F. orcid:0000-0001-6339-6194 , Pradier, Vincent orcid:0000-0002-7050-6408 and Riaz, Hamza (2021) Facilitating reflection in teletandem through automatically generated conversation metrics and playback video. In: EuroCALL: Computer Aided Language Learning, August 2021, Paris, France. (In Press) (2021)
|
|
BASE
|
|
Show details
|
|
5 |
Attention based video summaries of live online zoom classes
|
|
|
|
In: Lee, Hyowon orcid:0000-0003-4395-7702 , Liu, Mingming orcid:0000-0002-8988-2104 , Riaz, Hamza, Rajasekaran, Navaneethan, Scriney, Michael orcid:0000-0001-6813-2630 and Smeaton, Alan F. orcid:0000-0003-1028-8389 (2021) Attention based video summaries of live online zoom classes. In: AAAI-2021 Workshop on AI Education: "Imagining Post-COVID Education with AI" (TIPCE-2021)., 9 Feb 2021, Online (Vancouver, Canada). (In Press) (2021)
|
|
BASE
|
|
Show details
|
|
6 |
Chinese character decomposition for neural MT with multi-word expressions
|
|
|
|
In: Han, Lifeng orcid:0000-0002-3221-2185 , Jones, Gareth J.F. orcid:0000-0003-2923-8365 , Smeaton, Alan F. orcid:0000-0003-1028-8389 and Bolzoni, Paolo (2021) Chinese character decomposition for neural MT with multi-word expressions. In: 23rd Nordic Conference on Computational Linguistics (NoDaLiDa 2021), 31 May- 2 June 2021, Reykjavik, Iceland (Online). (In Press) (2021)
|
|
BASE
|
|
Show details
|
|
7 |
Translation quality assessment: a brief survey on manual and automatic methods
|
|
|
|
In: Han, Lifeng orcid:0000-0002-3221-2185 , Jones, Gareth J.F. orcid:0000-0003-2923-8365 and Smeaton, Alan F. orcid:0000-0003-1028-8389 (2021) Translation quality assessment: a brief survey on manual and automatic methods. In: MoTra21: Workshop on Modelling Translation: Translatology in the Digital Age, 31 May- 2 Jun 2021, Rejkjavik, Iceland (Online). (In Press) (2021)
|
|
BASE
|
|
Show details
|
|
8 |
Translation Quality Assessment: A Brief Survey on Manual and Automatic Methods ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
AlphaMWE: construction of multilingual parallel corpora with MWE annotations
|
|
|
|
In: Han, Lifeng orcid:0000-0002-3221-2185 , Jones, Gareth J.F. orcid:0000-0003-2923-8365 and Smeaton, Alan F. orcid:0000-0003-1028-8389 (2020) AlphaMWE: construction of multilingual parallel corpora with MWE annotations. In: Joint Workshop on Multiword Expressions and Electronic Lexicons (MWE-LEX 2020), 13 Dec 2020, Barcelona, Spain (Online). (2020)
|
|
BASE
|
|
Show details
|
|
10 |
MultiMWE: building a multi-lingual multi-word expression (MWE) parallel corpora
|
|
|
|
In: Han, Lifeng orcid:0000-0002-3221-2185 , Jones, Gareth J.F. orcid:0000-0003-2923-8365 and Smeaton, Alan F. orcid:0000-0003-1028-8389 (2020) MultiMWE: building a multi-lingual multi-word expression (MWE) parallel corpora. In: 12th International Conference on Language Resources and Evaluation (LREC), 11-16 May, 2020, Marseille, France. (Virtual). (2020)
|
|
BASE
|
|
Show details
|
|
11 |
Generative forms of multimedia content. (Opening keynote talk)
|
|
|
|
In: Smeaton, Alan F. orcid:0000-0003-1028-8389 (2020) Generative forms of multimedia content. (Opening keynote talk). In: IEEE International Conference on Multimedia and Expo - ICME 2020, 6-9 July 2020, London, UK (virtual). (2020)
|
|
BASE
|
|
Show details
|
|
12 |
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel Corpora ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Recognising Irish sign language using electromyography
|
|
|
|
In: Galea, Laura Christina and Smeaton, Alan F. orcid:0000-0003-1028-8389 (2019) Recognising Irish sign language using electromyography. In: 17th IEEE Content Based Multimedia Indexing (CBMI) Conference, 4-6 Sept 2019, Dublin, Ireland. ISBN 978-1-7281-4673-7 (2019)
|
|
BASE
|
|
Show details
|
|
15 |
Neural machine translation for multimodal interaction
|
|
Dutta Chowdhury, Koel. - : Dublin City University. School of Computing, 2019. : Dublin City University. ADAPT, 2019
|
|
In: Dutta Chowdhury, Koel (2019) Neural machine translation for multimodal interaction. Master of Science thesis, Dublin City University. (2019)
|
|
BASE
|
|
Show details
|
|
16 |
Rethinking summarization and storytelling for modern social multimedia
|
|
|
|
In: Rudinac, Stevan, Chua, Tat-Seng, Diaz-Ferreyra, Nicolas, Friedland, Gerald, Gornostaja, Tatjana, Huet, Benoit, Kaptein, Rianne, Lindén, Krister, Moens, Marie-Francine, Peltonen, Jaakko, Redi, Miriam, Schedl, Markus, Shamma, David A, Smeaton, Alan F. orcid:0000-0003-1028-8389 and Xie, Lexing (2018) Rethinking summarization and storytelling for modern social multimedia. In: The 24th International Conference on Multimedia Modeling (MMM2018), 5-7 Feb, 2018, Bangkok, Thailand. ISBN 978-3-319-73599-3 (2018)
|
|
BASE
|
|
Show details
|
|
17 |
Evaluation of automatic video captioning using direct assessment
|
|
|
|
In: Graham, Yvette, Awad, George M. and Smeaton, Alan F. orcid:0000-0003-1028-8389 (2018) Evaluation of automatic video captioning using direct assessment. PLoS One . ISSN 1932-6203 (2018)
|
|
BASE
|
|
Show details
|
|
18 |
Dublin City University participation in the VTT track at TRECVid 2017
|
|
|
|
In: Afli, Haithem orcid:0000-0002-7449-4707 , Hu, Feiyan orcid:0000-0001-7451-6438 , Du, Jinhua orcid:0000-0002-3267-4881 , Cosgrove, Daniel, McGuinness, Kevin orcid:0000-0003-1336-6477 , O'Connor, Noel E. orcid:0000-0002-4033-9135 , Arazo Sánchez, Eric, Zhou, Jiang orcid:0000-0002-3067-8512 and Smeaton, Alan F. orcid:0000-0003-1028-8389 (2017) Dublin City University participation in the VTT track at TRECVid 2017. In: TRECVid workshop, 13-15 Nov 2017, Gaithersburg, Md., USA. (2017)
|
|
BASE
|
|
Show details
|
|
19 |
Dublin City University and partners’ participation in the INS and VTT tracks at TRECVid 2016
|
|
|
|
In: Marsden, Mark, Mohedano, Eva, McGuinness, Kevin orcid:0000-0003-1336-6477 , Calafell, Andrea, Giró-i-Nieto, Xavier orcid:0000-0002-9935-5332 , O'Connor, Noel E. orcid:0000-0002-4033-9135 , Zhou, Jiang orcid:0000-0002-3067-8512 , Azevedo, Lucas, Daudert, Tobias, Davis, Brian, Hurlimann, Manuela, Afli, Haithem orcid:0000-0002-7449-4707 , Du, Jinhua, Ganguly, Debasis orcid:0000-0003-0050-7138 , Li, Wei B. orcid:0000-0001-7347-3501 , Way, Andy orcid:0000-0001-5736-5930 and Smeaton, Alan F. orcid:0000-0003-1028-8389 (2016) Dublin City University and partners’ participation in the INS and VTT tracks at TRECVid 2016. In: TRECVid Conference, 14-16 Nov 2016, Gaithersburg, Md., USA. (2016)
|
|
BASE
|
|
Show details
|
|
20 |
Dublin City University and Partners' participation in the INS and VTT Tracks at TRECVid 2016
|
|
|
|
BASE
|
|
Show details
|
|
|
|