

Hits 21 – 40 of 106

21	Neural Networks for Cross-lingual Negation Scope Detection ...
	Fancellu, Federico; Lopez, Adam; Webber, Bonnie. - : arXiv, 2018
	BASE

22	Entity-based coherence in statistical machine translation: a modelling and evaluation perspective
	Wetzel, Dominikus Emanuel. - : The University of Edinburgh, 2018
	BASE

23	Computational models for multilingual negation scope detection
	Fancellu, Federico. - : The University of Edinburgh, 2018
	BASE

24	Explicit Discourse Connectives / Implicit Discourse Relations
	Webber, Bonnie; Rohde, Hannah; Dickinson, Anna...
	In: Proceedings of the Society for Computation in Linguistics (2018)
	BASE

25	DiscoMT 2016 Shared Task on Cross-lingual Pronoun Prediction
	Guillou, Liane; Hardmeier, Christian; Nakov, Preslav. - : Uppsala University, 2017
	BASE

26	2015-2016 CoNLL Shared Task
	Xue, Nianwen; Ng, Hwee Tou; Pradhan, Sameer. - : Linguistic Data Consortium, 2017. : https://www.ldc.upenn.edu, 2017
	BASE

27	Universal Dependencies to Logical Forms with Negation Scope ...
	Fancellu, Federico; Reddy, Siva; Lopez, Adam. - : arXiv, 2017
	BASE

28	2015-2016 CoNLL Shared Task ...
	Xue, Nianwen; Ng, Hwee Tou; Pradhan, Sameer. - : Linguistic Data Consortium, 2017
	BASE

29	Incorporating pronoun function into statistical machine translation
	Guillou, Liane Kirsten. - : The University of Edinburgh, 2016
	Abstract: Pronouns are used frequently in language, and perform a range of functions. Some pronouns are used to express coreference, and others are not. Languages and genres differ in how and when they use pronouns and this poses a problem for Statistical Machine Translation (SMT) systems (Le Nagard and Koehn, 2010; Hardmeier and Federico, 2010; Novák, 2011; Guillou, 2012; Weiner, 2014; Hardmeier, 2014). Attention to date has focussed on coreferential (anaphoric) pronouns with NP antecedents, which when translated from English into a language with grammatical gender, must agree with the translation of the head of the antecedent. Despite growing attention to this problem, little progress has been made, and little attention has been given to other pronouns. The central claim of this thesis is that pronouns performing different functions in text should be handled differently by SMT systems and when evaluating pronoun translation. This motivates the introduction of a new framework to categorise pronouns according to their function: Anaphoric/cataphoric reference, event reference, extra-textual reference, pleonastic, addressee reference, speaker reference, generic reference, or other function. Labelling pronouns according to their function also helps to resolve instances of functional ambiguity arising from the same pronoun in the source language having multiple functions, each with different translation requirements in the target language. The categorisation framework is used in corpus annotation, corpus analysis, SMT system development and evaluation. I have directed the annotation and conducted analyses of a parallel corpus of English-German texts called ParCor (Guillou et al., 2014), in which pronouns are manually annotated according to their function. This provides a first step toward understanding the problems that SMT systems face when translating pronouns. 
In the thesis, I show how analysis of manual translation can prove useful in identifying and understanding systematic differences in pronoun use between two languages and can help inform the design of SMT systems. In particular, the analysis revealed that the German translations in ParCor contain more anaphoric and pleonastic pronouns than their English originals, reflecting differences in pronoun use. This raises a particular problem for the evaluation of pronoun translation. Automatic evaluation methods that rely on reference translations to assess pronoun translation will not be able to provide an adequate evaluation when the reference translation departs from the original source-language text. I also show how analysis of the output of state-of-the-art SMT systems can reveal how well current systems perform in translating different types of pronouns and indicate where future efforts would be best directed. The analysis revealed that biases in the training data, for example arising from the use of “it” and “es” as both anaphoric and pleonastic pronouns in both English and German, are a problem that SMT systems must overcome. SMT systems also need to disambiguate the function of those pronouns with ambiguous surface forms so that each pronoun may be translated in an appropriate way. To demonstrate the value of this work, I have developed an automated post-editing system in which automated tools are used to construct ParCor-style annotations over the source-language pronouns. The annotations are then used to resolve functional ambiguity for the pronoun “it” with separate rules applied to the output of a baseline SMT system for anaphoric vs. non-anaphoric instances. The system was submitted to the DiscoMT 2015 shared task on pronoun translation for English-French. As with all other participating systems, the automatic post-editing system failed to beat a simple phrase-based baseline.
A detailed analysis, including an oracle experiment in which manual annotation replaces the automated tools, was conducted to discover the causes of poor system performance. The analysis revealed that the design of the rules and their strict application to the SMT output are the biggest factors in the failure of the system. The lack of automatic evaluation metrics for pronoun translation is a limiting factor in SMT system development. To alleviate this problem, Christian Hardmeier and I have developed a testing regimen called PROTEST comprising (1) a hand-selected set of pronoun tokens categorised according to the different problems that SMT systems face and (2) an automated evaluation script. Pronoun translations can then be automatically compared against a reference translation, with mismatches referred for manual evaluation. The automatic evaluation was applied to the output of systems submitted to the DiscoMT 2015 shared task on pronoun translation. This again highlighted the weakness of the post-editing system, which performs poorly due to its focus on producing gendered pronoun translations, and its inability to distinguish between pleonastic and event reference pronouns.
	Keyword: discourse; pronouns; SMT; Statistical Machine Translation
	URL: http://hdl.handle.net/1842/20448
	BASE

30	Analysing ParCor and its Translations by State-of-the-art SMT Systems
	Guillou, Liane [author]; Webber, Bonnie [author]. - Aachen : Universitätsbibliothek der RWTH Aachen, 2015
	DNB Subject Category Language

31	Attribution: a computational approach
	Pareti, Silvia. - : The University of Edinburgh, 2015
	BASE

32	Structured and Unstructured Cache Models for SMT Domain Adaptation
	Louis, Annie P; Webber, Bonnie. - : Association for Computational Linguistics, 2014
	BASE

33	Cross-lingual genre classification
	Petrenz, Philipp. - : The University of Edinburgh, 2014
	BASE

34	Coherence relations and referential expectations
	Webber, Bonnie
	In: Theoretical linguistics. - Berlin [etc.] : de Gruyter 39 (2013) 1, 123-128
	OLC Linguistik

35	Evaluating a city exploration dialogue system combining question-answering and pedestrian navigation
	Lemon, Oliver; Janarthanam, Srini; Webber, Bonnie. - : The Association for Computational Linguistics (ACL), 2013. : Stroudsburg, PA, USA, 2013
	BASE

36	Acquiring syntactic and semantic transformations in question answering
	Kaisser, Michael. - : The University of Edinburgh, 2010
	BASE

37	Special issue on interactive question answering: introduction
	Webber, Bonnie; Webb, Nick
	In: Natural language engineering. - Cambridge : Cambridge University Press 15 (2009) 1, 1-8
	BLLDB
	OLC Linguistik

38	Computational treatment of superlatives
	Scheible, Silke. - : The University of Edinburgh, 2009
	BASE

39	Topic indexing and retrieval for open domain factoid question answering
	Ahn, Kisuh. - : The University of Edinburgh, 2009
	BASE

40	Penn Discourse Treebank Version 2.0
	Prasad, Rashmi; Lee, Alan; Dinesh, Nikhil. - : Linguistic Data Consortium, 2008. : https://www.ldc.upenn.edu, 2008
	BASE
