41 |
Packed rules for automatic transfer-rule induction
|
|
|
|
In: Graham, Yvette and van Genabith, Josef (2008) Packed rules for automatic transfer-rule induction. In: the European Association of Machine Translation Conference 2008, Hamburg, Germany. (2008)
|
|
BASE
|
|
Show details
|
|
42 |
Wide-coverage deep statistical parsing using automatic dependency structure annotation
|
|
|
|
In: Cahill, Aoife orcid:0000-0002-3519-7726 , Burke, Michael, O'Donovan, Ruth, Riezler, Stefan, van Genabith, Josef orcid:0000-0003-1322-7944 and Way, Andy orcid:0000-0001-5736-5930 (2008) Wide-coverage deep statistical parsing using automatic dependency structure annotation. Computational Linguistics, 34 (1). pp. 81-124. (2008)
|
|
BASE
|
|
Show details
|
|
43 |
A novel dependency-based evaluation metric for machine translation
|
|
|
|
In: Owczarzak, Karolina (2008) A novel dependency-based evaluation metric for machine translation. PhD thesis, Dublin City University. (2008)
|
|
Abstract:
Automatic evaluation measures such as BLEU (Papineni et al. (2002)) and NIST (Doddington (2002)) are indispensable in the development of Machine Translation (MT) systems, because they allow MT developers to conduct frequent, fast, and cost-effective evaluations of their evolving translation models. However, most of the automatic evaluation metrics rely on a comparison of word strings, measuring only the surface similarity of the candidate and reference translations, and will penalize any divergence. In effect,a candidate translation expressing the source meaning accurately and fluently will be given a low score if the lexical and syntactic choices it contains, even though perfectly legitimate, are not present in at least one of the references. Necessarily, this score would differ from a much more favourable human judgment that such a translation would receive. This thesis presents a method that automatically evaluates the quality of translation based on the labelled dependency structure of the sentence, rather than on its surface form. Dependencies abstract away from the some of the particulars of the surface string realization and provide a more "normalized" representation of (some) syntactic variants of a given sentence. The translation and reference files are analyzed by a treebank-based, probabilistic Lexical-Functional Grammar (LFG) parser (Cahill et al. (2004)) for English, which produces a set of dependency triples for each input. The translation set is compared to the reference set, and the number of matches is calculated, giving the precision, recall, and f-score for that particular translation. The use of WordNet synonyms and partial matching during the evaluation process allows for adequate treatment of lexical variation, while employing a number of best parses helps neutralize the noise introduced during the parsing stage. The dependency-based method is compared against a number of other popular MT evaluation metrics, including BLEU, NIST, GTM (Turian et al. (2003)), TER (Snover et al. (2006)), and METEOR (Banerjee and Lavie (2005)), in terms of segment- and system-level correlations with human judgments of fluency and adequacy. We also examine whether it shows bias towards statistical MT models. The comparison of the dependency-based method with other evaluation metrics is then extended to languages other than English: French, German, Spanish, and Japanese, where we apply our method to dependencies generated by Microsoft's NLPWin analyzer (Corston-Oliver and Dolan (1999); Heidorn (2000)) as well as, in the case of the Spanish data, those produced by the treebank-based, probabilistic LFG parser of Chrupa la and van Genabith (2006a,b).
|
|
Keyword:
Machine translating
|
|
URL: http://doras.dcu.ie/484/
|
|
BASE
|
|
Hide details
|
|
44 |
Parser evaluation and the BNC: evaluating 4 constituency parsers with 3 metrics
|
|
|
|
In: Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef (2008) Parser evaluation and the BNC: evaluating 4 constituency parsers with 3 metrics. In: LREC 2008 - Sixth International Conference on Language Resources and Evaluation, 28-30 May 2008, Marrakech, Morocco. (2008)
|
|
BASE
|
|
Show details
|
|
45 |
Learning morphology with Morfette
|
|
|
|
In: Chrupała, Grzegorz, Dinu, Georgiana and van Genabith, Josef (2008) Learning morphology with Morfette. In: LREC 2008 - Sixth International Conference on Language Resources and Evaluation, 28-30 May 2008, Marrakech, Morocco. (2008)
|
|
BASE
|
|
Show details
|
|
46 |
Dependency-based n-gram models for general purpose sentence realisation
|
|
|
|
In: Guo, Yuqing, van Genabith, Josef and Wang, Haifeng (2008) Dependency-based n-gram models for general purpose sentence realisation. In: COLING 2008 - 22nd International Conference on Computational Linguistics, 18-22 August 2008, Manchester, UK. (2008)
|
|
BASE
|
|
Show details
|
|
47 |
Parser-based retraining for domain adaptation of probabilistic generators
|
|
|
|
In: Hogan, Deirdre, Foster, Jennifer orcid:0000-0002-7789-4853 , Wagner, Joachim orcid:0000-0002-8290-3849 and van Genabith, Josef (2008) Parser-based retraining for domain adaptation of probabilistic generators. In: INLG 08 - 5th International Natural Language Generation Conference, 12-14 June 2008, Salt Fork, Ohio, USA. (2008)
|
|
BASE
|
|
Show details
|
|
48 |
Recovering non-local dependencies for Chinese
|
|
|
|
In: Guo, Yuqing, Wang, Haifeng and van Genabith, Josef (2007) Recovering non-local dependencies for Chinese. In: EMNLP-CoNLL 2007 - Joint Meeting of the Conference on Empirical Methods in Natural Language Processing and the Conference on Computational Natural Language Learning, 28-30 June 2007, Prague, Czech Republic. (2007)
|
|
BASE
|
|
Show details
|
|
49 |
Preparing, restructuring, and augmenting a French treebank: lexicalised parsers or coherent treebanks?
|
|
|
|
In: Schluter, Natalie and van Genabith, Josef (2007) Preparing, restructuring, and augmenting a French treebank: lexicalised parsers or coherent treebanks? In: PACLING 2007 - 10th Conference of the Pacific Association for Computational Linguistics, 19-21 September , 2007, Melbourne, Australia. (2007)
|
|
BASE
|
|
Show details
|
|
50 |
Treebank-based acquisition of LFG resources for Chinese
|
|
|
|
In: Guo, Yuqing, van Genabith, Josef and Wang, Haifeng (2007) Treebank-based acquisition of LFG resources for Chinese. In: Lexical Functional Grammar 2007, 28-30 July 2007, California, USA. (2007)
|
|
BASE
|
|
Show details
|
|
51 |
C-structures and f-structures for the British national corpus
|
|
|
|
In: Wagner, Joachim orcid:0000-0002-8290-3849 , Seddah, Djamé, Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef (2007) C-structures and f-structures for the British national corpus. In: Lexical Functional Grammar 2007, 28-30 July 2007, California, USA. (2007)
|
|
BASE
|
|
Show details
|
|
52 |
TransBooster:black box optimisation of machine translation systems
|
|
|
|
In: Mellebeek, Bart (2007) TransBooster:black box optimisation of machine translation systems. PhD thesis, Dublin City University. (2007)
|
|
BASE
|
|
Show details
|
|
53 |
Dependency-based automatic evaluation for machine translation
|
|
|
|
In: Owczarzak, Karolina, van Genabith, Josef and Way, Andy orcid:0000-0001-5736-5930 (2007) Dependency-based automatic evaluation for machine translation. In: HLT-NAACL 2007 - Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, New York, USA. (2007)
|
|
BASE
|
|
Show details
|
|
54 |
Labelled dependencies in machine translation evaluation
|
|
|
|
In: Owczarzak, Karolina, van Genabith, Josef and Way, Andy orcid:0000-0001-5736-5930 (2007) Labelled dependencies in machine translation evaluation. In: ACL 2007 Workshop on Statistical Machine Translation, 23 June 2007, Prague, Czech Republic. (2007)
|
|
BASE
|
|
Show details
|
|
55 |
Using very large corpora to detect raising and control verbs
|
|
|
|
In: Chrupała, Grzegorz and van Genabith, Josef (2007) Using very large corpora to detect raising and control verbs. In: Lexical Functional Grammar 2007, 28-30 July 2007, California, USA. (2007)
|
|
BASE
|
|
Show details
|
|
56 |
Using F-structures in machine translation evaluation
|
|
|
|
In: Owczarzak, Karolina, van Genabith, Josef, Graham, Yvette and Way, Andy orcid:0000-0001-5736-5930 (2007) Using F-structures in machine translation evaluation. In: Lexical Functional Grammar 2007, 28-30 July 2007, California, USA. (2007)
|
|
BASE
|
|
Show details
|
|
57 |
Exploiting multi-word units in history-based probabilistic generation
|
|
|
|
In: Hogan, Deirdre, Cafferkey, Conor, Cahill, Aoife orcid:0000-0002-3519-7726 and van Genabith, Josef (2007) Exploiting multi-word units in history-based probabilistic generation. In: EMNLP-CoNLL 2007 - Joint Meeting of the Conference on Empirical Methods in Natural Language Processing and the Conference on Computational Natural Language Learning, 28-30 June 2007, Prague, Czech Republic. (2007)
|
|
BASE
|
|
Show details
|
|
58 |
Adapting WSJ-trained parsers to the British national corpus using in-domain self-training
|
|
|
|
In: Foster, Jennifer orcid:0000-0002-7789-4853 , Wagner, Joachim orcid:0000-0002-8290-3849 , Seddah, Djamé and van Genabith, Josef (2007) Adapting WSJ-trained parsers to the British national corpus using in-domain self-training. In: IWPT 2007 - 10th International Conference of Parsing Technology, 23-24 June 2007, Prague, Czech Republic. (2007)
|
|
BASE
|
|
Show details
|
|
59 |
A comparative evaluation of deep and shallow approaches to the automatic detection of common grammatical errors
|
|
|
|
In: Wagner, Joachim orcid:0000-0002-8290-3849 , Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef (2007) A comparative evaluation of deep and shallow approaches to the automatic detection of common grammatical errors. In: EMNLP-CoNLL 2007 - Joint Meeting of the Conference on Empirical Methods in Natural Language Processing and the Conference on Computational Natural Language Learning, 28-30 June 2007, Prague, Czech Republic. (2007)
|
|
BASE
|
|
Show details
|
|
60 |
Treebank annotation schemes and parser evaluation for German
|
|
|
|
In: Rehbein, Ines and van Genabith, Josef (2007) Treebank annotation schemes and parser evaluation for German. In: EMNLP-CoNLL 2007 - Joint Meeting of the Conference on Empirical Methods in Natural Language Processing and the Conference on Computational Natural Language Learning, 28-30 June 2007, Prague, Czech Republic. (2007)
|
|
BASE
|
|
Show details
|
|
|
|