81 |
Parsing Arabic using treebank-based LFG resources
In: Tounsi, Lamia, Attia, Mohammed and van Genabith, Josef orcid:0000-0003-1322-7944 (2009) Parsing Arabic using treebank-based LFG resources. In: Lexical Functional Grammar 2009, 13-16 July 2009, Cambridge, UK. (2009)
82 |
Experiments on domain adaptation for English-Hindi SMT
In: Haque, Rejwanul orcid:0000-0003-1680-0099, Naskar, Sudip Kumar, van Genabith, Josef orcid:0000-0003-1322-7944 and Way, Andy orcid:0000-0001-5736-5930 (2009) Experiments on domain adaptation for English-Hindi SMT. In: PACLIC 23 - the 23rd Pacific Asia Conference on Language, Information and Computation, 3-5 December 2009, Hong Kong. (2009)
83 |
Dependency parsing resources for French: Converting acquired lexical functional grammar F-Structure annotations and parsing F-Structures directly
In: Schluter, Natalie and van Genabith, Josef orcid:0000-0003-1322-7944 (2009) Dependency parsing resources for French: Converting acquired lexical functional grammar F-Structure annotations and parsing F-Structures directly. In: Nodalida 2009 Conference, 14 - 16 May 2009, Odense, Denmark. (2009)
84 |
Treebank-based acquisition of Chinese LFG resources for parsing and generation
Guo, Yuqing. Dublin City University, School of Computing, 2009
In: Guo, Yuqing (2009) Treebank-based acquisition of Chinese LFG resources for parsing and generation. PhD thesis, Dublin City University. (2009)
85 |
F-structure transfer-based statistical machine translation
In: Graham, Yvette, van Genabith, Josef orcid:0000-0003-1322-7944 and Bryl, Anton (2009) F-structure transfer-based statistical machine translation. In: Lexical Functional Grammar 2009, 13-16 July 2009, Cambridge, UK. (2009)
86 |
Automatic acquisition of LFG resources for German - as good as it gets
In: Rehbein, Ines and van Genabith, Josef orcid:0000-0003-1322-7944 (2009) Automatic acquisition of LFG resources for German - as good as it gets. In: Lexical Functional Grammar 2009, 13-16 July 2009, Cambridge, UK. (2009)
87 |
Part-of-speech tagging and partial parsing for Irish using finite-state transducers and constraint grammar
In: Uí Dhonnchadha, Elaine (2009) Part-of-speech tagging and partial parsing for Irish using finite-state transducers and constraint grammar. PhD thesis, Dublin City University. (2009)
Abstract:
In this thesis, we present the development and evaluation of a suite of annotation tools for unrestricted Irish text, which range from tokenization, morphological analysis and part-of-speech tagging through to partial parsing. In order to develop such tools, a large body of texts is required for testing purposes. We therefore begin by describing our involvement in the creation of a 30 million word corpus of Irish texts (New Corpus for Ireland). From this corpus, we randomly extracted 3,000 sentences which we annotated and manually corrected in order to create a Gold Standard Corpus for evaluation purposes. We then present the annotation tools. Firstly, we describe scaling a proof-of-concept implementation of finite-state tokenization and morphological analysis based on Xerox Finite State Tools (Uí Dhonnchadha, 2002, p. 146) to unrestricted text. After semi-automatic population of the finite-state morphology (FSM) lexical resources, the morphological analyser contains a lexicon of 30K lemmas which, together with a set of morphological guessers, assigns at least one morphological analysis to all tokens in unrestricted texts. Following this, we describe our POS tagger for Irish, implemented using Constraint Grammar Disambiguation Rules and the vislcg2 software. The POS tagger currently achieves an f-score of 95% on development data and 94.35% on unseen test data. This tagger has been used to tag the 30 million word corpus of Irish. Finally, we present our implementation of partial parsing, which is a combination of dependency analysis overlaid with finite-state chunking. As this is, to our knowledge, the first attempt at implementing a partial parser for Irish, there were no guidelines or precedents available. The dependency analysis uses Constraint Grammar Dependency Mapping Rules, and the chunker is implemented using regular expressions and Xerox Finite-State Tools. The dependency analysis currently achieves an f-score of 93.60% on development data and 94.28% on unseen test data. The chunker achieves an f-score of 97.20% on development data and 93.50% on unseen test data.
Keyword:
Computational linguistics; constraint grammar; finite-state; Irish; morphology; partial parsing; POS tagging
URL: http://doras.dcu.ie/2349/
88 |
Automatic treebank-based acquisition of Arabic LFG dependency structures
In: Tounsi, Lamia, Attia, Mohammed and van Genabith, Josef orcid:0000-0003-1322-7944 (2009) Automatic treebank-based acquisition of Arabic LFG dependency structures. In: EACL 2009 Workshop on Computational Approaches to Semitic Languages, 31 March 2009, Athens, Greece. (2009)
89 |
Treebank-based grammar acquisition for German
Rehbein, Ines. Dublin City University, National Centre for Language Technology (NCLT), 2009; Dublin City University, School of Computing, 2009
In: Rehbein, Ines (2009) Treebank-based grammar acquisition for German. PhD thesis, Dublin City University. (2009)
90 |
Judging grammaticality: experiments in sentence classification
In: Wagner, Joachim orcid:0000-0002-8290-3849, Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef orcid:0000-0003-1322-7944 (2009) Judging grammaticality: experiments in sentence classification. CALICO Journal, 26 (3). pp. 474-490. ISSN 0742-7778 (2009)
91 |
A hybrid filtering approach for question answering
In: Adafre, Sisay Fissaha and van Genabith, Josef orcid:0000-0003-1322-7944 (2009) A hybrid filtering approach for question answering. In: LFG-09 - 14th International Lexical Functional Grammar Conference, 13-16 July 2009, Cambridge, UK. (2009)
92 |
Towards a machine-learning architecture for lexical functional grammar parsing
In: Chrupała, Grzegorz (2008) Towards a machine-learning architecture for lexical functional grammar parsing. PhD thesis, Dublin City University. (2008)
93 |
Adapting a WSJ-trained parser to grammatically noisy text
In: Foster, Jennifer orcid:0000-0002-7789-4853, Wagner, Joachim orcid:0000-0002-8290-3849 and van Genabith, Josef (2008) Adapting a WSJ-trained parser to grammatically noisy text. In: ACL-08:HLT - 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 15-20 June 2008, Columbus, USA. (2008)
94 |
Treebank-based acquisition of LFG parsing resources for French
In: Schluter, Natalie and van Genabith, Josef (2008) Treebank-based acquisition of LFG parsing resources for French. In: the Sixth International Language Resources and Evaluation Conference (LREC'08), May 28-30, 2008, Marrakech, Morocco. (2008)
95 |
Accurate and robust LFG-based generation for Chinese
In: Guo, Yuqing, Wang, Haifeng and van Genabith, Josef (2008) Accurate and robust LFG-based generation for Chinese. In: INLG 08 - 5th International Natural Language Generation Conference, 12-14 June 2008, Salt Fork, Ohio, USA. (2008)
96 |
Packed rules for automatic transfer-rule induction
In: Graham, Yvette and van Genabith, Josef (2008) Packed rules for automatic transfer-rule induction. In: the European Association of Machine Translation Conference 2008, Hamburg, Germany. (2008)
97 |
Exploiting multi-word units in statistical parsing and generation
In: Cafferkey, Conor (2008) Exploiting multi-word units in statistical parsing and generation. Master of Science thesis, Dublin City University. (2008)
98 |
Wide-coverage deep statistical parsing using automatic dependency structure annotation
In: Cahill, Aoife orcid:0000-0002-3519-7726, Burke, Michael, O'Donovan, Ruth, Riezler, Stefan, van Genabith, Josef orcid:0000-0003-1322-7944 and Way, Andy orcid:0000-0001-5736-5930 (2008) Wide-coverage deep statistical parsing using automatic dependency structure annotation. Computational Linguistics, 34 (1). pp. 81-124. (2008)
99 |
A novel dependency-based evaluation metric for machine translation
In: Owczarzak, Karolina (2008) A novel dependency-based evaluation metric for machine translation. PhD thesis, Dublin City University. (2008)
100 |
Parser evaluation and the BNC: evaluating 4 constituency parsers with 3 metrics
In: Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef (2008) Parser evaluation and the BNC: evaluating 4 constituency parsers with 3 metrics. In: LREC 2008 - Sixth International Conference on Language Resources and Evaluation, 28-30 May 2008, Marrakech, Morocco. (2008)