1 |
f-align: An open-source alignment tool for LFG f-structures
|
|
|
|
In: Bryl, Anton and van Genabith, Josef orcid:0000-0003-1322-7944 (2010) f-align: An open-source alignment tool for LFG f-structures. In: AMTA, 31 Oct - 4th Nov 2010, Denver, Colorado. (2010)
|
|
BASE
|
|
Show details
|
|
2 |
Closing the gap between stochastic and rule-based LFG grammars
|
|
|
|
In: Hautli, Annette, Cetinoglu, Ozlem and van Genabith, Josef orcid:0000-0003-1322-7944 (2010) Closing the gap between stochastic and rule-based LFG grammars. In: the LFG10 Conference, 18-20 July 2010, Ottowa, Canada. (2010)
|
|
BASE
|
|
Show details
|
|
3 |
Treebank-based automatic acquisition of wide coverage, deep linguistic resources for Japanese
|
|
Oya, Masanori. - : Dublin City University. National Centre for Language Technology (NCLT), 2010. : Dublin City University. School of Computing, 2010
|
|
In: Oya, Masanori (2010) Treebank-based automatic acquisition of wide coverage, deep linguistic resources for Japanese. Master of Science thesis, Dublin City University. (2010)
|
|
BASE
|
|
Show details
|
|
4 |
Dependency parsing resources for French: Converting acquired lexical functional grammar F-Structure annotations and parsing F-Structures directly
|
|
|
|
In: Schluter, Natalie and van Genabith, Josef orcid:0000-0003-1322-7944 (2009) Dependency parsing resources for French: Converting acquired lexical functional grammar F-Structure annotations and parsing F-Structures directly. In: Nodalida 2009 Conference, 14 - 16 May 2009, Odense, Denmark. (2009)
|
|
BASE
|
|
Show details
|
|
5 |
Treebank-based acquisition of Chinese LFG resources for parsing and generation
|
|
Guo, Yuqing. - : Dublin City University. School of Computing, 2009
|
|
In: Guo, Yuqing (2009) Treebank-based acquisition of Chinese LFG resources for parsing and generation. PhD thesis, Dublin City University. (2009)
|
|
BASE
|
|
Show details
|
|
6 |
Treebank-based grammar acquisition for German
|
|
Rehbein, Ines. - : Dublin City University. National Centre for Language Technology (NCLT), 2009. : Dublin City University. School of Computing, 2009
|
|
In: Rehbein, Ines (2009) Treebank-based grammar acquisition for German. PhD thesis, Dublin City University. (2009)
|
|
Abstract:
Manual development of deep linguistic resources is time-consuming and costly and therefore often described as a bottleneck for traditional rule-based NLP. In my PhD thesis I present a treebank-based method for the automatic acquisition of LFG resources for German. The method automatically creates deep and rich linguistic presentations from labelled data (treebanks) and can be applied to large data sets. My research is based on and substantially extends previous work on automatically acquiring wide-coverage, deep, constraint-based grammatical resources from the English Penn-II treebank (Cahill et al.,2002; Burke et al., 2004; Cahill, 2004). Best results for English show a dependency f-score of 82.73% (Cahill et al., 2008) against the PARC 700 dependency bank, outperforming the best hand-crafted grammar of Kaplan et al. (2004). Preliminary work has been carried out to test the approach on languages other than English, providing proof of concept for the applicability of the method (Cahill et al., 2003; Cahill, 2004; Cahill et al., 2005). While first results have been promising, a number of important research questions have been raised. The original approach presented first in Cahill et al. (2002) is strongly tailored to English and the datastructures provided by the Penn-II treebank (Marcus et al., 1993). English is configurational and rather poor in inflectional forms. German, by contrast, features semi-free word order and a much richer morphology. Furthermore, treebanks for German differ considerably from the Penn-II treebank as regards data structures and encoding schemes underlying the grammar acquisition task. In my thesis I examine the impact of language-specific properties of German as well as linguistically motivated treebank design decisions on PCFG parsing and LFG grammar acquisition. I present experiments investigating the influence of treebank design on PCFG parsing and show which type of representations are useful for the PCFG and LFG grammar acquisition tasks. Furthermore, I present a novel approach to cross-treebank comparison, measuring the effect of controlled error insertion on treebank trees and parser output from different treebanks. I complement the cross-treebank comparison by providing a human evaluation using TePaCoC, a new testsuite for testing parser performance on complex grammatical constructions. Manual evaluation on TePaCoC data provides new insights on the impact of flat vs. hierarchical annotation schemes on data-driven parsing. I present treebank-based LFG acquisition methodologies for two German treebanks. An extensive evaluation along different dimensions complements the investigation and provides valuable insights for the future development of treebanks.
|
|
Keyword:
Computational linguistics; German; German language; grammar acquisistion; lexical-functional grammar; LFG; parsing; PCFG; treebanks
|
|
URL: http://doras.dcu.ie/14900/
|
|
BASE
|
|
Hide details
|
|
7 |
Treebank-based acquisition of LFG parsing resources for French
|
|
|
|
In: Schluter, Natalie and van Genabith, Josef (2008) Treebank-based acquisition of LFG parsing resources for French. In: the Sixth International Language Resources and Evaluation Conference (LREC'08), May 28-30, 2008, Marrakech, Morocco. (2008)
|
|
BASE
|
|
Show details
|
|
8 |
Packed rules for automatic transfer-rule induction
|
|
|
|
In: Graham, Yvette and van Genabith, Josef (2008) Packed rules for automatic transfer-rule induction. In: the European Association of Machine Translation Conference 2008, Hamburg, Germany. (2008)
|
|
BASE
|
|
Show details
|
|
9 |
German particle verbs and pleonastic prepositions
|
|
|
|
In: Rehbein, Ines and van Genabith, Josef (2006) German particle verbs and pleonastic prepositions. In: Third ACL-SIGSEM Workshop on Prepositions, 3 April 2006, Trento, Italy. (2006)
|
|
BASE
|
|
Show details
|
|
10 |
Robust PCFG-based generation using automatically acquired LFG approximations
|
|
|
|
In: Cahill, Aoife orcid:0000-0002-3519-7726 and van Genabith, Josef (2006) Robust PCFG-based generation using automatically acquired LFG approximations. In: COLING/ACL 2006 - 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, 17-21 July 2006, Sydney, Australia. (2006)
|
|
BASE
|
|
Show details
|
|
11 |
Automatic extraction of large-scale multilingual lexical resources
|
|
|
|
In: O'Donovan, Ruth (2006) Automatic extraction of large-scale multilingual lexical resources. PhD thesis, Dublin City University. (2006)
|
|
BASE
|
|
Show details
|
|
12 |
Automatic treebank annotation for the acquisition of LFG resources
|
|
|
|
In: Burke, Michael (2006) Automatic treebank annotation for the acquisition of LFG resources. PhD thesis, Dublin City University. (2006)
|
|
BASE
|
|
Show details
|
|
13 |
Parsing with automatically acquired, wide-coverage, robust, probabilistic LFG approximations
|
|
Cahill, Aoife. - : Dublin City University. School of Computing, 2004
|
|
In: Cahill, Aoife orcid:0000-0002-3519-7726 (2004) Parsing with automatically acquired, wide-coverage, robust, probabilistic LFG approximations. PhD thesis, Dublin City University. (2004)
|
|
BASE
|
|
Show details
|
|
14 |
Design and evaluation of the linguistic basis of an automatic F-struture annotation algorithm for the Penn-II treebank
|
|
|
|
In: McCarthy, Mairéad (2003) Design and evaluation of the linguistic basis of an automatic F-struture annotation algorithm for the Penn-II treebank. Master of Science thesis, Dublin City University. (2003)
|
|
BASE
|
|
Show details
|
|
|
|