Page: 1 2 3 4 5 6 7 8 9... 12
83 |
Tree edit distance as a baseline approach for paraphrase representation
|
|
|
|
BASE
|
|
Show details
|
|
84 |
Graph-based Methods for Large-Scale Multilingual Knowledge Integration
|
|
de Melo, Gerard. - : Saarländische Universitäts- und Landesbibliothek, 2012
|
|
BASE
|
|
Show details
|
|
88 |
A generic and open framework for multiword expressions treatment : from acquisition to applications
|
|
|
|
Abstract:
The treatment of multiword expressions (MWEs), like take off, bus stop and big deal, is a challenge for NLP applications. This kind of linguistic construction is not only arbitrary but also much more frequent than one would initially guess. This thesis investigates the behaviour of MWEs across different languages, domains and construction types, proposing and evaluating an integrated methodological framework for their acquisition. There have been many theoretical proposals to define, characterise and classify MWEs. We adopt generic definition stating that MWEs are word combinations which must be treated as a unit at some level of linguistic processing. They present a variable degree of institutionalisation, arbitrariness, heterogeneity and limited syntactic and semantic variability. There has been much research on automatic MWE acquisition in the recent decades, and the state of the art covers a large number of techniques and languages. Other tasks involving MWEs, namely disambiguation, interpretation, representation and applications, have received less emphasis in the field. The first main contribution of this thesis is the proposal of an original methodological framework for automatic MWE acquisition from monolingual corpora. This framework is generic, language independent, integrated and contains a freely available implementation, the mwetoolkit. It is composed of independent modules which may themselves use multiple techniques to solve a specific sub-task in MWE acquisition. The evaluation of MWE acquisition is modelled using four independent axes. We underline that the evaluation results depend on parameters of the acquisition context, e.g., nature and size of corpora, language and type of MWE, analysis depth, and existing resources. The second main contribution of this thesis is the application-oriented evaluation of our methodology proposal in two applications: computer-assisted lexicography and statistical machine translation. For the former, we evaluate the usefulness of automatic MWE acquisition with the mwetoolkit for creating three lexicons: Greek nominal expressions, Portuguese complex predicates and Portuguese sentiment expressions. For the latter, we test several integration strategies in order to improve the treatment given to English phrasal verbs when translated by a standard statistical MT system into Portuguese. Both applications can benefit from automatic MWE acquisition, as the expressions acquired automatically from corpora can both speed up and improve the quality of the results. The promising results of previous and ongoing experiments encourage further investigation about the optimal way to integrate MWE treatment into other applications. Thus, we conclude the thesis with an overview of the past, ongoing and future work.
|
|
Keyword:
Computational linguistics; Corpus linguistics; Lexical acquisition; Lexicography; Linguagem natural; Linguística computacional; Machine translation; Multiword expressions; Natural language processing
|
|
URL: http://hdl.handle.net/10183/65777
|
|
BASE
|
|
Hide details
|
|
89 |
Linguistic spaces : kernel-based models of natural language
|
|
|
|
BASE
|
|
Show details
|
|
90 |
Creation and Analysis of a Reading Comprehension Exercise Corpus : Towards Evaluating Meaning in Context
|
|
|
|
BASE
|
|
Show details
|
|
92 |
The evidence-policy divide: a 'critical computational linguistics' approach to the language of 18 health agency CEOs from 9 countries
|
|
|
|
In: Faculty of Science, Medicine and Health - Papers: part A (2012)
|
|
BASE
|
|
Show details
|
|
93 |
Compositional syntax-based phrase-level polarity annotation for German
|
|
|
|
In: Klenner, M; Clematide, S; Petrakis, S; Luder, M (2012). Compositional syntax-based phrase-level polarity annotation for German. In: The 10th International Workshop on Treebanks and Linguistic Theories (TLT 2012), Heidelberg, 6 January 2012 - 7 January 2012, online. (2012)
|
|
BASE
|
|
Show details
|
|
94 |
Polarity preference of verbs: What could verbs reveal about the polarity of their objects?
|
|
|
|
In: Klenner, Manfred; Petrakis, Stefanos (2012). Polarity preference of verbs: What could verbs reveal about the polarity of their objects? In: Bouma, Gosse; Ittoo, Ashwin; Métais, Elisabeth; Wortmann, Hans. Natural Language Processing and Information Systems. 17th International Conference on Applications of Natural Language to Information Systems. Heidelberg: Springer, 35-46. (2012)
|
|
BASE
|
|
Show details
|
|
95 |
Using semantic resources to improve a syntactic dependency parser
|
|
|
|
In: Schneider, Gerold (2012). Using semantic resources to improve a syntactic dependency parser. In: LREC 2012 Conference Workshop "Semantic Relations II", Istanbul, Turkey, 22 May 2012 - 22 May 2012, 67-76. (2012)
|
|
BASE
|
|
Show details
|
|
96 |
Dependency bank
|
|
|
|
In: Lehmann, Hans Martin; Schneider, Gerold (2012). Dependency bank. In: LREC 2012 Conference Workshop "Challenges in the Management of Large Corpora", Istanbul, Turkey, 22 May 2012 - 22 May 2012, 23-28. (2012)
|
|
BASE
|
|
Show details
|
|
97 |
Semi-automatic annotation of semantic relations in a Swiss German sign language lexicon
|
|
|
|
In: Ebling, Sarah; Tissi, Katja; Volk, Martin (2012). Semi-automatic annotation of semantic relations in a Swiss German sign language lexicon. In: 5th Workshop on the Representation and Processing of Sign Languages: Interactions between Corpus and Lexicon, LREC 2012, Istanbul, 21 May 2012 - 27 May 2012, 31-36. (2012)
|
|
BASE
|
|
Show details
|
|
98 |
Syllabic Intensity Variations as Quantification of Speech Rhythm: Evidence from Both L1 and L2
|
|
|
|
In: He, Lei (2012). Syllabic Intensity Variations as Quantification of Speech Rhythm: Evidence from Both L1 and L2. In: Speech Prosody 2012, Shanghai, 22 May 2012 - 25 May 2012. s.n., 466-469. (2012)
|
|
BASE
|
|
Show details
|
|
99 |
A machine learning approach to the identification of translational language: an inquiry into translationese learning models
|
|
|
|
BASE
|
|
Show details
|
|
100 |
Extrinsic evaluation of sentence alignment systems
|
|
|
|
In: Abdul-Rauf, Sadaf; Fishel, Mark; Lambert, Patrik; Noubours, Sandra; Sennrich, Rico (2012). Extrinsic evaluation of sentence alignment systems. In: Workshop on Creating Cross-language Resources for Disconnected Languages and Styles, Istanbul, 27 May 2012 - 27 May 2012, 6-10. (2012)
|
|
BASE
|
|
Show details
|
|
Page: 1 2 3 4 5 6 7 8 9... 12
|
|