1 |
Chinese character decomposition for neural MT with multi-word expressions
|
|
|
|
In: Han, Lifeng orcid:0000-0002-3221-2185 , Jones, Gareth J.F. orcid:0000-0003-2923-8365 , Smeaton, Alan F. orcid:0000-0003-1028-8389 and Bolzoni, Paolo (2021) Chinese character decomposition for neural MT with multi-word expressions. In: 23rd Nordic Conference on Computational Linguistics (NoDaLiDa 2021), 31 May- 2 June 2021, Reykjavik, Iceland (Online). (In Press) (2021)
|
|
BASE
|
|
Show details
|
|
2 |
Dependency Patterns of Complex Sentences and Semantic Disambiguation for Abstract Meaning Representation Parsing ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
One Semantic Parser to Parse Them All: Sequence to Sequence Multi-Task Learning on Semantic Parsing Datasets ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Modeling Sense Structure in Word Usage Graphs with the Weighted Stochastic Block Model ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
InFillmore: Frame-Guided Language Generation with Bidirectional Context ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Learning Embeddings for Rare Words Leveraging Internet Search Engine and Spatial Location Relationships ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Evaluating Universal Dependency Parser Recovery of Predicate Argument Structure via CompChain Analysis ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
ParsFEVER : a Dataset for Farsi Fact Extraction and Verification ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Did the Cat Drink the Coffee? Challenging Transformers with Generalized Event Knowledge ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Teach the Rules, Provide the Facts: Targeted Relational-knowledge Enhancement for Textual Inference ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Multilingual Neural Semantic Parsing for Low-Resourced Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Inducing Language-Agnostic Multilingual Representations ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Denoising Word Embeddings by Averaging in a Shared Space ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Consistency improvement with a feedback recommendation in personalized linguistic group decision making
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Evaluating a Joint Training Approach for Learning Cross-lingual Embeddings with Sub-word Information without Parallel Corpora on Lower-resource Languages ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Can Transformer Langauge Models Predict Psychometric Properties? ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Braiding Language (by Computer): Lushootseed Grammar Engineering
|
|
|
|
Abstract:
Thesis (Ph.D.)--University of Washington, 2019 ; This dissertation describes the beginnings of the t̕əbšucid project. t̕əbšucid, literally the braiding of language, is a way to refer to "grammar" in Lushootseed (also known as Puget Salish, ISO-639-3:lut). The t̕əbšucid project has three overlapping goals: (1) to advance linguistic science via grammar engineering methods with a specific focus on Lushootseed; (2) to package and distribute linguistic results in a way which is useful for people involved in the development of language-related applications which may serve a role in the documentation and revitalization of endangered languages; (3) to highlight the inherent value of the traditional language of the Puget Sound. To those ends, I began to build a system which could process Lushootseed texts. I implemented an initial morphophonological analyzer which can map Lushootseed orthography to a regularized morphophonemic representation. This representation is one which can serve as input to a syntactico-semantic grammar which provides semantic analyses for input sentences. I then implemented an initial syntactico-semantic grammar which maps the morpheme-regularized form to sentence-level semantics, via an explicit syntactic representation. In doing so, I address the first and third motivations listed above by presenting a series of case studies which emerged from the initial implementation work. In connection to the second motivation, this dissertation describes both the overall architecture and the context of the system, in the hope that the work I've carried out to-date can be is something which can be built upon by others.
|
|
Keyword:
Computational Linguistics; Grammar Engineering; Linguistics; Lushootseed; Morphology; Semantics; Syntax
|
|
URL: http://hdl.handle.net/1773/45218
|
|
BASE
|
|
Hide details
|
|
|
|