1 |
Linguistic resources for paraphrase generation in Portuguese: a Lexicon-Grammar approach
|
|
|
|
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-03548861 ; Language Resources and Evaluation, Springer Verlag, 2022, ⟨10.1007/s10579-021-09561-5⟩ ; https://link.springer.com/article/10.1007/s10579-021-09561-5 (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Automatic Construction of Fine-Grained Paraphrase Corpora System Using Language Inference Model
|
|
|
|
In: Applied Sciences; Volume 12; Issue 1; Pages: 499 (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Using Alignments in Automatic Paraphrase Production to Combat Data Sparsity in Question Interpretation for a Virtual Patient Dialogue System
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Question Paraphrase Generation for Question Answering System
|
|
|
|
BASE
|
|
Show details
|
|
5 |
A Computational Approach to the Analysis and Generation of Emotion in Text
|
|
|
|
BASE
|
|
Show details
|
|
6 |
A Computational Approach to the Analysis and Generation of Emotion in Text
|
|
|
|
BASE
|
|
Show details
|
|
7 |
A Computational Approach to the Analysis and Generation of Emotion in Text ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
A Computational Approach to the Analysis and Generation of Emotion in Text
|
|
|
|
BASE
|
|
Show details
|
|
10 |
The Circle of Meaning: From Translation to Paraphrasing and Back
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Fine-Grained Linguistic Soft Constraints on Statistical Natural Language Processing Models
|
|
|
|
Abstract:
This dissertation focuses on effective combination of data-driven natural language processing (NLP) approaches with linguistic knowledge sources that are based on manual text annotation or word grouping according to semantic commonalities. I gainfully apply fine-grained linguistic soft constraints -- of syntactic or semantic nature -- on statistical NLP models, evaluated in end-to-end state-of-the-art statistical machine translation (SMT) systems. The introduction of semantic soft constraints involves intrinsic evaluation on word-pair similarity ranking tasks, extension from words to phrases, application in a novel distributional paraphrase generation technique, and an introduction of a generalized framework of which these soft semantic and syntactic constraints can be viewed as instances, and in which they can be potentially combined. Fine granularity is key in the successful combination of these soft constraints, in many cases. I show how to softly constrain SMT models by adding fine-grained weighted features, each preferring translation of only a specific syntactic constituent. Previous attempts using coarse-grained features yielded negative results. I also show how to softly constrain corpus-based semantic models of words (“distributional profiles”) to effectively create word-sense-aware models, by using semantic word grouping information found in a manually compiled thesaurus. Previous attempts, using hard constraints and resulting in aggregated, coarse-grained models, yielded lower gains. A novel paraphrase generation technique incorporating these soft semantic constraints is then also evaluated in a SMT system. This paraphrasing technique is based on the Distributional Hypothesis. The main advantage of this novel technique over current “pivoting” techniques for paraphrasing is the independence from parallel texts, which are a limited resource. The evaluation is done by augmenting translation models with paraphrase-based translation rules, where fine-grained scoring of paraphrase-based rules yields significantly higher gains. The model augmentation includes a novel semantic reinforcement component: In many cases there are alternative paths of generating a paraphrase-based translation rule. Each of these paths reinforces a dedicated score for the “goodness” of the new translation rule. This augmented score is then used as a soft constraint, in a weighted log-linear feature, letting the translation model learn how much to “trust” the paraphrase-based translation rules. The work reported here is the first to use distributional semantic similarity measures to improve performance of an end-to-end phrase-based SMT system. The unified framework for statistical NLP models with soft linguistic constraints enables, in principle, the combination of both semantic and syntactic constraints -- and potentially other constraints, too -- in a single SMT model.
|
|
Keyword:
computational linguistics; Computer Science; hybrid; Language; Linguistics; paraphrase generation; semantic distance; soft constraints; statistical machine translation
|
|
URL: http://hdl.handle.net/1903/9861
|
|
BASE
|
|
Hide details
|
|
12 |
A Symbolic Approach to Near-Deterministic Surface Realisation using Tree Adjoining Grammar
|
|
|
|
In: 45th Annual Meeting of the Association for Computational Linguistics - ACL 2007 ; https://hal.inria.fr/inria-00149366 ; 45th Annual Meeting of the Association for Computational Linguistics - ACL 2007, Jun 2007, Prague, Czech Republic. pp.328-335 (2007)
|
|
BASE
|
|
Show details
|
|
18 |
THE TEACHER MODE OF THE SENTENCE FAIRY SYSTEM: HOW TO CREATE YOUR OWN E-LEARNING WRITING LESSONS FOR GERMAN ELEMENTARY SCHOOL PUPILS
|
|
|
|
In: http://userpages.uni-koblenz.de/~harbusch/ICERI-2012.pdf
|
|
BASE
|
|
Show details
|
|
19 |
Experience
|
|
|
|
In: http://www.desilinguist.org/pdf/madnani-cv.pdf
|
|
BASE
|
|
Show details
|
|
20 |
Exploring neural paraphrasing to improve fluency of rule-based generation
|
|
|
|
BASE
|
|
Show details
|
|
|
|