DE eng

Search in the Catalogues and Directories

Hits 1 – 8 of 8

1
BiSECT: Learning to Split and Rephrase Sentences with Bitexts ...
Abstract: An important task in NLP applications such as sentence simplification is the ability to take a long, complex sentence and split it into shorter sentences, rephrasing as necessary. We introduce a novel dataset and a new model for this `split and rephrase' task. Our BiSECT training data consists of 1 million long English sentences paired with shorter, meaning-equivalent English sentences. We obtain these by extracting 1-2 sentence alignments in bilingual parallel corpora and then using machine translation to convert both sides of the corpus into the same language. BiSECT contains higher quality training examples than previous Split and Rephrase corpora, with sentence splits that require more significant modifications. We categorize examples in our corpus, and use these categories in a novel model that allows us to target specific regions of the input sentence to be split and edited. Moreover, we show that models trained on BiSECT can perform a wider variety of split operations and improve upon previous ... : 9 pages, 9 figures. Long paper to appear in Empirical Methods in Natural Language Processing 2021 (EMNLP 2021) ...
Keyword: Computation and Language cs.CL; FOS Computer and information sciences
URL: https://arxiv.org/abs/2109.05006
https://dx.doi.org/10.48550/arxiv.2109.05006
BASE
Hide details
2
BiSECT: Learning to Split and Rephrase Sentences with Bitexts ...
BASE
Show details
3
Natural language processing methods are sensitive to sub-clinical linguistic differences in schizophrenia spectrum disorders
In: NPJ Schizophr (2021)
BASE
Show details
4
Towards a Practically Useful Text Simplification System
In: Dissertations available from ProQuest (2021)
BASE
Show details
5
Complexity-Weighted Loss and Diverse Reranking for Sentence Simplification ...
BASE
Show details
6
Comparison of Diverse Decoding Methods from Conditional Language Models ...
BASE
Show details
7
Learning translations via images with a massively multilingual image dataset
Callison-Burch, Chris; Wijaya, Derry; Kriz, Reno. - : Association for Computational Linguistics, 2018
BASE
Show details
8
Simplification Using Paraphrases and Context-Based Lexical Substitution
In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies ; https://hal.archives-ouvertes.fr/hal-01838519 ; Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Association for Computational Linguistics, Jun 2018, Nouvelle Orléans, United States (2018)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
8
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern