DE eng

Search in the Catalogues and Directories

Hits 1 – 19 of 19

1
Multilingual and Cross-Lingual Intent Detection from Spoken Data ...
BASE
Show details
2
Multilingual and Cross-Lingual Intent Detection from Spoken Data ...
BASE
Show details
3
Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations ...
Coope, Sam; Farghly, Tyler; Gerz, Daniela. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
4
Efficient Intent Detection with Dual Sentence Encoders ...
Casanueva, Inigo; Temcinas, Tadas; Gerz, Daniela. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
5
Multidirectional Associative Optimization of Function-Specific Word Representations ...
Gerz, Daniela; Vulic, Ivan; Rei, Marek. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
6
Representation Learning beyond Semantic Similarity: Character-aware and Function-specific Approaches ...
Gerz, Daniela Susanne. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
7
On the relation between linguistic typology and (limitations of) multilingual language modeling ...
Gerz, Daniela; Vulić, I; Ponti, Edoardo. - : Apollo - University of Cambridge Repository, 2020
BASE
Show details
8
On the relation between linguistic typology and (limitations of) multilingual language modeling
Gerz, Daniela; Vulić, I; Ponti, Edoardo. - : Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018, 2020
BASE
Show details
9
Representation Learning beyond Semantic Similarity: Character-aware and Function-specific Approaches
Gerz, Daniela Susanne. - : University of Cambridge, 2020. : Theoretical and Applied Linguistics, 2020. : Lucy Cavendish College, 2020
Abstract: Representation learning is a research area within machine learning and natural language processing (NLP) concerned with building machine-understandable representations of discrete units of text. Continuous representations are at the core of modern machine learning applications, and representation learning has thereby become one of the central research areas in NLP. The induction of text representations is typically based on the distributional hypothesis, and consequently encodes general information about word similarity. Words or phrases with similar meaning obtain similar representations in a vector space constructed for this purpose. This established methodology excels for morphologically-simple languages such as English, and in data-rich settings. However, several useful lexical relations such as entailment or selectional preference, are not captured or get conflated with other relations. Another challenge is dealing with low-data regimes for morphologically-complex and under-resourced languages. In this thesis we construct novel representation learning methods that go beyond the limitations of the distributional hypothesis and investigate solutions that induce vector spaces with diverse properties. In particular, we look at how the vector space induction process influences the contained information, and how the information manifests in a number of core NLP tasks: semantic similarity, lexical entailment, selectional preference, and language modeling. We contribute novel evaluations of state-of-the-art models highlighting their current capabilities and limitations. An analysis of language modeling in 50 typologically-diverse languages demonstrates that representations can indeed pose a performance bottleneck. We introduce a novel approach to leveraging subword-level information in word representations: our solution lifts this bottleneck in low-resource scenarios. Finally, we introduce a novel paradigm of function-specific representation learning that aims to integrate fine-grained semantic relations and real-world knowledge into the word vector spaces. We hope this thesis can serve as a valuable overview on word representations, and inspire future work in modeling \textit{semantic similarity and beyond}. ; ERC Consolidator Grant LEXICAL (648909)
Keyword: multilingual; representation learning; word vector spaces
URL: https://www.repository.cam.ac.uk/handle/1810/304962
https://doi.org/10.17863/CAM.52043
BASE
Hide details
10
Multidirectional Associative Optimization of Function-Specific Word Representations
Gerz, Daniela; Vulic, Ivan; Rei, Marek. - : Association for Computational Linguistics, 2020. : 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020
BASE
Show details
11
Efficient Intent Detection with Dual Sentence Encoders
Casanueva, Inigo; Temcinas, Tadas; Gerz, Daniela. - : NLP FOR CONVERSATIONAL AI, 2020
BASE
Show details
12
Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations
Coope, Sam; Farghly, Tyler; Gerz, Daniela. - : Association for Computational Linguistics, 2020. : 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020
BASE
Show details
13
Show Some Love to Your n-grams: A Bit of Progress and Stronger n-gram Language Modeling Baselines ...
Shareghi, Ehsan; Gerz, Daniela; Vulic, Ivan. - : Apollo - University of Cambridge Repository, 2019
BASE
Show details
14
Scoring Lexical Entailment with a Supervised Directional Similarity Network ...
BASE
Show details
15
Language Modeling for Morphologically Rich Languages: Character-Aware Modeling for Word-Level Prediction ...
Gerz, Daniela; Vulić, Ivan; Ponti, Edoardo. - : Apollo - University of Cambridge Repository, 2018
BASE
Show details
16
Scoring lexical entailment with a supervised directional similarity network ...
Rei, Marek; Gerz, Daniela; Vulić, I. - : Apollo - University of Cambridge Repository, 2018
BASE
Show details
17
Language Modeling for Morphologically Rich Languages: Character-Aware Modeling for Word-Level Prediction
Gerz, Daniela; Vulić, Ivan; Ponti, Edoardo. - : MIT Press - Journals, 2018. : Transactions of the Association for Computational Linguistics, 2018
BASE
Show details
18
Scoring lexical entailment with a supervised directional similarity network
Rei, Marek; Gerz, Daniela; Vulić, I. - : Association for Computational Linguistics, 2018. : ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers), 2018
BASE
Show details
19
HyperLex: A Large-Scale Evaluation of Graded Lexical Entailment ...
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
19
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern