1 |
Masked language models directly encode linguistic uncertainty ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Masked language models directly encode linguistic uncertainty
|
|
|
|
In: Proceedings of the Society for Computation in Linguistics (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Will it Unblend?
|
|
|
|
In: Proceedings of the Society for Computation in Linguistics (2021)
|
|
BASE
|
|
Show details
|
|
5 |
UniMorph 3.0: Universal Morphology
|
|
|
|
In: Proceedings of the 12th Language Resources and Evaluation Conference (2020)
|
|
BASE
|
|
Show details
|
|
7 |
Compositionality in distributionally acquired phonetic category representations ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Masking auditory feedback does not eliminate repetition reduction ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Masking auditory feedback does not eliminate repetition reduction ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Downstream Behavioral and Electrophysiological Consequences of Word Prediction on Recognition Memory
|
|
|
|
BASE
|
|
Show details
|
|
11 |
The world is not enough to explain lengthening of phonological competitors ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Self-priming in production: evidence for a hybrid model of syntactic priming ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Remembering you read “doctoral dissertation”: Phrase frequency effects in recall and recognition memory
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Knowing a thing is "a thing": The use of acoustic features in multiword expression extraction
|
|
|
|
Abstract:
Speakers of a language need to have complex linguistic representations for speaking, often on the level of non-literal, idiomatic expressions like black sheep. Typically, datasets of these so-called multiword expressions come from hand-crafted ontologies or lexicons, because identifying expressions like these in an unsupervised manner is still an unsolved problem in natural language processing. In this thesis I demonstrate that prosodic features, which are helpful in parsing syntax and interpreting meaning, can also be used to identify multiword expressions. To do this, I extracted noun phrases from the Buckeye corpus, which contains spontaneous spoken language, and matched these noun phrases to page titles in Wikipedia, a massive, freely available encyclopedic ontology of entities and phenomena. By incorporating prosodic features into a model that distinguishes between multiword expressions that are found in Wikipedia titles and those that are not, we see increases in classifier performance that suggests that prosodic cues can help with the automatic extraction of multiword expressions from spontaneous speech, helping models and potentially listeners decide whether something is "a thing" or not.
|
|
Keyword:
Collocations; Language models; Phrases; Speech processing
|
|
URL: http://hdl.handle.net/2142/92965
|
|
BASE
|
|
Hide details
|
|
15 |
“hotdog”, not “hot” “dog”: The phonological planning of compound words
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Hotdog not hot dog: The phonological planning of compound words
|
|
|
|
BASE
|
|
Show details
|
|
|
|