Page: 1 2 3 4 5 6 7 8... 137
61 |
Sample-efficient Linguistic Generalizations through Program Synthesis: Experiments with Phonology Problems ...
|
|
|
|
BASE
|
|
Show details
|
|
62 |
19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology - Part 2 ...
|
|
|
|
BASE
|
|
Show details
|
|
63 |
18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology - Part 1 ...
|
|
|
|
BASE
|
|
Show details
|
|
64 |
SpeakEasy Pronunciation Trainer: Personalized Multimodal Pronunciation Training ...
|
|
|
|
BASE
|
|
Show details
|
|
65 |
The Match-Extend Serialization Algorithm in Multiprecedence ...
|
|
|
|
BASE
|
|
Show details
|
|
66 |
Identity-Based Patterns in Deep Convolutional Networks: Generative Adversarial Phonology and Reduplication ...
|
|
|
|
BASE
|
|
Show details
|
|
67 |
Recognizing Reduplicated Forms: Finite-State Buffered Machines ...
|
|
|
|
BASE
|
|
Show details
|
|
68 |
Correcting Chinese Spelling Errors with Phonetic Pre-training ...
|
|
|
|
BASE
|
|
Show details
|
|
69 |
PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction ...
|
|
|
|
BASE
|
|
Show details
|
|
70 |
SpeakEasy Pronunciation Trainer: Personalized Multimodal Pronunciation Training ...
|
|
|
|
BASE
|
|
Show details
|
|
71 |
Data for: Psycholinguistic dataset on language use in 1145 novels published in English and Dutch ...
|
|
|
|
Abstract:
LIWC and n-gram counts of English and Dutch novels ================================================== This dataset consists of CSV files with word counts in several corpora: - 694 English language novels from different genders and orientations - 401 bestselling Dutch language novels - 50 novels nominated for Dutch literary prizes Each corpus comes with: - LIWC counts; this file also includes the available metadata for each novel. The English data was created with LIWC 2015. The Dutch data was created with the validated translation of LIWC 2001. - Word counts (unigrams) and bigram counts per novel. All text has been converted to lowercase. Contractions are tokenized into separate tokens, e.g., can't => ca n't Two restrictions are applied: - only unigrams or bigrams that occur in at least 10 texts are retained - only the 100k most frequent are retained - Overall word counts and bigram counts; i.e., the sum across all novels. All files are encoded in UTF-8. ...
|
|
Keyword:
Arts and Humanities; Computational Linguistics
|
|
URL: https://dx.doi.org/10.17632/x3m2gjkhx5.1 https://data.mendeley.com/datasets/x3m2gjkhx5/1
|
|
BASE
|
|
Hide details
|
|
72 |
Data for: Psycholinguistic dataset on language use in 1145 novels published in English and Dutch ...
|
|
|
|
BASE
|
|
Show details
|
|
73 |
Supplementary Material to ‘Evaluating NLG-frameworks for multilingual surface realization in conversational assistants’ ...
|
|
|
|
BASE
|
|
Show details
|
|
74 |
Supplementary Material to ‘Evaluating NLG-frameworks for multilingual surface realization in conversational assistants’ ...
|
|
|
|
BASE
|
|
Show details
|
|
75 |
The Unsolved Problem of Language Identification: A GMM-based Approach ...
|
|
|
|
BASE
|
|
Show details
|
|
76 |
The Unsolved Problem of Language Identification: A GMM-based Approach ...
|
|
|
|
BASE
|
|
Show details
|
|
77 |
Graph-to-Graph Translations To Augment Abstract Meaning Representation Tense And Aspect ...
|
|
|
|
BASE
|
|
Show details
|
|
78 |
A Journey in Linguistic Computing from Father Busa to Linguistic Linked Data ...
|
|
|
|
BASE
|
|
Show details
|
|
79 |
A Journey in Linguistic Computing from Father Busa to Linguistic Linked Data ...
|
|
|
|
BASE
|
|
Show details
|
|
Page: 1 2 3 4 5 6 7 8... 137
|
|