1 |
Data and R codes for Exploring diachronic salience of emotion metaphors: A contrastive study of HAPPINESS metaphors in Classical Malay and Indonesian ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Constructional equivalence in the Indonesian translations of ROB and STEAL ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Le particelle razve e neuželi alla luce del Corpus parallelo russo-italiano
|
|
Noseda, Valentina (orcid:0000-0002-5148-1241). - : Aracne, 2021. : country:ITA, 2021. : place:Roma, 2021
|
|
BASE
|
|
Show details
|
|
8 |
ERRATAS database of editorial principles and practices in printed editions of historical correspondence ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
ERRATAS database of editorial principles and practices in printed editions of historical correspondence ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Editions and other sources used in the Corpora of Early English Correspondence (CEEC-400) ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Editions and other sources used in the Corpora of Early English Correspondence (CEEC-400) ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
ПРИМЕНЕНИЕ КОРПУСНЫХ МЕТОДОВ ДЛЯ ЛЕКСИЧЕСКОГО АНАЛИЗА СПЕЦИАЛИЗИРОВАННЫХ КОРПУСОВ ... : APPLICATION OF CORPUS METHODS FOR LEXICAL ANALYSIS OF SPECIALIZED CORPORA ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
The Creation and Complexity Analysis of a Corpus of Educational Materials in Irish (EduGA) ; n/a
|
|
|
|
BASE
|
|
Show details
|
|
14 |
The New Chinese Corpus of Literary Texts Litchi
|
|
|
|
In: Acta Linguistica Asiatica, Vol 10, Iss 2 (2020) (2020)
|
|
BASE
|
|
Show details
|
|
15 |
Size of corpora and collocations: The case of Russian
|
|
|
|
In: Slovenščina 2.0: Empirične, aplikativne in interdisciplinarne raziskave, Vol 8, Iss 2 (2020) (2020)
|
|
Abstract:
With the arrival of information technologies to linguistics, compiling a large corpus of data, and of web texts in particular, has now become a mere technical matter. These new opportunities have revived the question of corpus volume that can be formulated in the following way: are larger corpora better for linguistic research or, more precisely, do lexicographers need to analyze bigger amounts of collocations? The paper deals with experiments on collocation identification in low-frequency lexis using corpora of different volumes (1 million, 10 million, 100 million and 1.2 billion words). We have selected low-frequency adjectives, nouns and verbs in the Russian Frequency Dictionary and tested the following hypotheses: 1) collocations in low-frequency lexis are better represented by larger corpora; 2) frequent collocations presented in dictionaries have low occurrences in small corpora; 3) statistical measures for collocation extraction behave differently in corpora of different volumes. The results prove the fact that corpora of under 100 M are not representative enough to study collocations, especially those with nouns and verbs. MI and Dice tend to extract less reliable collocations as the corpus volume extends, whereas t-score and Fisher’s exact test demonstrate better results for larger corpora.
|
|
Keyword:
Collocations; corpus linguistics; corpus size; P1-1091; Philology. Linguistics; Russian corpora; statistical measures
|
|
URL: https://doi.org/10.4312/slo2.0.2020.2.58-77 https://doaj.org/article/64f65f09e6c54299b252a572340a23b3
|
|
BASE
|
|
Hide details
|
|
16 |
ТЕХНОЛОГИЯ ПАРАЛЛЕЛЬНЫХ КОРПУСОВ ТЕКСТОВ И ЕЕ ИСПОЛЬЗОВАНИЕ В ПРОЦЕССЕ ОБУЧЕНИЯ ПЕРЕВОДУ ... : THE TECHNOLOGY OF PARALLEL TEXT CORPORA AND ITS USE IN THE PROCESS OF TRANSLATION TRAINING ...
|
|
Авдеев А.А.. - : НАУЧНЫЙ ЖУРНАЛ СОВРЕМЕННЫЕ ЛИНГВИСТИЧЕСКИЕ И МЕТОДИКО-ДИДАКТИЧЕСКИЕ ИССЛЕДОВАНИЯ, 2019
|
|
BASE
|
|
Show details
|
|
17 |
From a Smoking Gun to Spent Fuel: Principled Subsampling Methods for Building Big Language Data Corpora from Monitor Corpora
|
|
|
|
In: Data ; Volume 4 ; Issue 2 (2019)
|
|
BASE
|
|
Show details
|
|
18 |
To be or not to be .? Analysis of misuses with be in a learner corpus in an online environment ; ¿Ser o no ser.? Análisis de errores con to be en un corpus de estudiantes en un entorno online
|
|
|
|
BASE
|
|
Show details
|
|
19 |
THE ROLE OF ELECTRONIC CORPORA IN TRANSLATION TRAINING
|
|
|
|
In: Studies in Linguistics, Culture, and FLT, Vol 7, Pp 48-58 (2019) (2019)
|
|
BASE
|
|
Show details
|
|
20 |
Eesti keele kui teise keele õpikute lausete analüüs ja selle rakendamine eri keeleoskustasemete sõnastike näitelausete automaatsel valikul
|
|
|
|
In: Eesti Rakenduslingvistika Ühingu Aastaraamat, Vol 15, Pp 99-119 (2019) (2019)
|
|
BASE
|
|
Show details
|
|
|
|