49 |
Special Section on Corpora: Introduction to Part One
|
|
|
|
Abstract:
In the 1990s the empirical study of language from large bodies of recorded documents has assumed a new importance, and this was reflected by the European Commission's decision to support the project NERC, the Network of European Research Corpora, led by Antonio Zampolli. Its aim was to study the need for, and possible provision of analysed corpora for European languages. NERC's first action was to organize an International Workshop, held at Pisa in January 1992, and attended by invited scholars from Europe and North America, to gather and cross-fertilize a variety of experience and views on how to further the project's aims. Re-worked versions of some of the papers presented then, together with a small number describing related work by other scholars, are now published as a special supplement to this and the next number of Literary and Linguistic Computing . They are presented in an order which corresponds to NERC's own structure. After a general statement from José Soler of the European Commission on the importance of this field of study, this follows a spectrum of interest: from examinations of the demand for corpora (McNaught) and the administrative complications in making them available (Hockey and Walker), through analysis of the conceptual (Biber) and practical (Crowdy, Part 1) problems in selection of texts, to the issues that arise when designing (Sampson) and applying (Leech) a system of categories for annotating the language in the texts. A particular problem here is treatment of spoken texts when reduced to written form, and Ballester et al . offers a solution for Spanish, Crowdy Part 2, for English. After these studies in annotation, the focus shifts to statistical techniques for exposing the semantics of uninterpreted text, sometimes known as ‘knowledge acquisition’ (Bindi et al . and Brown et al .). Finally, this supplement contains reports from some current projects which make essential use of large corpora and their annotation categories for particular applications: designing lexicons (Antoni-Lay et al ., Khatchadourian and Modiano), multilingual text processing (Cowie et al .), and speech technology assessment (Fourcin and Gibbon).
|
|
Keyword:
Articles
|
|
URL: https://doi.org/10.1093/llc/8.4.221 http://llc.oxfordjournals.org/cgi/content/short/8/4/221
|
|
BASE
|
|
Hide details
|
|
52 |
Predictable Meaning Shift: Some Linguistic Properties of Lexical Implication Rules
|
|
|
|
In: http://acl.ldc.upenn.edu/W/W91/W91-0208.pdf (1992)
|
|
BASE
|
|
Show details
|
|
57 |
Case-linking : a theory of case and verb diathesis applied to classical Sanskrit.
|
|
|
|
BASE
|
|
Show details
|
|
58 |
Publisher: Walker Publishing Company Pages ISBN Price
|
|
|
|
In: http://www.tesl-ej.org/pdf/ej60/r4.pdf
|
|
BASE
|
|
Show details
|
|
59 |
Is Machine Translation a Cultural Threat to Anyone?
|
|
|
|
In: http://www.mt-archive.info/TMI-1999-Ostler.pdf
|
|
BASE
|
|
Show details
|
|
|
|