DE eng

Search in the Catalogues and Directories

Hits 1 – 1 of 1

1
Creating a frequency dictionary of spoken Hebrew : a reproducible use of technology to overcome scarcity of data
Abstract: Studies using word frequency dictionaries—on topics such as vocabulary acquisition, vocabulary load, extensive reading, and vocabulary testing—have historically centered around corpora and morphological issues specific to European languages, especially English. One of the reasons for this is the lack of resources that often plagues departments of less commonly taught languages. Corpora of spoken language are particularly difficult to obtain—the funding and time necessary often make such a project impossible. This thesis is an effort to provide some of the methodology and tools necessary for educators interested in creating frequency dictionaries for research purposes, for their own classrooms, or even for wider dissemination. In doing so, it will provide an overview of some of the key decisions that must be taken into account for such a project. Throughout this thesis, the creation process behind the Frequency Dictionary of Spoken Hebrew (FDOSH)—a list of the most common words in conversational Modern Hebrew—will be explained. The tools used to create the FDOSH, including corpus resources and customized scripts, are provided as part of a repository of supplementary materials. The goal is to make the entire dictionary-creation process as reproducible as possible while allowing for flexibility and transparency in the tools used. It does this by using well-documented open-source scripts written in an easily readable programming language, Python. Beyond providing these tools, the present project explores the theory and many of the considerations that play an important role in the creation of a frequency dictionary. These include issues such as corpus size, corpus text type, whether the list is intended for general or specialized use, word family levels, and objective criteria. Issues regarding Hebrew’s synthetic morphology and ambiguous non-vocalized writing system are also addressed. The project aims to serve as a catalyst for future research that may build upon the ideas discussed here. The development and open dissemination of tools such as these can only lead to greater cooperation among educators and researchers, to the benefit of all involved. ; Middle Eastern Studies
Keyword: Conversational language; conversational Modern Hebrew; Corpus linguistics; Frequency dictionary; Frequency dictionary creation; Frequency Dictionary of Spoken Hebrew; Frequency list; Hebrew language education; Language education; Less commonly taught languages; Modern Hebrew; Open scholarship; Open source educational materials; Open source software; Spoken language dictionary; Vocabulary acquisition
URL: https://doi.org/10.15781/T2599ZN83
http://hdl.handle.net/2152/72487
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
1
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern