Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 20 of 20

1	Improving the Accessibility of Arabic Electronic Theses and Dissertations (ETDs) with Metadata and Classification
	Abdelrahman, Eman. - : Virginia Tech, 2021
	Abstract: Much research work has been done to extract data from scientific papers, journals, and articles. However, Electronic Theses and Dissertations (ETDs) remain an unexplored genre of data in the research fields of natural language processing and machine learning. Moreover, much of the related research involved data that is in the English language. Arabic data such as news and tweets have begun to receive some attention in the past decade. However, Arabic ETDs remain an untapped source of data despite the vast number of benefits to students and future generations of scholars. Some ways of improving the browsability and accessibility of data include data annotation, indexing, parsing, translation, and classification. Classification is essential for the searchability and management of data, which can be manual or automated. The latter is beneficial when handling growing volumes of data. There are two main roadblocks to performing automatic subject classification on Arabic ETDs. The first is the unavailability of a public corpus of Arabic ETDs. The second is the Arabic languages linguistic complexity, especially in academic documents. This research presents the Otrouha project, which aims at building a corpus of key metadata of Arabic ETDs as well as providing a methodology for their automatic subject classification. The first goal is aided by collecting data from the AskZad Digital Library. The second goal is achieved by exploring different machine learning and deep learning techniques. The experiments results show that deep learning using pretrained language models gave the highest classification performance, indicating that language models significantly contribute to natural language understanding. ; M.S. ; An Electronic Thesis or Dissertation (ETD) is an openly-accessible electronic version of a graduate students research thesis or dissertation. It documents their main research effort that has taken place and becomes available in the University Library instead of a paper copy. Over time, collections of ETDs have been gathered and made available online through different digital libraries. ETDs are a valuable source of information for scholars and researchers, as well as librarians. With the digitalization move in most Middle Eastern Universities, the need to make Arabic ETDs more accessible significantly increases as their numbers increase. One of the ways to improve their accessibility and searchability is through providing automatic classification instead of manual classification. This thesis project focuses on building a corpus of metadata of Arabic ETDs and building a framework for their automatic subject classification. This is expected to pave the way for more exploratory research on this valuable genre of data.
	Keyword: Arabic Electronic Theses and Dissertations (ETDs); Automatic Classification; Deep learning (Machine learning); Digital Libraries; Machine learning; NLP; Pretrained Language Models
	URL: http://hdl.handle.net/10919/107790
	BASE
	Hide details

2	Otrouha: A Corpus of Arabic ETDs and a Framework for Automatic Subject Classification ; The Journal of Electronic Theses and Dissertations
	Abdelrahman, Eman; Alotaibi, Fatimah; Fox, Edward A.. - 2021
	BASE
	Show details

3	Natural Language Processing Advancements By Deep Learning: A Survey ...
	Torfi, Amirsina; Shirvani, Rouzbeh A.; Keneshloo, Yaser. - : arXiv, 2020
	BASE
	Show details

4	Teaching Natural Language Processing through Big Data Text Summarization with Problem-Based Learning ; Data and Information Management
	Li, Liuqing; Geissinger, Jack H.; Ingram, William A.. - : Sciendo, 2020
	BASE
	Show details

5	A Framework for Hadoop Based Digital Libraries of Tweets
	Bock, Matthew. - : Virginia Tech, 2017
	BASE
	Show details

6	Using Dependency Parses to Augment Feature Construction for Text Mining
	Guo, Sheng. - : Virginia Tech, 2012
	BASE
	Show details

7	Natural Language Toolkit (NLTK)
	Shu, Xiaokui; Cohen, Ron. - 2010
	BASE
	Show details

8	Using Concept Maps as a Tool for Cross-Language Relevance Determination
	Richardson, W. Ryan. - : Virginia Tech, 2007
	BASE
	Show details

9	Update on the Networked Digital Library of Theses and Dissertations
	Fox, Edward A.. - : Graduate School of Library Science, University of Illinois at Urbana-Champaign, 2000
	BASE
	Show details

10	Incremental Clustering for Very Large Document Databases: Initial MARIAN Experience
	Can, Fazli; Fox, Edward A; Snavely, Cory D...
	In: Information sciences. - New York, NY : Elsevier Science Inc. 84 (1995) 1-2, 101-114
	OLC Linguistik
	Show details

11	A query language for information graphs
	Betrabet, Sangita. - : Virginia Tech, 1993
	BASE
	Show details

12	Integrated Access to a Large Medical Literature Database
	Fox, Edward A.; Koushik, Prabhakar M.; Chen, Qi-Fan. - : Department of Computer Science, Virginia Polytechnic Institute & State University, 1991
	BASE
	Show details

13	Building a Lexicon from Machine-Readable Dictionaries for Improved Information Retrieval1
	NUTTER, J. TERRY; FOX, EDWARD A.; EVENS, MARTHA W.. - : Oxford University Press, 1990
	BASE
	Show details

14	Building a Lexicon from Machine-Readable Dictionaries for Improved Information Retrieval
	Nutter, J. Terry; Fox, Edward A.; Evens, Martha W.. - : Department of Computer Science, Virginia Polytechnic Institute & State University, 1990
	BASE
	Show details

15	A More Cost Effective Algorithm for Finding Perfect Hash Functions
	Fox, Edward A.; Chen, Qi-Fan; Heath, Lenwood S.. - : Department of Computer Science, Virginia Polytechnic Institute & State University, 1988
	BASE
	Show details

16	Creation of a Prolog Fact Base from the Collins English Dictionary
	Wohlwend, Robert C.; Fox, Edward A.. - : Department of Computer Science, Virginia Polytechnic Institute & State University, 1988
	BASE
	Show details

17	Development of the CODER System: A Test-bed for Artificial Intelligence Methods in Information Retrieval
	Fox, Edward A.. - : Department of Computer Science, Virginia Polytechnic Institute & State University, 1986
	BASE
	Show details

18	Building the CODER Lexicon: The Collins English Dictionary and its Adverb Definitions
	Fox, Edward A.; Wohlwend, Robert C.; Sheldon, Phyllis R.. - 1986
	BASE
	Show details

19	Building the CODER Lexicon: The Collins English Dictionary and Its Adverb Definitions
	Fox, Edward A.; Wohlwend, Robert C.; Sheldon, Phyllis R.. - : Department of Computer Science, Virginia Polytechnic Institute & State University, 1986
	BASE
	Show details

20	A Knowledge-Based System for Composite Document Analysis and Retrieval: Design Issues in the CODER Project
	Fox, Edward A.; France, Robert K.. - : Department of Computer Science, Virginia Polytechnic Institute & State University, 1986
	BASE
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern