1 |
Universal Segmentations 1.0 (UniSegments 1.0)
|
|
Žabokrtský, Zdeněk; Bafna, Nyati; Bodnár, Jan; Kyjánek, Lukáš; Svoboda, Emil; Ševčíková, Magda; Vidra, Jonáš; Angle, Sachi; Ansari, Ebrahim; Arkhangelskiy, Timofey; Batsuren, Khuyagbaatar; Bella, Gábor; Bertinetto, Pier Marco; Bonami, Olivier; Celata, Chiara; Daniel, Michael; Fedorenko, Alexei; Filko, Matea; Giunchiglia, Fausto; Haghdoost, Hamid; Hathout, Nabil; Khomchenkova, Irina; Khurshudyan, Victoria; Levonian, Dmitri; Litta, Eleonora; Medvedeva, Maria; Muralikrishna, S. N.; Namer, Fiammetta; Nikravesh, Mahshid; Padó, Sebastian; Passarotti, Marco; Plungian, Vladimir; Polyakov, Alexey; Potapov, Mihail; Pruthwik, Mishra; Rao B, Ashwath; Rubakov, Sergei; Samar, Husain; Sharma, Dipti Misra; Šnajder, Jan; Šojat, Krešimir; Štefanec, Vanja; Talamo, Luigi; Tribout, Delphine; Vodolazsky, Daniil; Vydrin, Arseniy; Zakirova, Aigul; Zeller, Britta. - : Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL), 2022
|
|
Abstract:
Universal Segmentations (UniSegments) is a collection of lexical resources capturing morphological segmentations harmonised into a cross-linguistically consistent annotation scheme for many languages. The annotation scheme consists of simple tab-separated columns that stores a word and its morphological segmentations, including pieces of information about the word and the segmented units, e.g., part-of-speech categories, type of morphs/morphemes etc. The current public version of the collection contains 38 harmonised segmentation datasets covering 30 different languages.
|
|
Keyword:
Armenian language; Bengali language; Catalan language; Croatian language; Czech language; English language; Erzya language; Finnish language; French language; German language; Hindi language; Hungarian language; Italian language; Kannada language; Komi-Zyrian language; Latin language; Malayalam language; Marathi language; Mari (Russia) language; Moksha language; Mongolian language; morph; morphemes; morphological dictionary; morphological segmentation; morphology; multilingual; Persian language; Polish language; Portuguese language; Russian language; segmentation; Serbo-Croatian language; Spanish language; Swedish language; Tajik language; Udmurt language; unisegments; universal segmentations; word segmentation
|
|
URL: http://hdl.handle.net/11234/1-4629
|
|
BASE
|
|
Hide details
|
|
2 |
Analyzing parser errors to improve parsing accuracy and to inform tree banking decisions
|
|
|
|
In: http://elanguage.net/journals/lilt/article/viewFile/2693/2651/ (2012)
|
|
BASE
|
|
Show details
|
|
3 |
On the Role of Morphosyntactic Features in Hindi Dependency Parsing
|
|
|
|
In: http://aclweb.org/anthology-new/W/W10/W10-1411.pdf (2010)
|
|
BASE
|
|
Show details
|
|
4 |
Developing Verb Frames for Hindi
|
|
|
|
In: http://www.lrec-conf.org/proceedings/lrec2008/pdf/491_paper.pdf (2008)
|
|
BASE
|
|
Show details
|
|
5 |
Developing Verb Frames for Hindi
|
|
|
|
In: http://researchweb.iiit.ac.in/~samar/data/verb classification.pdf (2008)
|
|
BASE
|
|
Show details
|
|
6 |
Simple preposition correspondence: a problem in English to Indian language machine translation
|
|
|
|
In: http://www.mt-archive.info/ACL-2007-Husain.pdf (2007)
|
|
BASE
|
|
Show details
|
|
7 |
The 6th Workshop on Asian Languae Resources, 2008 Towards an Annotated Corpus of Discourse Relations in Hindi
|
|
|
|
In: http://aclweb.org/anthology/I/I08/I08-7010.pdf
|
|
BASE
|
|
Show details
|
|
8 |
Simple Preposition Correspondence: A problem in English to Indian language Machine Translation
|
|
|
|
In: http://acl.ldc.upenn.edu/w/w07/W07-1608.pdf
|
|
BASE
|
|
Show details
|
|
9 |
Simple Preposition Correspondence: A problem in English to Indian language Machine Translation
|
|
|
|
In: http://www.iiit.ac.in/techreports/2007_74.pdf
|
|
BASE
|
|
Show details
|
|
10 |
Towards an Annotated Corpus of Discourse Relations in Hindi
|
|
|
|
In: http://www.cis.upenn.edu/~rjprasad/papers/prasad_etal_ijcnlp08.pdf
|
|
BASE
|
|
Show details
|
|
11 |
Towards a psycholinguistically motivated dependency grammar for Hindi
|
|
|
|
In: http://www.ling.uni-potsdam.de/%7Evasishth/pdfs/HusainBhattVasishthDepLing2013.pdf
|
|
BASE
|
|
Show details
|
|
12 |
A Graph Based Method for Building Multilingual Weakly Supervised Dependency Parsers
|
|
|
|
In: http://ltrc.iiit.ac.in/anil/papers/udep-gotal-08.pdf
|
|
BASE
|
|
Show details
|
|
13 |
Towards a psycholinguistically motivated dependency grammar for Hindi
|
|
|
|
In: http://aclweb.org/anthology/W/W13/W13-3713.pdf
|
|
BASE
|
|
Show details
|
|
14 |
Grammar Extraction from Treebanks for Hindi and Telugu
|
|
|
|
In: http://www.lrec-conf.org/proceedings/lrec2010/pdf/854_Paper.pdf
|
|
BASE
|
|
Show details
|
|
15 |
Dependency Annotation Scheme for Indian Languages
|
|
|
|
In: http://www.iiit.ac.in/techreports/2007_78.pdf
|
|
BASE
|
|
Show details
|
|
16 |
Exploring Translation Similarities for Building a Better Sentence Aligner
|
|
|
|
In: http://ltrc.iiit.ac.in/anil/papers/sen-align-iicai-07.pdf
|
|
BASE
|
|
Show details
|
|
17 |
January 2008Dependency Annotation Scheme for Indian Languages
|
|
|
|
In: http://web2py.iiit.ac.in/publications/default/download/inproceedings.pdf.a624beafd9272137.5261666979612d494a434e4c5030382e706466.pdf
|
|
BASE
|
|
Show details
|
|
18 |
2008a. Dependency annotation scheme for Indian languages
|
|
|
|
In: http://www.aclweb.org/anthology-new/I/I08/I08-2099.pdf
|
|
BASE
|
|
Show details
|
|
|
|