1 |
Available online at www.sciencedirect.com
In: http://www.lextutor.ca/cv/crossley_cobb_mcn_13.pdf (2013)
2 |
Vector Disambiguation for Translation Extraction from Comparable Corpora. Informatica 37 (2013) 193–201
In: http://www.informatica.si/PDF/37-2/14_Apidianaki - Vector Disambiguation for Translation Ex.pdf (2013)
Abstract:
We present a new data-driven approach for enhancing the extraction of translation equivalents from comparable corpora which exploits bilingual lexico-semantic knowledge harvested from a parallel corpus. First, the bilingual lexicon obtained from word-aligning the parallel corpus replaces an external seed dictionary, making the approach knowledge-light and portable. Next, instead of using simple one-to-one mappings between the source and the target language, translation equivalents are clustered into sets of synonyms by a cross-lingual Word Sense Induction method. The obtained sense clusters enable us to expand the translation of vector features with several translation variants using a cross-lingual Word Sense Disambiguation method. Consequently, the vector features are disambiguated and translated with the translation variants included in the semantically most appropriate cluster, thus producing less noisy and richer vectors that allow for a more successful cross-lingual vector comparison than in previous methods.
Summary: In this paper we present an approach for improving the extraction of translation equivalents from comparable corpora by using an additional source of lexico-semantic knowledge extracted from a parallel corpus.
Keywords:
comparable corpora; sense clustering; word sense disambiguation
URL: http://www.informatica.si/PDF/37-2/14_Apidianaki - Vector Disambiguation for Translation Ex.pdf
URL: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.368.1191
3 |
Identifying word translations from comparable documents without a seed lexicon
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/888_Paper.pdf (2012)
4 |
Development and application of a cross-language document comparability metric
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/804_Paper.pdf (2012)
5 |
Do Web Corpora from Top-Level Domains Represent National Varieties of English?
In: http://lexicometrica.univ-paris3.fr/jadt/jadt2012/Communications/Cook,+Paul+et+al.+-+Do+web+Corpora+from+Top-Level+Domains.pdf (2012)
6 |
PaCo2: A Fully Automated tool for gathering Parallel Corpora from the Web
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/231_Paper.pdf (2012)
7 |
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/188_Paper.pdf (2012)
8 |
The Twins corpus of museum visitor questions
In: http://people.ict.usc.edu/~traum/Papers/twins-corpus.pdf (2012)
9 |
Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies
In: http://publications.idiap.ch/downloads/papers/2012/Cartoni_LREC_2012.pdf (2012)
10 |
Evaluating DBMS-based access strategies to very large multi-layer corpora
In: http://www1.ids-mannheim.de/fileadmin/gra/texte/LREC2012_final.pdf (2012)
11 |
Reference Lists for the Evaluation of Term Extraction Tools
In: http://hal.inria.fr/docs/00/81/65/66/PDF/tke2012-submission-36.pdf (2012)
12 |
A new web interface to facilitate access to corpora: development of the ASLLRP data access interface
In: http://www.bu.edu/linguistics/UG/LREC2012/LREC-dai-final.pdf (2012)
13 |
TimeBankPT: A TimeML annotated corpus of Portuguese
In: http://www.di.fc.ul.pt/~ahb/CostaBrancoLREC2012.pdf (2012)
14 |
A generic formalism to represent linguistic corpora in RDF and OWL/DL
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/915_Paper.pdf (2012)
15 |
Building a learner corpus
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/992_Paper.pdf (2012)
16 |
Pexacc: A parallel sentence mining algorithm from comparable corpora
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/382_Paper.pdf (2012)
17 |
Text Simplification Tools for Spanish
In: http://www.lrec-conf.org/proceedings/lrec2012/pdf/762_Paper.pdf (2012)
18 |
Native language detection with ‘cheap’ learner corpora
In: http://ftp.cs.toronto.edu/pub/gh/Brooke+Hirst-LRCbook-2013.pdf (2011)
19 |
Native language detection with ‘cheap’ learner corpora
In: http://ftp.cs.toronto.edu/pub/gh/Brooke+Hirst-LCR-2012-OLD.pdf (2011)
20 |
Collecting spatial information for locations in a text-to-scene conversion system
In: http://ceur-ws.org/Vol-759/paper03.pdf (2011)