Page: 1 2 3 4 5 6 7 8 9... 11
81 |
Word Sense Disambiguation with GermaNet ... : Disambiguierung von Wortbedeutungen mit GermaNet ...
|
|
|
|
BASE
|
|
Show details
|
|
82 |
Anaphora resolution in L1 English-L2 Spanish: evidence from the CEDEL2 corpus
|
|
|
|
BASE
|
|
Show details
|
|
83 |
Temporal reference: empirical and theoretical perspectives: converging evidence from English and Romance
|
|
|
|
BASE
|
|
Show details
|
|
84 |
Awareness of Academic Adjective-Noun Collocations in Theory and Pedagogy: A Corpus and List-based Study
|
|
|
|
BASE
|
|
Show details
|
|
85 |
Corpus-based study of the use of English general extenders spoken by Japanese users of English across speaking proficiency levels and task types
|
|
|
|
BASE
|
|
Show details
|
|
87 |
Extração multilíngue de termos multipalavra em corpora comparáveis
|
|
|
|
BASE
|
|
Show details
|
|
88 |
Extração multilíngue de termos multipalavra em corpora comparáveis
|
|
|
|
Abstract:
Este trabalho investiga técnicas de extração de termos multipalavra a partir de corpora comparáveis, que são conjuntos de textos em duas (ou mais) línguas sobre o mesmo domínio. A extração de termos, especialmente termos multipalavra é muito importante para auxiliar a criação de terminologias, ontologias e o aperfeiçoamento de tradutores automáticos. Neste trabalho utilizamos um corpus comparável português/inglês e queremos encontrar termos e seus equivalentes em ambas as línguas. Para isso começamos com a extração dos termos separadamente em cada língua, utilizando padrões morfossintáticos para identificar os n-gramas (sequências de n palavras) mais prováveis de serem termos importantes para o domínio. A partir dos termos de cada língua, utilizamos o contexto, isto é, as palavras que ocorrem no entorno dos termos para comparar os termos das diferentes línguas e encontrar os equivalentes bilíngues. Tínhamos como objetivos principais neste trabalho fazer a identificação monolíngue de termos, aplicar as técnicas de alinhamento para o português e avaliar os diferentes parâmetros de tamanho e tipo (PoS utilizados) de janela para a extração de contexto. Esse é o primeiro trabalho a aplicar essa metodologia para o Português e apesar da falta de alguns recursos léxicos e computacionais (como dicionários bilíngues e parsers) para essa língua, conseguimos alcançar resultados comparáveis com o estado da arte para trabalhos em Francês/Inglês. ; This work investigates techniques for multiword term extraction from comparable corpora, which are sets of texts in two (or more) languages about the same topic. Term extraction, specially multiword terms is very important to help the creation of terminologies, ontologies and the improvement of machine translation. In this work we use a comparable corpora Portuguese/ English and want to find terms and their equivalents in both languages. To do this we start with separate term extraction for each language. Using morphossintatic patterns to identify n-grams (sequences of n words) most likely to be important terms of the domain. From the terms of each language, we use their context, i. e., the words that occurr around the term to compare the terms of different languages and to find the bilingual equivalents. We had as main goals in this work identificate monolingual terms, apply alignment techniques for Portuguese and evaluate the different parameters of size and type (used PoS) of window to the context extraction. This is the first work to apply this methodology to Portuguese and in spite of the lack of lexical and computational resources (like bilingual dictionaries and parsers) for this language, we achieved results comparable to state of the art in French/English.
|
|
Keyword:
Comparable corpora alignment; Corpus; Língua portuguesa; Multilingual alignment; Natural language processing; Processamento : Linguagem natural; Term extraction
|
|
URL: http://hdl.handle.net/10183/118257
|
|
BASE
|
|
Hide details
|
|
90 |
Word Sense Disambiguation with GermaNet ; Disambiguierung von Wortbedeutungen mit GermaNet
|
|
|
|
BASE
|
|
Show details
|
|
92 |
The Oxford handbook of corpus phonology
|
|
|
|
MPI-SHH Linguistik
|
|
Show details
|
|
93 |
Contrastive register variation : a quantitative approach to the comparison of English and German
|
|
|
|
MPI-SHH Linguistik
|
|
Show details
|
|
94 |
Modes de conceptualisation métaphoriques et théorie de l’évolution : analyse textuelle et traduction
|
|
|
|
BASE
|
|
Show details
|
|
96 |
Narrowing the gap between termbases and corpora in commercial environments ; Shang yong shu yu ku he yu liao ku zhi jian de xie tiao wen ti yan jiu ; 商用術語庫和語料庫之間的協調問題研究
|
|
|
|
BASE
|
|
Show details
|
|
97 |
: Lexical Alignment from comparable corpora : neoclassical compounds and relational adjectives ; Alignement lexical en corpus comparables : le cas des composés savants et des adjectifs relationnels
|
|
|
|
In: https://tel.archives-ouvertes.fr/tel-00949025 ; Traitement du texte et du document. Université de Nantes, 2014. Français (2014)
|
|
BASE
|
|
Show details
|
|
98 |
Corpus parallèles, corpus comparables : quels contrastes ?
|
|
|
|
In: https://hal.archives-ouvertes.fr/tel-01184585 ; Informatique et langage [cs.CL]. Université de Poitiers, 2014 (2014)
|
|
BASE
|
|
Show details
|
|
99 |
Corpus-based recognition of relations between named entities ; Acquisition de relations entre entités nommées à partir de corpus
|
|
|
|
In: https://tel.archives-ouvertes.fr/tel-01124260 ; Ordinateur et société [cs.CY]. Institut National des Langues et Civilisations Orientales- INALCO PARIS - LANGUES O', 2014. Français. ⟨NNT : 2014INAL0008⟩ (2014)
|
|
BASE
|
|
Show details
|
|
100 |
Corpus-based recognition of relations between named entities ; Acquisition de relations entre entités nommées à partir de corpus
|
|
|
|
In: https://tel.archives-ouvertes.fr/tel-01124260 ; Ordinateur et société [cs.CY]. Institut National des Langues et Civilisations Orientales- INALCO PARIS - LANGUES O', 2014. Français. ⟨NNT : 2014INAL0008⟩ (2014)
|
|
BASE
|
|
Show details
|
|
Page: 1 2 3 4 5 6 7 8 9... 11
|
|