Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 19 of 19

1	How much context span is enough? Examining context-related issues for document-level MT
	Castilho, Sheila
	In: Castilho, Sheila orcid:0000-0002-8416-6555 (2022) How much context span is enough? Examining context-related issues for document-level MT. In: 13th Language Resources and Evaluation Conference, 21-23 June 2022, Marseille, France. (In Press) (2022)
	BASE
	Show details

2	DELA Corpus - A Document-Level Corpus Annotated with Context-Related Issues
	Castilho, Sheila; Cavalheiro Camargo, João Lucas; Menezes, Miguel...
	In: Castilho, Sheila orcid:0000-0002-8416-6555 , Cavalheiro Camargo, João Lucas orcid:0000-0003-3746-1225 , Menezes, Miguel and Way, Andy orcid:0000-0001-5736-5930 (2021) DELA Corpus - A Document-Level Corpus Annotated with Context-Related Issues. In: Sixth Conference on Machine Translation (WMT21), 10-11 Nov 2021, Punta Cana, Dominican Republic (Online). ISBN 978-1-954085-94-7 (2021)
	BASE
	Show details

3	Towards document-level human MT evaluation: On the Issues of annotator agreement, effort and misevaluation
	Castilho, Sheila
	In: Castilho, Sheila orcid:0000-0002-8416-6555 (2021) Towards document-level human MT evaluation: On the Issues of annotator agreement, effort and misevaluation. In: 16th Conference of the European Chapter of the Association for Computational Linguistics - EACL 2021., 19-23 April 2021, Online. (In Press) (2021)
	BASE
	Show details

4	Contextualization of Web contents through semantic enrichment from linked open data ; Contextualisation des contenus Web par l'enrichissement sémantique à partir de données
	Kumar, Amit. - : HAL CCSD, 2021
	In: https://tel.archives-ouvertes.fr/tel-03561788 ; Databases [cs.DB]. Normandie Université, 2021. English. ⟨NNT : 2021NORMC243⟩ (2021)
	Abstract: Thirty years of the Web have led to a tremendous amount of contents and the enormous growth is still ongoing, even accelerating. Thus, Web users are confronted with an abundance of information. While this is clearly beneficial, there is a risk of “information overload” and it is very hard for a Web user to access, contextualize and digest Web contents. Thus, there is an increasing need of categorizing, summarizing, and/or interpretability of Web contents in order to get a proper contextualization. While contents of the early years have been predominantly “simple” HTML documents, more recent ones have become more and more “machine-interpretable” and contribute to the ever growing Linked Open Data (LOD) cloud. LOD provides us a multitude of research opportunities for investigating and harvesting insights about Web contents.In this thesis, we investigate a variety of tasks related to semantic contextualization of Web contents. Specifically, we address three facets in the context of distillation of the Web contents, namely, entity-driven content analysis, semantic annotation & retrieval, and semantic user tracing.We hypothesize that named entities and their types present in a Web document convey substantial semantic information. We have displayed by employing multiple studies that projecting Web contents to the entity-level captures their fundamental semantics. Thus, it provides significant knowledge about the Web contents and, subsequently, comprehensibility. We report novel findings over diverse tasks in an attempt to accomplish our overall goal of a better contextualization of Web contents. ; Les trente années d'existence du Web ont donné lieu à une quantité phénoménale de contenus et cette croissance énorme se poursuit, voire s'accélère. Les utilisateurs du Web sont donc confrontés à une abondance d'informations. Bien que cela soit clairement bénéfique, il existe un risque de “surcharge d'informations” et il est très difficile pour un utilisateur du Web d'accéder, de contextualiser et de digérer les contenus du Web. Il est donc de plus en plus nécessaire pour catégoriser, de résumer et/ou d'interpréter les contenus du Web afin d'obtenir une contextualisation adéquate. Alors que les contenus des premières années étaient principalement de “simples” documents HTML, les plus récents sont devenus de plus en plus "interprétables par les machines" et contribuent au nuage de données ouvertes liées (LOD) en constante expansion. LeLOD nous offre une multitude de possibilités de recherche pour étudier et récolter des informations sur les contenus du Web.Dans cette thèse, nous étudions une variété de tâches liées à la contextualisation sémantique des contenus Web. Plus précisément, nous abordons trois facettes dans le contexte de la distillation des contenus Web, à savoir, l'analyse de contenu axée sur les entités, l'annotation et la recherche sémantiques, et le traçage sémantique des utilisateurs. Nous supposons que les entités nommées et leurs types présents dans un document Web véhiculent des informations sémantiques substantielles. Nous avons démontré, à l'aide de multiples études, que la projection des contenus Web au niveau des entités permet de capturer leur sémantique fondamentale. Ainsi, elle fournit des connaissances significatives sur le contenu du Web et, par conséquent, une meilleure compréhension. Nous présentons de nouveaux résultats sur diverses tâches dans le but d'atteindre notre objectif global d'une meilleure contextualisation des contenus Web.
	Keyword: [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB]; Analyse au niveau des entités; Classification des types d'entités; Données Web multilingues; Entity-level Analytics; Entity-type Classification; Multilingual Web Data; Représentation sémantique des documents; Représentation sémantique des utilisateurs; Semantic Document Representation; Semantic User Representation; Web Semantics
	URL: https://tel.archives-ouvertes.fr/tel-03561788 https://tel.archives-ouvertes.fr/tel-03561788/file/sygal_fusion_30387-kumar-amit_61fe9a1f4d7e4.pdf https://tel.archives-ouvertes.fr/tel-03561788/document
	BASE
	Hide details

5	A Novel Deep Learning ArCAR System for Arabic Text Recognition with Character-Level Representation
	Abdullah Y. Muaad; Mugahed A. Al-antari; Sungyoung Lee; Hanumanthappa Jayappa Davanagere
	In: Computer Sciences & Mathematics Forum; Volume 2; Issue 1; Pages: 14 (2021)
	BASE
	Show details

6	What cultural aspects should be taught in FL lessons? – A model for evaluating the cultural content in FL course-books
	Sándorová, Zuzana
	In: Acta Scientiarum. Language and Culture; Vol 43 No 2 (2021): July-Dec.; e52066 ; Acta Scientiarum. Language and Culture; v. 43 n. 2 (2021): July-Dec.; e52066 ; 1983-4683 ; 1983-4675 (2021)
	BASE
	Show details

7	Redonner du sens à l’accord interannotateurs : vers une interprétation des mesures d’accord en termes de reproductibilité de l’annotation
	Bregeon, Dany; Antoine, Jean-Yves; Villaneau, Jeanne...
	In: ISSN: 1248-9433 ; EISSN: 1965-0906 ; Revue TAL ; https://hal.archives-ouvertes.fr/hal-02375240 ; Revue TAL, ATALA (Association pour le Traitement Automatique des Langues), 2019, 60 (2), pp.23 (2019)
	BASE
	Show details

8	Εntity-level Εvent Ιmpact Αnalytics ; Analyse de l’Impact des Événements au Niveau des Entités
	Govind, Govind. - : HAL CCSD, 2019
	In: https://hal.archives-ouvertes.fr/tel-02102795 ; Document and Text Processing. Normandie Université, Unicaen, EnsiCaen, CNRS, GREYC UMR 6072, 2019. English (2019)
	BASE
	Show details

9	A CNN-BiLSTM Model for Document-Level Sentiment Analysis
	Maryem Rhanoui; Mounia Mikram; Siham Yousfi...
	In: Machine Learning and Knowledge Extraction ; Volume 1 ; Issue 3 ; Pages 48-847 (2019)
	BASE
	Show details

10	ЛЕКСИЧЕСКАЯ КАТЕГОРИЯ «ДОКУМЕНТ» В КОГНИТИВНОМ АСПЕКТЕ
	СЛАУТИНА МАРИНА ВАСИЛЬЕВНА. - : Федеральное государственное бюджетное образовательное учреждение высшего профессионального образования «Волгоградский государственный университет», 2016
	BASE
	Show details

11	Semantic Hierarchical Document Signature For Determining Sentence Similarity
	Manna, Sukanya; Gedeon, Tamas (Tom)
	In: Proceedings of the 19th international conference on Fuzzy Systems (2015)
	BASE
	Show details

12	DAnIEL, parsimonious yet high-coverage multilingual epidemic surveillance ; DAnIEL : Veille épidémiologique multilingue parcimonieuse
	Lejeune, Gaël; Brixtel, Romain; Lecluze, Charlotte...
	In: 20ème conférence du Traitement Automatique du Langage Naturel 2013 (TALN 2013) ; https://hal.archives-ouvertes.fr/hal-01074881 ; 20ème conférence du Traitement Automatique du Langage Naturel 2013 (TALN 2013), Jun 2013, Sables d'Olonne, France. p.787-788 (2013)
	BASE
	Show details

13	Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization
	Dingding Wang; Tao Li; Shenghuo Zhu...
	In: http://users.cis.fiu.edu/%7Etaoli/pub/sigir08-p307-wang.pdf (2008)
	BASE
	Show details

14	A computational effective document semantic representation
	Williams, Robert. - : IEEE, 2007
	BASE
	Show details

15	Script-Independent Text Line Segmentation in Freestyle Handwritten Documents
	Li, Yi; Zheng, Yefeng; Doermann, David...
	In: DTIC (2006)
	BASE
	Show details

16	Towards a Quantitative Theory of Variability ; Towards a Quantitative Theory of Variability: Language, brain and computation
	Blache, Philippe
	In: UG and External Systems ; https://hal.archives-ouvertes.fr/hal-00134205 ; Ana-Maria Di Sciullo. UG and External Systems, John Benjamins, pp.375-388, 2005 (2005)
	BASE
	Show details

17	Theater Strategy and the Theater Campaign Plan: Both Are Essential
	Mendel, William W.
	In: DTIC (1988)
	BASE
	Show details

18	Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization
	Dingding Wang; Tao Li; Shenghuo Zhu...
	In: http://users.cs.fiu.edu/~taoli/tenure/fp557-Wang.pdf
	BASE
	Show details

19	Handwritten Text Image Compression for Indic Script
	Smita V. Khangar; Latesh G. Malik
	In: http://research.ijcaonline.org/volume47/number5/pxc3879888.pdf
	BASE
	Show details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern