Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4 5 6...168

Hits 21 – 40 of 3.344

21	Modeling Sense Structure in Word Usage Graphs with the Weighted Stochastic Block Model ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; Castaneda, Enrique; Kuhn, Jonas. - : Underline Science Inc., 2021
	BASE
	Show details

22	InFillmore: Frame-Guided Language Generation with Bidirectional Context ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; Belyy, Anton; Ou, Jiefu. - : Underline Science Inc., 2021
	BASE
	Show details

23	LINGUISTATICAL STUDY OF CHINESE INTERNET LEXICOLOGY ...
	Mukhamedjanova Sh.B.. - : Zenodo, 2021
	BASE
	Show details

24	LINGUISTATICAL STUDY OF CHINESE INTERNET LEXICOLOGY ...
	Mukhamedjanova Sh.B.. - : Zenodo, 2021
	BASE
	Show details

25	Warum wir so wenig über die Sprachen in Deutschland wissen: Spracheinstellungen als Erkenntnisbarriere
	Adler, Astrid; Ribeiro Silveira, Maria
	In: Diskurs Kindheits- und Jugendforschung / Discourse. Journal of Childhood and Adolescence Research ; 16 ; 4 ; 403-419 ; Perspektiven von Kindern und Jugendlichen auf sprachliche Diversität und Sprachbildungsprozesse (2021)
	BASE
	Show details

26	Essays on Representation Learning for Political Science Research
	Wu, Patrick. - 2021
	BASE
	Show details

27	Vocabulário escrito de estudantes de escolas públicas do Rio Grande do Sul : um estudo léxico-estatístico
	Silva, Bruna Rodrigues da. - 2021
	BASE
	Show details

28	Languages in space and time : models and methods from complex systems theory
	Patriarca, Marco; Léonard, Jean-Léo; Heinsalu, Els. - Cambridge, United Kingdom : Cambridge University Press, 2020
	BLLDB
	UB Frankfurt Linguistik
	Show details

29	Data collection research methods in applied linguistics
	Rose, Heath; McKinley, Jim; Baffoe-Djan, Jessica Briggs. - Sydney : Bloomsbury Academic, 2020
	BLLDB
	UB Frankfurt Linguistik
	Show details

30	Statistics for linguists : an introduction using R
	Winter, Bodo. - New York [u.a.] : Routledge, 2020
	Leibniz-Zentrum Allgemeine Sprachwissenschaft
	Show details

31	Beke Hansen: Corpus linguistics and sociolinguistics. Leiden: Brill/Rodopi, 2018
	Zuppardi, Maria Carolina
	In: Corpora. - Edinburgh : Univ. Press 15 (2020) 1, 121-124
	BLLDB
	Show details

32	Caractériser un texte en français : les passages-clés des niveaux A1 et A2 du CECRL.
	Ruggia, Simona
	In: Actes des JADT 2020 ; JADT 2020 15èmes Journées internationales d’Analyse statistique des Données Textuelles ; https://hal.archives-ouvertes.fr/hal-02430322 ; JADT 2020 15èmes Journées internationales d’Analyse statistique des Données Textuelles, Jun 2020, Toulouse, France. 11 p ; https://jadt2020.sciencesconf.org/ (2020)
	BASE
	Show details

33	Identifying Language and Cognitive Profiles in Children With ASD via a Cluster Analysis Exploration: Implications for the New ICD-11
	Silleresi, Silvia; Prevost, Philippe; Zebib, Racha...
	In: ISSN: 1939-3806 ; EISSN: 1939-3806 ; Autism Research ; https://hal.archives-ouvertes.fr/hal-02880841 ; Autism Research, International Society for Autism Research, Wiley Periodicals, Inc., 2020, ⟨10.1002/aur.2268⟩ (2020)
	BASE
	Show details

34	NTeALan Dictionaries Platforms: An Example Of Collaboration-Based Model
	MBONING TCHIAZE, Elvis; Baleba, Daniel; Marc Bassahak, Jean...
	In: Proceedings of the 1st International Workshop on Language Technology Platforms (IWLTP 2020) ; https://hal.archives-ouvertes.fr/hal-02701912 ; Proceedings of the 1st International Workshop on Language Technology Platforms (IWLTP 2020), 2020, pp.11 - 16 (2020)
	BASE
	Show details

35	Standard-based Lexical Models for Automatically Structured Dictionaries ; Modèles lexicaux standardisés pour les dictionnaires à structure automatique
	Khemakhem, Mohamed. - : HAL CCSD, 2020
	In: https://tel.archives-ouvertes.fr/tel-03153438 ; Computation and Language [cs.CL]. Université de Paris, 2020. English (2020)
	BASE
	Show details

36	Question Answering with Hybrid Data and Models ; Question-réponse utilisant des données et modèles hybrides
	Ramachandra Rao, Sanjay Kamath. - : HAL CCSD, 2020
	In: https://tel.archives-ouvertes.fr/tel-02890467 ; Document and Text Processing. Université Paris-Saclay, 2020. English. ⟨NNT : 2020UPASS024⟩ (2020)
	Abstract: Question Answering is a discipline which lies in between natural language processing and information retrieval domains. Emergence of deep learning approaches in several fields of research such as computer vision, natural language processing, speech recognition etc. has led to the rise of end-to-end models.In the context of GoASQ project, we investigate, compare and combine different approaches for answering questions formulated in natural language over textual data on open domain and biomedical domain data. The thesis work mainly focuses on 1) Building models for small scale and large scale datasets, and 2) Leveraging structured and semantic information into question answering models. Hybrid data in our research context is fusion of knowledge from free text, ontologies, entity information etc. applied towards free text question answering.The current state-of-the-art models for question answering use deep learning based models. In order to facilitate using them on small scale datasets on closed domain data, we propose to use domain adaptation. We model the BIOASQ biomedical question answering task dataset into two different QA task models and show how the Open Domain Question Answering task suits better than the Reading Comprehension task by comparing experimental results. We pre-train the Reading Comprehension model with different datasets to show the variability in performance when these models are adapted to biomedical domain. We find that using one particular dataset (SQUAD v2.0 dataset) for pre-training performs the best on single dataset pre-training and a combination of four Reading Comprehension datasets performed the best towards the biomedical domain adaptation. We perform some of the above experiments using large scale pre-trained language models like BERT which are fine-tuned to the question answering task. The performance varies based on the type of data used to pre-train BERT. For BERT pre-training on the language modelling task, we find the biomedical data trained BIOBERT to be the best choice for biomedical QA.Since deep learning models tend to function in an end-to-end fashion, semantic and structured information coming from expert annotated information sources are not explicitly used. We highlight the necessity for using Lexical and Expected Answer Types in open domain and biomedical domain question answering by performing several verification experiments. These types are used to highlight entities in two QA tasks which shows improvements while using entity embeddings based on the answer type annotations. We manually annotated an answer variant dataset for BIOASQ and show the importance of learning a QA model with answer variants present in the paragraphs.Our hypothesis is that the results obtained from deep learning models can further be improved using semantic features and collective features from different paragraphs for a question. We propose to use ranking models based on binary classification methods to better rank Top-1 prediction among Top-K predictions using these features, leading to an hybrid model that outperforms state-of-art-results on several datasets. We experiment with several overall Open Domain Question Answering models on QA sub-task datasets built for Reading Comprehension and Answer Sentence Selection tasks. We show the difference in performance when these are modelled as overall QA task and highlight the wide gap in building end-to-end models for overall question answering task. ; La recherche de réponses à des questions relève de deux disciplines : le traitement du langage naturel et la recherche d’information. L’émergence de l’apprentissage profond dans plusieurs domaines de recherche tels que la vision par ordinateur, le traitement du langage naturel etc. a conduit à l’émergence de modèles de bout en bout. Les travaux actuels de l’état de l’art en question-réponse (QR) visent à mettre en oeuvre de tels modèles. Dans le cadre du projet GoASQ, l’objectif est d’étudier, comparer et combiner différentes approches pour répondre à des questions formulées en langage naturel sur des données textuelles, en domaine ouvert et en domaine biomédical. Ce travail se concentre principalement sur 1) la construction de modèles permettant de traiter des ensembles de données à petite et à grande échelle, et 2) l’exploitation de connaissances sémantiques pour répondre aux questions par leur intégration dans les différents modèles. Nous visons à fusionner des connaissances issues de textes libres, d’ontologies, de représentations d’entités, etc. Afin de faciliter l’utilisation des modèles neuronaux sur des données de domaine de spécialité, nous nous plaçons dans le cadre de l’adaptation de domaine. Nous avons proposé deux modèles de tâches de QR différents, évalués sur la tâche BIOASQ de réponse à des questions biomédicales. Nous montrons par nos résultats expérimentaux que le modèle de QR ouvert convient mieux qu’une modélisation de type Compréhension machine. Nous pré-entrainons le modèle de Compréhension machine, qui sert de base à notre modèle, sur différents ensembles de données pour montrer la variabilité des performances. Nous constatons que l’utilisation d’un ensemble de données particulier pour le pré-entraînement donne les meilleurs résultats lors du test et qu’une combinaison de quatre jeux de données donne les meilleurs résultats lors de l’adaptation au domaine biomédical. Nous avons testé des modèles de langage à grande échelle, comme BERT, qui sont adaptés à la tâche de réponse aux questions. Les performances varient en fonction du type des données utilisées pour pré-entrainer BERT. Ainsi, le modèle de langue appris sur des données biomédicales, BIOBERT, constitue le meilleur choix pour le QR biomédical. Les modèles d’apprentissage profond visent à fonctionner de bout en bout. Les informations sémantiques provenant de sources de connaissances construites par des experts n’y sont généralement pas introduites. Nous avons annoté manuellement et automatiquement un jeu de données par les variantes des réponses de BIOASQ et montré l’importance d’apprendre un modèle de QR avec ces variantes. Ces types sont ensuite utilisés pour mettre en évidence les entités dans les jeux de données, ce qui montre des améliorations sur l’état de l’art. Par ailleurs l’exploitation de représentations vectorielles d’entités dans les modèles se montre positif pour le domaine ouvert. Nous faisons l’hypothèse que les résultats obtenus à partir de modèles d’apprentissage profond peuvent être encore améliorés en utilisant des traits sémantiques et des traits collectifs calculés à partir des différents paragraphes sélectionnés pour répondre à une question. Nous utilisons des modèles de classification binaires pour améliorer la prédiction de la réponse parmi les K candidats à l’aide de ces caractéristiques, conduisant à un modèle hybride qui surpasse les résultats de l’état de l’art. Enfin, nous avons évalué des modèles de QR ouvert sur des ensembles de données construits pour les tâches de Compréhension machine et Sélection de phrases. Nous montrons la différence de performance lorsque la tâche à résoudre est une tâche de QR ouverte et soulignons le fossé important qu’il reste à franchir dans la construction de modèles de bout en bout pour la tâche complète de réponse aux questions.
	Keyword: [INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]; [INFO.INFO-NE]Computer Science [cs]/Neural and Evolutionary Computing [cs.NE]; [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; [SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM]; [STAT.ML]Statistics [stat]/Machine Learning [stat.ML]; Apprentissage automatique; Machine Learning; Natural Language Processing; Question Answering; Question-réponse; Traitement langage naturel
	URL: https://tel.archives-ouvertes.fr/tel-02890467/file/79996_RAMACHANDRA_RAO_2020_archivage.pdf https://tel.archives-ouvertes.fr/tel-02890467/document https://tel.archives-ouvertes.fr/tel-02890467
	BASE
	Hide details

37	It’s complicated! ; It’s complicated!: On Natural Language Processing Tools and Digital Humanities
	Poibeau, Thierry
	In: “Tool Criticism 3.0” Workshop ; https://hal.archives-ouvertes.fr/hal-03084644 ; “Tool Criticism 3.0” Workshop, Jul 2020, Online (due to COVID, initially planned in Ottawa), Canada (2020)
	BASE
	Show details

38	Caractérisation de registres de langue par extraction de motifs séquentiels émergents
	Mekki, Jade; Béchet, Nicolas; Battistelli, Delphine...
	In: JADT 2020 : 15èmes Journées Internationales d'Analyse statistique des Données Textuelles ; https://hal.archives-ouvertes.fr/hal-03078450 ; JADT 2020 : 15èmes Journées Internationales d'Analyse statistique des Données Textuelles, Jun 2020, Toulouse, France (2020)
	BASE
	Show details

39	OFROM, corpus oral de français de Suisse romande: une ressource pour la recherche ... et pour l'enseignement/apprentissage du français
	Johnsen, Laure Anne
	In: Babylonia. - Comano : Fondazione Lingue e Culture (2020) 1, 44-53
	BLLDB
	Show details

40	Categorizing Languages and Speakers: Processes of Erasure in Data Treatment and Presentation
	Busch, Brigitta. - : Groupe de recherche diversité urbaine, 2020. : Érudit, 2020
	BASE
	Show details

Page: 1 2 3 4 5 6...168

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern