Semantic reranking of CRF label sequences for verbal multiword expression identification. In: Multiword expressions at length and in depth: Extended papers from the MWE 2017 workshop
Maldonado Guerra, Alfredo; Moreau, Erwan; Vogel, Carl; Alsulaimani, Ashjan; Han, Lifeng; Chowdhury, Koel Dutta
Language Science Press, 2018
Abstract:
Verbal multiword expression (VMWE) identification can be addressed successfully as a sequence labelling problem via conditional random fields (CRFs) by returning the single label sequence with maximal probability. This work describes a system that reranks the 10 most likely CRF candidate VMWE sequences using a decision tree regression model. The reranker aims to operationalise the intuition that a non-compositional MWE can behave differently, in distributional terms, from its constituent words. It therefore uses semantic features based on comparing the context vector of a candidate expression against those of its constituent words. However, not all VMWEs are non-compositional, and analysis shows that non-semantic features also play an important role in the behaviour of the reranker. In fact, the analysis shows that combining the sequential approach of the CRF component with the context-based approach of the reranker is the main factor of improvement: our reranker achieves a 12% macro-average F1-score improvement over the basic CRF method, as measured using data from the PARSEME shared task on VMWE identification.
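The reranking idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the feature (mean cosine similarity between an expression's context vector and those of its constituent words), the hand-written `toy_score` function standing in for the trained decision tree regressor, and all data values are assumptions made for illustration only.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def compositionality(expr_vec, word_vecs):
    """Mean cosine similarity between the expression vector and each constituent vector."""
    return sum(cosine(expr_vec, w) for w in word_vecs) / len(word_vecs)


def rerank(candidates, score):
    """Return the candidates sorted by descending score."""
    return sorted(candidates, key=score, reverse=True)


# Hypothetical toy candidates: a CRF probability plus context vectors
# for the candidate expression and for its constituent words.
candidates = [
    {"label_seq": "A", "crf_prob": 0.50,
     "expr_vec": [1.0, 0.0], "word_vecs": [[1.0, 0.0], [1.0, 0.0]]},
    {"label_seq": "B", "crf_prob": 0.40,
     "expr_vec": [1.0, 0.0], "word_vecs": [[0.0, 1.0], [0.0, 1.0]]},
]


def toy_score(candidate):
    # Assumed stand-in for a trained regressor: rewards a high CRF
    # probability and a low expression/constituent similarity.
    sim = compositionality(candidate["expr_vec"], candidate["word_vecs"])
    return candidate["crf_prob"] + 0.5 * (1.0 - sim)


best = rerank(candidates, toy_score)[0]
print(best["label_seq"])
```

In this toy setting, candidate B overtakes candidate A despite its lower CRF probability, because its expression vector is dissimilar to its constituents' vectors. A real system would learn the scoring function from annotated data and would combine semantic features with non-semantic ones.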
Keyword:
ARTIFICIAL INTELLIGENCE; Computational Linguistics; Conditional random fields; DATA ANALYSIS; Digital Engagement; Digital Humanities; MACHINE LEARNING; multi-word expressions; Natural language processing; text analytics; Verbal multiword Expressions
URL:
http://langsci-press.org/catalog/view/204/1647/1302-1
https://doi.org/10.5281/zenodo.1469559
http://people.tcd.ie/vogel
http://hdl.handle.net/2262/91208
http://people.tcd.ie/maldona
http://people.tcd.ie/moreaue