1 |
Jibes & Delights: A Dataset of Targeted Insults and Compliments to Tackle Online Abuse ...
|
|
|
|
BASE
|
|
|
|
2 |
ViTA: Visual-Linguistic Translation by Aligning Object Tags ...
|
|
|
|
BASE
|
|
|
|
3 |
HASOCOne@FIRE-HASOC2020: Using BERT and Multilingual BERT models for Hate Speech Detection ...
|
|
|
|
BASE
|
|
|
|
4 |
Multilingual Pre-Trained Transformers and Convolutional NN Classification Models for Technical Domain Identification ...
|
|
|
|
BASE
|
|
|
|
6 |
gundapusunil at SemEval-2020 Task 9: Syntactic Semantic LSTM Architecture for SENTIment Analysis of Code-MIXed Data ...
|
|
|
|
BASE
|
|
|
|
7 |
A SentiWordNet Strategy for Curriculum Learning in Sentiment Analysis ...
|
|
|
|
BASE
|
|
|
|
8 |
Word Level Language Identification in English Telugu Code Mixed Data ...
|
|
|
|
Abstract:
In multilingual or sociolingual settings, Intra-sentential Code Switching (ICS), or Code Mixing (CM), is frequently observed nowadays. Most people in the world know more than one language. CM usage is especially apparent on social media platforms. Moreover, ICS is particularly significant in the context of technology, health, and law, where conveying upcoming developments is difficult in one's native language. In applications like dialog systems, machine translation, semantic parsing, shallow parsing, etc., CM and Code Switching pose serious challenges. Language Identification is the necessary first step for any further advancement in code-mixed data. In this paper, we present a study of various models - Naïve Bayes Classifier, Random Forest Classifier, Conditional Random Field (CRF), and Hidden Markov Model (HMM) - for Language Identification in English-Telugu Code Mixed Data. Considering the paucity of resources in code mixed languages, we proposed the CRF model and HMM model for word level ... : 7 pages, 3 figures ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2010.04482 https://arxiv.org/abs/2010.04482
|
|
BASE
|
|
|
|
9 |
A Sentiwordnet Strategy for Curriculum Learning in Sentiment Analysis
|
|
|
|
In: Natural Language Processing and Information Systems (2020)
|
|
BASE
|
|
|
|
10 |
Conversational implicatures in English dialogue: Annotated dataset ...
|
|
|
|
BASE
|
|
|
|
11 |
BCSAT : A Benchmark Corpus for Sentiment Analysis in Telugu Using Word-level Annotations ...
|
|
|
|
BASE
|
|
|
|
12 |
Automatic Target Recovery for Hindi-English Code Mixed Puns ...
|
|
|
|
BASE
|
|
|
|
13 |
Towards Automation of Sense-type Identification of Verbs in OntoSenseNet(Telugu) ...
|
|
|
|
BASE
|
|
|
|
14 |
Towards Enhancing Lexical Resource and Using Sense-annotations of OntoSenseNet for Sentiment Analysis ...
|
|
|
|
BASE
|
|
|
|
15 |
Context and Humor: Understanding Amul advertisements of India ...
|
|
|
|
BASE
|
|
|
|