1 | On the Distribution, Sparsity, and Inference-time Quantization of Attention Values in Transformers ... | BASE
2 | Don't Let Discourse Confine Your Model: Sequence Perturbations for Improved Event Language Models ... | BASE
3 | TellMeWhy: A Dataset for Answering Why-Questions in Narratives ... | BASE
5 | On the Distribution, Sparsity, and Inference-time Quantization of Attention Values in Transformers ... | BASE
6 | Summarize-then-Answer: Generating Concise Explanations for Multi-hop Reading Comprehension ... | BASE
7 | Modeling Label Semantics for Predicting Emotional Reactions ... | BASE
8 | Residualized Factor Adaptation for Community Social Media Prediction Tasks ... | BASE
9 | The Fine Line between Linguistic Generalization and Failure in Seq2Seq-Attention Models ... | BASE
11 | Improved Document Representation for Classification Tasks for the Intelligence Community | In: School of Information Studies - Faculty Scholarship (2005) | BASE