Transfer and active learning for dissonance detection: Addressing the rare-class challenge

Jan 1, 2023ยท
Vasudha Varadarajan
,
Swanie Juhng
,
Syeda Mahwish
,
Xiaoran Liu
,
Jonah Luby
,
Christian Luhmann
,
H Andrew Schwartz
ยท 0 min read
Addressing needle-in-a-haystack problem for challenging NLP tasks
Abstract
While transformer-based systems have enabled greater accuracies with fewer training examples, data acquisition obstacles still persist for rare-class tasks – when the class label is very infrequent (e.g. < 5% of samples). Active learning has in general been proposed to alleviate such challenges, but choice of selection strategy, the criteria by which rare-class examples are chosen, has not been systematically evaluated. Further, transformers enable iterative transfer-learning approaches. We propose and investigate transfer- and active learning solutions to the rare class problem of dissonance detection through utilizing models trained on closely related tasks and the evaluation of acquisition strategies, including a proposed probability-of-rare-class (PRC) approach. We perform these experiments for a specific rare class problem of collecting language samples of cognitive dissonance from social media. We find that PRC is a simple and effective strategy to guide annotations and ultimately improve model accuracy while transfer-learning in a specific order can improve the cold-start performance of the learner but does not benefit iterations of active learning.
Type
Publication
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics