Abstract
For a highly subjective task such as recognising speaker intention and argumentation, the traditional way of generating gold standards is to aggregate a number of labels into a single one. However, this seriously neglects the underlying richness that characterises discourse and argumentation and is also, in some cases, straightforwardly impossible. In this paper, we present QT30nonaggr, the first corpus of non-aggregated argument annotation. QT30nonaggr encompasses 10% of QT30, the largest corpus of dialogical argumentation and analysed broadcast political debate currently available with 30 episodes of BBC’s ‘Question Time’ from 2020 and 2021. Based on a systematic and detailed investigation of annotation judgements across all steps of the annotation process, we structure the disagreement space with a taxonomy of the types of label disagreements in argument annotation, identifying the categories of annotation errors, fuzziness and ambiguity.
Original language | English |
---|---|
Title of host publication | Proceedings of the 1st Workshop on Perspectivist Approaches to NLP @LREC2022 |
Editors | Gavin Abercrombie, Valerio Basile, Sara Tonelli, Verena Rieser, Alexandra Uma |
Place of Publication | Paris |
Publisher | European Language Resources Association (ELRA) |
Pages | 1-9 |
Number of pages | 9 |
ISBN (Electronic) | 9791095546986 |
Publication status | Published - 2022 |
Event | 1st Workshop on Perspectivist Approaches to NLP - Marseille, France Duration: 20 Jun 2022 → 20 Jun 2022 |
Conference
Conference | 1st Workshop on Perspectivist Approaches to NLP |
---|---|
Abbreviated title | NLPerspectives 2022 |
Country/Territory | France |
City | Marseille |
Period | 20/06/22 → 20/06/22 |
Keywords
- argumentation and conflict
- broadcast political debate
- Inference Anchoring Theory
- Question Time
ASJC Scopus subject areas
- Information Systems and Management
- Information Systems
- Computer Science Applications