Publication
Title
Overview of the Cross-Domain Authorship Verification Task at PAN 2020
Author
Abstract
Authorship identification remains a highly topical research problem in computational text analysis with many relevant applications in contemporary society and industry. For this edition of PAN, we focused on authorship verification, where the task is to assess whether a pair of documents has been authored by the same individual. As in previous editions, we continued to work with (English-language) fanfiction, written by non-professional authors. As a novelty, we substantially increased the size of the provided dataset to enable more data-hungry approaches. In total, thirteen systems (from ten participating teams) were submitted, which are substantially more diverse than the submissions from previous years. We provide a detailed comparison of these approaches and two generic baselines. Our findings suggest that the increased scale of the training data boosts the state of the art in the field, but we also confirm the conventional issue that the field struggles with an overreliance on topic-related information.
Language
English
Source (book)
Working notes of CLEF 2020 - Conference and Labs of the Evaluation Forum, 22-25 September, Thessaloniki, Greece
Publication
2020
Volume/pages
p. 1-14
Full text (open access)
UAntwerpen
Faculty/Department
Research group
Project info
InterStylar: A Stylometric Approach to Intertextuality in 12th century Latin Literature.
Publication type
Subject
Affiliation
Publications with a UAntwerp address
Record
Identifier
Creation 18.12.2020
Last edited 08.12.2021