Publication
Title
GLAD: Groningen Lightweight Authorship Detection : Notebook for PAN at CLEF 2015
Author
Abstract
We present a simple and effective approach to authorship verifica- tion for Dutch, English, Spanish and Greek, which can be easily ported to yet other languages. We train a binary linear classifier both on the features describing known and unknown documents individually, and on the joint features comparing these two types of documents. The list of feature types includes, among others, character n-grams, the lexical overlap, visual text properties and a compression measure. We obtain competitive results that outperform the baseline and position our system among the top PAN shared task participants.
Language
English
Source (journal)
CEUR Workshop Proceedings
Source (book)
Proceedings of CLEF 2015 Labs and Workshops, Notebook Papers, CEUR Workshop
Publication
2015
Volume/pages
p. 1-12
Full text (publisher's version - intranet only)
UAntwerpen
Research group
Publication type
Subject
External links
Record
Identifier
Creation 07.11.2016
Last edited 22.08.2023
To cite this reference