Title
GLAD: Groningen Lightweight Authorship Detection : Notebook for PAN at CLEF 2015
Author
Abstract
We present a simple and effective approach to authorship verification for Dutch, English, Spanish and Greek, which can be easily ported to yet other languages. We train a binary linear classifier both on the features describing known and unknown documents individually, and on the joint features comparing these two types of documents. The list of feature types includes, among others, character n-grams, the lexical overlap, visual text properties and a compression measure. We obtain competitive results that outperform the baseline and position our system among the top PAN shared task participants.
Language
English
Source (journal)
CEUR Workshop Proceedings
Source (book)
Proceedings of CLEF 2015 Labs and Workshops, Notebook Papers, CEUR Workshop
Publication
2015
Volume/pages
p. 1-12