TwiSty is a corpus developed for research in author profiling. It contains personality (MBTI) and gender annotations for a total of 18,168 authors spanning six languages. We distribute the Twitter IDs of these authors, along with the IDs of their tweets that were available at the time of corpus development. The tweets have undergone language identification and are split into two categories: Confirmed (identified as written in the language in which the author is situated) and Other.
TwiSty : a multilingual Twitter Stylometry corpus for gender and personality profiling / Verhoeven, Ben. - Portorož, 2016
CLiPS Research Group, University of Antwerp
Faculty of Arts. Linguistics
Centre for Computational Linguistics and Psycholinguistics (CLiPS)
Deep linguistic features for computational stylometry.
Automatic Monitoring for Cyberspace Applications (AMiCA).