Publication
Title
From neighborhood to parenthood : the advantages of dependency representation over bigrams in Brown clustering
Author
Abstract
We present an effective modification of the popular Brown et al. 1992 word clustering algorithm, using a dependency language model. By leveraging syntax-based context, resulting clusters are better when evaluated against a wordnet for Dutch. The improvements are stable across parameters such as number of clusters, minimum frequency and granularity. Further refinement is possible through dependency relation selection. Our approach achieves a desired clustering quality with less data, resulting in a decrease in cluster creation times.
Language
English
Source (book)
Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers
Publication
2014
ISBN
978-1-941643-26-6
Volume/pages
p. 1382-1391
Full text (publisher's version - intranet only)
UAntwerpen
Publication type
Subject
External links
Record
Identifier
Creation 07.11.2016
Last edited 22.08.2023
To cite this reference