Title
From neighborhood to parenthood: the advantages of dependency representation over bigrams in Brown clustering

Author

Abstract
We present an effective modification of the popular Brown et al. (1992) word clustering algorithm that uses a dependency language model. By leveraging syntax-based context, the resulting clusters score better when evaluated against a wordnet for Dutch. The improvements are stable across parameters such as the number of clusters, minimum frequency, and granularity. Further refinement is possible through dependency relation selection. Our approach achieves a desired clustering quality with less data, which decreases cluster creation times.

Language
English

Source (book)
Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers

Publication
2014

ISBN
978-1-941643-26-6

Volume/pages
p. 1382-1391