Publication
Title
A memory-based approach to Kĩkamba named entity recognition
Author
Abstract
This paper describes the development of a data-driven part-of-speech tagger and named entity recognizer for the resource-scarce Bantu language Kĩkamba. A small web-mined corpus for Kĩkamba was manually annotated for both classification tasks and used as training material for a memory-based tagger. The encouraging experimental results show that basic language technology tools can be developed using limited amounts of data and state-of-the-art language-independent machine learning techniques.
Language
English
Source (book)
Proceedings of the Conference on Human Language Technology for Development
Publication
S.l. : 2011
Volume/pages
p. 106-111
UAntwerpen
Faculty/Department
Research group
Publication type
Subject
Affiliation
Publications with a UAntwerp address
Record
Identification
Creation 17.03.2012
Last edited 12.09.2013