Title
A memory-based approach to Kïkamba named entity recognition
Author
Faculty/Department
Faculty of Arts. Linguistics and Literature
Publication type
conferenceObject
Publication
S.l. , [*]
Subject
Computer. Automation
Linguistics
Source (book)
Proceedings of the Conference on Human Language Technology for Development
Carrier
E
Target language
English (eng)
Affiliation
University of Antwerp
Abstract
This paper describes the development of a data-driven part-of-speech tagger and named entity recognizer for the resource-scarce Bantu language of Kıkamba. A small webmined corpus for Kıkamba was manually annotated for both classification tasks and used as training material for a memory-based tagger. The encouraging experimental results show that basic language technology tools can be developed using limit amounts of data and state-of-the-art language-independent machine learning techniques.
Handle