Publication
Title
Combining instance and feature neighbors for efficient multi-label classification
Abstract
Multi-label classification problems occur naturally in many domains. For example, in text categorization the goal is to predict a set of topics for a document, and in image scene classification the goal is to assign labels to the different objects in an image. In this work we propose a combination of two variations of k-nearest neighbors (kNN): the first neighborhood is computed over instances (rows) and the second over features (columns). Instance-based kNN is inspired by user-based collaborative filtering, while feature-based kNN is inspired by item-based collaborative filtering. We then take a linear combination of the instance and feature neighbor scores and apply a single threshold to predict the set of labels. Experiments on various multi-label datasets show that our algorithm outperforms other state-of-the-art methods, such as ML-kNN, IBLR and Binary Relevance with SVM, on different evaluation metrics. In addition, our algorithm uses an inverted index during neighborhood search and scales to extreme datasets that have millions of instances, features and labels.
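The combination step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the similarity measure, the score normalization or the weighting, so the cosine similarity, the averaging, and the names `instance_scores`, `feature_scores`, `predict`, `alpha` and `threshold` are all assumptions made for illustration. The sketch also uses brute-force search over dense lists rather than the inverted index mentioned in the abstract.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def column(matrix, j):
    """Return column j of a row-major matrix as a list."""
    return [row[j] for row in matrix]


def instance_scores(X, Y, x, k):
    """Row-based neighbors (assumed form): similarity-weighted label vote
    of the k training instances most similar to x."""
    nearest = sorted(((cosine(x, xi), i) for i, xi in enumerate(X)), reverse=True)[:k]
    total = sum(s for s, _ in nearest)
    n_labels = len(Y[0])
    if total == 0:
        return [0.0] * n_labels
    return [sum(s * Y[i][l] for s, i in nearest) / total for l in range(n_labels)]


def feature_scores(X, Y, x, k):
    """Column-based neighbors (assumed form): for each label, average the
    similarity between the label column and its k most similar features
    among the features that are active (non-zero) in x."""
    active = [f for f, value in enumerate(x) if value != 0]
    scores = []
    for l in range(len(Y[0])):
        y_col = column(Y, l)
        nearest = sorted((cosine(column(X, f), y_col) for f in active), reverse=True)[:k]
        scores.append(sum(nearest) / len(nearest) if nearest else 0.0)
    return scores


def predict(X, Y, x, k=2, alpha=0.5, threshold=0.5):
    """Linear combination of both score vectors, then a single threshold.
    alpha and threshold are hypothetical defaults, not values from the paper."""
    inst = instance_scores(X, Y, x, k)
    feat = feature_scores(X, Y, x, k)
    combined = [alpha * a + (1 - alpha) * b for a, b in zip(inst, feat)]
    return {l for l, score in enumerate(combined) if score >= threshold}


# Toy data: 4 instances, 4 features, 2 labels.
X = [[1, 1, 0, 0], [1, 0, 0, 0], [0, 0, 1, 1], [0, 0, 0, 1]]
Y = [[1, 0], [1, 0], [0, 1], [0, 1]]
print(predict(X, Y, [1, 1, 0, 0]))  # {0}
```

At scale, the brute-force similarity loops would be replaced by sparse data structures; the sketch only shows how the two neighborhoods feed one thresholded score.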
Language
English
Source (book)
2017 IEEE International Conference on Data Science and Advanced Analytics (DSAA), 19-21 Oct. 2017, Tokyo, Japan
Publication
Tokyo : IEEE, 2017
ISBN
978-1-5090-5005-5 [POD]
978-1-5090-5004-8
DOI
10.1109/DSAA.2017.70
Volume/pages
p. 109-118
UAntwerpen
Project info
CalcUA as central calculation facility: supporting core facilities.
Affiliation
Publications with a UAntwerp address
Record
Creation 12.12.2018
Last edited 22.01.2024