Publication
Title
HiDER : query-driven entity resolution for historical data
Author
Abstract
Entity Resolution (ER) is the task of finding references that refer to the same entity across different data sources. Cleaning a data warehouse and applying ER on it is a computationally demanding task, particularly for large data sets that change dynamically. Therefore, a query-driven approach which analyses a small subset of the entire data set and integrates the results in real-time is significantly beneficial. Here, we present an interactive tool, called HiDER, which allows for query-driven ER in large collections of uncertain dynamic historical data. The input data includes civil registers such as birth, marriage and death certificates in the form of structured data, and notarial acts such as estate tax and property transfers in the form of free text. The outputs are family networks and event timelines visualized in an integrated way. The HiDER is being used and tested at BHIC center(Brabant Historical Information Center, https://www.bhic.nl); despite the uncertainties of the BHIC input data, the extracted entities have high certainty and are enriched by extra information.
Language
English
Source (journal)
Lecture notes in computer science. - Berlin, 1973, currens
Publication
Berlin : 2015
ISSN
0302-9743 [print]
1611-3349 [online]
Volume/pages
9286(2015), p. 281-284
ISI
000363667400033
Full text (Publishers DOI)
UAntwerpen
Faculty/Department
Publication type
Subject
External links
Web of Science
Record
Identification
Creation 23.06.2016
Last edited 21.03.2017