Publication
Title
The Eclipse and Mozilla defect tracking dataset : a genuine dataset for mining bug information
Author
Abstract
The analysis of bug reports is an important subfield within the mining software repositories community. It explores the rich data available in defect tracking systems to uncover interesting and actionable information about the bug triaging process. While bug data is readily accessible from systems like Bugzilla and JIRA, a common database schema and a curated dataset could significantly enhance future research because they allow for easier replication. Consequently, in this paper we propose the Eclipse and Mozilla Defect Tracking Dataset, a representative database of bug data, filtered to contain only genuine defects (i.e., no feature requests) and designed to cover the whole bug-triage life cycle (i.e., store all intermediate actions). We have used this dataset ourselves for predicting bug severity, for studying bug-fixing time and for identifying erroneously assigned components. Sharing these data with the rest of the community will allow for reproducibility, validation and comparison of the results obtained in bug-report analyses and experiments.
Language
English
Source (book)
Proceedings MSR13 : 10th IEEE Working Conference on Mining Software Repositories, Piscataway, N.J., USA
Publication
New York, N.Y. : IEEE, 2013
ISBN
978-1-4799-0345-0
Volume/pages
p. 203-206
UAntwerpen
Affiliation
Publications with a UAntwerp address
Record
Identification
Creation 19.02.2014
Last edited 22.11.2016