Blame-Based Noise Reduction: An Alternative Perspective on Noise Reduction for Lazy Learning
Pasquier, Francois-Xavier; Delany, Sarah Jane; Cunningham, Padraig
TCD-CS-2005-29
In this paper we present a new perspective on noise reduction for nearest-neighbour classifiers. Classic noise reduction algorithms such as Repeated Edited Nearest Neighbour remove cases from the training set if they are misclassified by their nearest neighbours in a leave-one-out cross validation. In the approach presented here, cases are identified for deletion based on their propensity to cause misclassifications. This approach was originally identified in a case-based spam filtering application where it became clear that certain training examples were damaging to the accuracy of the system. In this paper we evaluate the general applicability of the approach on a large variety of datasets and show that it generally beats the classic approach. We also compare the two techniques on artificial noise and show that both are far from perfect at removing noise and that there remains scope for further research in this area.
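The contrast drawn in the abstract can be illustrated with a small sketch. It is not the report's algorithm: the function names (`knn_indices`, `enn_flags`, `blame_counts`), the toy data, and the plain "blame count" are all assumptions made for illustration, and the report itself defines the actual deletion criterion. The sketch only shows the two perspectives side by side: flagging cases that are misclassified in leave-one-out, versus counting how often a case contributes to misclassifying others.

```python
from collections import Counter

def knn_indices(X, i, k):
    """Indices of the k nearest neighbours of case i, excluding i itself."""
    def dist2(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))
    others = [j for j in range(len(X)) if j != i]
    # Ties in distance are broken by the lower index.
    others.sort(key=lambda j: (dist2(X[i], X[j]), j))
    return others[:k]

def vote(labels):
    """Majority label; ties go to the label encountered first."""
    return Counter(labels).most_common(1)[0][0]

def enn_flags(X, y, k=3):
    """Classic view: flag cases that their own neighbours misclassify
    in a leave-one-out pass (one pass only, not the repeated variant)."""
    return [i for i in range(len(X))
            if vote([y[j] for j in knn_indices(X, i, k)]) != y[i]]

def blame_counts(X, y, k=3):
    """Alternative view (assumed, simplified): for every leave-one-out
    misclassification, blame the neighbours that voted for the wrong label."""
    blame = [0] * len(X)
    for i in range(len(X)):
        nbrs = knn_indices(X, i, k)
        predicted = vote([y[j] for j in nbrs])
        if predicted != y[i]:
            for j in nbrs:
                if y[j] == predicted:
                    blame[j] += 1
    return blame

# Toy 1-D data: two clean clusters plus one mislabelled case at index 4.
X = [[0.0], [1.0], [2.0], [3.0], [1.5], [10.0], [11.0], [12.0], [13.0]]
y = ["a", "a", "a", "a", "b", "b", "b", "b", "b"]

flags = enn_flags(X, y, k=1)     # [1, 2, 4]
blame = blame_counts(X, y, k=1)  # [0, 1, 0, 0, 2, 0, 0, 0, 0]
print(flags, blame)
```

On this toy data with k=1, the misclassification-based view flags the mislabelled case together with its two correctly labelled neighbours, while the blame count is highest for the mislabelled case alone. This is a constructed example, not evidence about either method's accuracy.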
Keyword(s): Nearest-Neighbour Classifiers; Noise Reduction
Publication Date: 2005
Type: Report
Peer-Reviewed: Unknown
Language(s): English
Institution: Trinity College Dublin
Funder(s): Science Foundation Ireland; Enterprise Ireland
Citation(s): Pasquier, Francois-Xavier; Delany, Sarah Jane; Cunningham, Padraig. 'Blame-Based Noise Reduction: An Alternative Perspective on Noise Reduction for Lazy Learning'. - Dublin, Trinity College Dublin, Department of Computer Science, TCD-CS-2005-29, 2005, pp17
Publisher(s): Trinity College Dublin, Department of Computer Science
File Format(s): application/pdf
First Indexed: 2014-05-13 05:31:07
Last Updated: 2015-04-10 05:13:50