Document Detail


Developing New Fitness Functions in Genetic Programming for Classification With Unbalanced Data.
MedLine Citation:
PMID:  21954215     Owner:  NLM     Status:  Publisher    
Abstract/OtherAbstract:
Machine learning algorithms such as genetic programming (GP) can evolve biased classifiers when data sets are unbalanced. Data sets are unbalanced when at least one class is represented by only a small number of training examples (called the minority class) while other classes make up the majority. In this scenario, classifiers can have good accuracy on the majority class but very poor accuracy on the minority class(es) due to the influence that the larger majority class has on traditional training criteria in the fitness function. This paper aims to both highlight the limitations of the current GP approaches in this area and develop several new fitness functions for binary classification with unbalanced data. Using a range of real-world classification problems with class imbalance, we empirically show that these new fitness functions evolve classifiers with good performance on both the minority and majority classes. Our approaches use the original unbalanced training data in the GP learning process, without the need to artificially balance the training examples from the two classes (e.g., via sampling).
Authors:
Urvesh Bhowan; Mark Johnston; Mengjie Zhang
Related Documents :
11543275 - Astrobiology: exploring the origins, evolution, and distribution of life in the universe.
15940845 - Valuing life: a plea for disaggregation.
22217595 - The european radiobiological archives: online access to data from radiobiological expe...
18375775 - A review of systematic reviews evaluating diabetes interventions: focus on quality of l...
10557115 - Teenagers, young people and family planning: a survey in five romanian high schools.
1111085 - The work of a rural practice.
Publication Detail:
Type:  JOURNAL ARTICLE     Date:  2011-9-26
Journal Detail:
Title:  IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society     Volume:  -     ISSN:  1941-0492     ISO Abbreviation:  -     Publication Date:  2011 Sep 
Date Detail:
Created Date:  2011-9-28     Completed Date:  -     Revised Date:  -    
Medline Journal Info:
Nlm Unique ID:  9890044     Medline TA:  IEEE Trans Syst Man Cybern B Cybern     Country:  -    
Other Details:
Languages:  ENG     Pagination:  -     Citation Subset:  -    
Export Citation:
APA/MLA Format     Download EndNote     Download BibTex
MeSH Terms
Descriptor/Qualifier:

From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine


Previous Document:  Cross-Domain Human Action Recognition.
Next Document:  Inhibition of caspase-8 activity caused by overexpression of bcl10 contributes to the pathogenesis o...