| Developing New Fitness Functions in Genetic Programming for Classification With Unbalanced Data. | |
| | |
MedLine Citation:
|
PMID: 21954215 Owner: NLM Status: Publisher |
Abstract/OtherAbstract:
|
Machine learning algorithms such as genetic programming (GP) can evolve biased classifiers when data sets are unbalanced. Data sets are unbalanced when at least one class is represented by only a small number of training examples (called the minority class) while other classes make up the majority. In this scenario, classifiers can have good accuracy on the majority class but very poor accuracy on the minority class(es) due to the influence that the larger majority class has on traditional training criteria in the fitness function. This paper aims to both highlight the limitations of the current GP approaches in this area and develop several new fitness functions for binary classification with unbalanced data. Using a range of real-world classification problems with class imbalance, we empirically show that these new fitness functions evolve classifiers with good performance on both the minority and majority classes. Our approaches use the original unbalanced training data in the GP learning process, without the need to artificially balance the training examples from the two classes (e.g., via sampling). |
| | |
Authors:
|
Urvesh Bhowan; Mark Johnston; Mengjie Zhang |
Related Documents
:
|
11543275 - Astrobiology: exploring the origins, evolution, and distribution of life in the universe. 15940845 - Valuing life: a plea for disaggregation. 22217595 - The european radiobiological archives: online access to data from radiobiological expe... 18375775 - A review of systematic reviews evaluating diabetes interventions: focus on quality of l... 10557115 - Teenagers, young people and family planning: a survey in five romanian high schools. 1111085 - The work of a rural practice. |
Publication Detail:
|
Type: JOURNAL ARTICLE Date: 2011-9-26 |
Journal Detail:
|
Title: IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society Volume: - ISSN: 1941-0492 ISO Abbreviation: - Publication Date: 2011 Sep |
Date Detail:
|
Created Date: 2011-9-28 Completed Date: - Revised Date: - |
Medline Journal Info:
|
Nlm Unique ID: 9890044 Medline TA: IEEE Trans Syst Man Cybern B Cybern Country: - |
Other Details:
|
Languages: ENG Pagination: - Citation Subset: - |
Export Citation:
|
APA/MLA Format Download EndNote Download BibTex |
| MeSH Terms | |
Descriptor/Qualifier:
|
|
From MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine
Previous Document: Cross-Domain Human Action Recognition.
Next Document: Inhibition of caspase-8 activity caused by overexpression of bcl10 contributes to the pathogenesis o...