Skip to content

Reducing imbalanced dataset (Undersampling) by Consensus Clustering (Simple Majority Voting function) and validating the changes using different classifier model with bagging and boosting techniques.

Notifications You must be signed in to change notification settings

arghac14/UndErNsembled

 
 

Repository files navigation

UndErNsembled:

About the model:

In this project, we reduced an imbalanced dataset (Undersampling) by Consensus Clustering using 'Simple Majority Voting' consensus function and further saw the increase in the accuracy of disease prediction by running multiple classifers with bagging and boosting technique.

Dataset:

The dataset we have is the colon cancer dataset of (62x2000) dimension.

Result:

This is the final result, i.e. comparison of different classifiers of predicting the disease accurately in both balanced and imbalanced data.

About

Reducing imbalanced dataset (Undersampling) by Consensus Clustering (Simple Majority Voting function) and validating the changes using different classifier model with bagging and boosting techniques.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%