research.datasets.ImbalancedBinaryDatasets

class research.datasets.ImbalancedBinaryDatasets(names='all')

Class to download, transform, and save binary class-imbalanced datasets.

Methods

download()

Download the datasets and append undersampled versions of them.

fetch_breast_tissue()

Download and transform the Breast Tissue Data Set.

fetch_cleveland()

Download and transform the Heart Disease Cleveland Data Set.

fetch_dermatology()

Download and transform the Dermatology Data Set.

fetch_ecoli()

Download and transform the Ecoli Data Set.

fetch_eucalyptus()

Download and transform the Eucalyptus Data Set.

fetch_glass()

Download and transform the Glass Identification Data Set.

fetch_haberman()

Download and transform the Haberman’s Survival Data Set.

fetch_heart()

Download and transform the Heart Data Set.

fetch_iris()

Download and transform the Iris Data Set.

fetch_led()

Download and transform the LED Display Domain Data Set.

fetch_libras()

Download and transform the Libras Movement Data Set.

fetch_liver()

Download and transform the Liver Disorders Data Set.

fetch_new_thyroid_1()

Download and transform the Thyroid 1 Disease Data Set.

fetch_new_thyroid_2()

Download and transform the Thyroid 2 Disease Data Set.

fetch_page_blocks_1_3()

Download and transform the Page Blocks 1-3 Data Set.

fetch_pima()

Download and transform the Pima Indians Diabetes Data Set.

fetch_vehicle()

Download and transform the Vehicle Silhouettes Data Set.

fetch_vowel()

Download and transform the Vowel Recognition Data Set.

fetch_wine()

Download and transform the Wine Data Set.

fetch_yeast_1()

Download and transform the Yeast 1 Data Set.

save(path, db_name)

Save datasets.


MULTIPLICATION_FACTORS = [2, 3]

download()

Download the datasets and append undersampled versions of them.
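The docstring does not spell out what an "undersampled version" is. As a minimal sketch, assuming that each factor in MULTIPLICATION_FACTORS divides the number of minority samples (so the imbalance ratio grows by roughly that factor), the idea might look like the following. The helper name undersample_minority, the 0/1 label encoding and the toy data are hypothetical and not part of the documented API.

```python
import random
from collections import Counter

MULTIPLICATION_FACTORS = [2, 3]  # same values as the class attribute


def undersample_minority(X, y, factor, minority_label=1, seed=0):
    """Keep all majority samples and 1/factor of the minority samples.

    Hypothetical helper: an assumption about what "undersampled
    version" means, not the library's actual implementation.
    """
    rng = random.Random(seed)
    minority_idx = [i for i, label in enumerate(y) if label == minority_label]
    n_keep = len(minority_idx) // factor
    kept_minority = set(rng.sample(minority_idx, n_keep))
    # Preserve the original sample order while dropping minority rows.
    keep = [i for i, label in enumerate(y)
            if label != minority_label or i in kept_minority]
    return [X[i] for i in keep], [y[i] for i in keep]


# Toy data: 12 minority (1) and 36 majority (0) samples.
y = [1] * 12 + [0] * 36
X = [[float(i)] for i in range(len(y))]

for factor in MULTIPLICATION_FACTORS:
    X_res, y_res = undersample_minority(X, y, factor)
    counts = Counter(y_res)
    print(factor, counts[0], counts[1])
```

Under this assumption the majority class is left untouched, so each factor produces a harder, more imbalanced variant of the same dataset.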

fetch_breast_tissue()

Download and transform the Breast Tissue Data Set. The minority class is identified as the car and fad labels and the majority class as the rest of the labels.
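The "minority labels versus the rest" transformation described here, and in the other methods that use the same wording, can be sketched as below. This is an illustration only: the function name binarize_labels, the choice of 1 for the minority class and 0 for the majority class, and the toy label list are assumptions, not the library's code.

```python
def binarize_labels(labels, minority_labels):
    """Map labels in `minority_labels` to 1 and every other label to 0."""
    minority = set(minority_labels)
    return [1 if label in minority else 0 for label in labels]


# Toy label column; the values are illustrative, not real data.
raw = ['car', 'fad', 'mas', 'gla', 'con', 'adi', 'car']
target = binarize_labels(raw, minority_labels={'car', 'fad'})
print(target)  # [1, 1, 0, 0, 0, 0, 1]
```

Passing a single-element set, for example {'pp'} for the Ecoli data, covers the methods where one label forms the minority class.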

http://archive.ics.uci.edu/ml/datasets/breast+tissue

fetch_cleveland()

Download and transform the Heart Disease Cleveland Data Set. The minority class is identified as the positive label and the majority class as the negative label.
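For the datasets that already carry positive and negative labels, the transformation reduces to a label encoding. The sketch below assumes string labels that may have stray whitespace and encodes positive as 1 and negative as 0; it also computes the imbalance ratio (majority count divided by minority count). The function names and the toy labels are hypothetical.

```python
from collections import Counter


def encode_binary_labels(labels):
    """Encode 'positive' as 1 (minority) and 'negative' as 0 (majority)."""
    mapping = {'positive': 1, 'negative': 0}
    return [mapping[label.strip()] for label in labels]


def imbalance_ratio(target):
    """Return the majority count divided by the minority count."""
    counts = Counter(target)
    return counts[0] / counts[1]


# Toy labels; the values are illustrative, not real data.
raw = ['negative', ' positive', 'negative', 'negative', 'negative ', 'positive']
target = encode_binary_labels(raw)
print(target, imbalance_ratio(target))  # [0, 1, 0, 0, 0, 1] 2.0
```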

http://sci2s.ugr.es/keel/dataset.php?cod=980

fetch_dermatology()

Download and transform the Dermatology Data Set. The minority class is identified as the positive label and the majority class as the negative label.

http://sci2s.ugr.es/keel/dataset.php?cod=1330

fetch_ecoli()

Download and transform the Ecoli Data Set. The minority class is identified as the pp label and the majority class as the rest of the labels.

https://archive.ics.uci.edu/ml/datasets/ecoli

fetch_eucalyptus()

Download and transform the Eucalyptus Data Set. The minority class is identified as the best label and the majority class as the rest of the labels.

https://www.openml.org/d/188

fetch_glass()

Download and transform the Glass Identification Data Set. The minority class is identified as the 1 label and the majority class as the rest of the labels.

https://archive.ics.uci.edu/ml/datasets/glass+identification

fetch_haberman()

Download and transform the Haberman’s Survival Data Set. The minority class is identified as the 1 label and the majority class as the 0 label.

https://archive.ics.uci.edu/ml/datasets/Haberman's+Survival

fetch_heart()

Download and transform the Heart Data Set. The minority class is identified as the 2 label and the majority class as the 1 label.

http://archive.ics.uci.edu/ml/datasets/statlog+(heart)

fetch_iris()

Download and transform the Iris Data Set. The minority class is identified as the 1 label and the majority class as the rest of the labels.

https://archive.ics.uci.edu/ml/datasets/iris

fetch_led()

Download and transform the LED Display Domain Data Set. The minority class is identified as the positive label and the majority class as the negative label.

http://sci2s.ugr.es/keel/dataset.php?cod=998

fetch_libras()

Download and transform the Libras Movement Data Set. The minority class is identified as the 1 label and the majority class as the rest of the labels.

https://archive.ics.uci.edu/ml/datasets/Libras+Movement

fetch_liver()

Download and transform the Liver Disorders Data Set. The minority class is identified as the 1 label and the majority class as the 2 label.

https://archive.ics.uci.edu/ml/datasets/liver+disorders

fetch_new_thyroid_1()

Download and transform the Thyroid 1 Disease Data Set. The minority class is identified as the positive label and the majority class as the negative label.

http://sci2s.ugr.es/keel/dataset.php?cod=145

fetch_new_thyroid_2()

Download and transform the Thyroid 2 Disease Data Set. The minority class is identified as the positive label and the majority class as the negative label.

http://sci2s.ugr.es/keel/dataset.php?cod=146

fetch_page_blocks_1_3()

Download and transform the Page Blocks 1-3 Data Set. The minority class is identified as the positive label and the majority class as the negative label.

http://sci2s.ugr.es/keel/dataset.php?cod=124

fetch_pima()

Download and transform the Pima Indians Diabetes Data Set. The minority class is identified as the 1 label and the majority class as the 0 label.

https://www.kaggle.com/uciml/pima-indians-diabetes-database

fetch_vehicle()

Download and transform the Vehicle Silhouettes Data Set. The minority class is identified as the 1 label and the majority class as the rest of the labels.

https://archive.ics.uci.edu/ml/datasets/Statlog+(Vehicle+Silhouettes)

fetch_vowel()

Download and transform the Vowel Recognition Data Set. The minority class is identified as the positive label and the majority class as the negative label.

http://sci2s.ugr.es/keel/dataset.php?cod=127

fetch_wine()

Download and transform the Wine Data Set. The minority class is identified as the 2 label and the majority class as the rest of the labels.

https://archive.ics.uci.edu/ml/datasets/wine

fetch_yeast_1()

Download and transform the Yeast 1 Data Set. The minority class is identified as the positive label and the majority class as the negative label.

http://sci2s.ugr.es/keel/dataset.php?cod=153

save(path, db_name)

Save datasets.
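The docstring does not state which storage format save uses. The db_name parameter suggests a database, so the sketch below assumes, purely for illustration, that each dataset is written as one table of a SQLite file named db_name.db inside path. The helper save_tables, the (name, header, rows) dataset layout and the toy data are hypothetical and do not describe the library's actual behavior.

```python
import sqlite3
import tempfile
from pathlib import Path


def save_tables(datasets, path, db_name):
    """Write each (name, header, rows) dataset as a table in path/db_name.db.

    Hypothetical helper: the storage format is an assumption, not
    something the documentation confirms.
    """
    db_file = Path(path) / f'{db_name}.db'
    connection = sqlite3.connect(db_file)
    with connection:  # commits on success, rolls back on error
        for name, header, rows in datasets:
            columns = ', '.join(f'"{col}"' for col in header)
            placeholders = ', '.join('?' for _ in header)
            connection.execute(f'DROP TABLE IF EXISTS "{name}"')
            connection.execute(f'CREATE TABLE "{name}" ({columns})')
            connection.executemany(
                f'INSERT INTO "{name}" VALUES ({placeholders})', rows)
    connection.close()
    return db_file


# Toy dataset: two features and a binary target column.
toy = [('toy', ['f0', 'f1', 'target'], [(0.1, 1.0, 0), (0.7, 0.2, 1)])]

with tempfile.TemporaryDirectory() as tmp:
    db_file = save_tables(toy, tmp, 'imbalanced_binary')
    connection = sqlite3.connect(db_file)
    n_rows = connection.execute('SELECT COUNT(*) FROM "toy"').fetchone()[0]
    connection.close()

print(n_rows)
```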