research.datasets.ImbalancedBinaryDatasets¶
- class research.datasets.ImbalancedBinaryDatasets(names='all')[source]¶
Class to download, transform and save binary class imbalanced datasets.
Methods
download()Download the datasets and append undersampled versions of them.
Download and transform the Breast Tissue Data Set.
Download and transform the Heart Disease Cleveland Data Set.
Download and transform the Dermatology Data Set.
Download and transform the Ecoli Data Set.
Download and transform the Eucalyptus Data Set.
Download and transform the Glass Identification Data Set.
Download and transform the Haberman's Survival Data Set.
Download and transform the Heart Data Set.
Download and transform the Iris Data Set.
Download and transform the LED Display Domain Data Set.
Download and transform the Libras Movement Data Set.
Download and transform the Liver Disorders Data Set.
Download and transform the Thyroid 1 Disease Data Set.
Download and transform the Thyroid 2 Disease Data Set.
Download and transform the Page Blocks 1-3 Data Set.
Download and transform the Pima Indians Diabetes Data Set.
Download and transform the Vehicle Silhouettes Data Set.
Download and transform the Vowel Recognition Data Set.
Download and transform the Wine Data Set.
Download and transform the Yeast 1 Data Set.
save(path, db_name)Save datasets.
- MULTIPLICATION_FACTORS = [2, 3]¶
- fetch_breast_tissue()[source]¶
Download and transform the Breast Tissue Data Set. The minority class is identified as the car and fad labels and the majority class as the rest of the labels.
- fetch_cleveland()[source]¶
Download and transform the Heart Disease Cleveland Data Set. The minority class is identified as the positive label and the majority class as the negative label.
- fetch_dermatology()[source]¶
Download and transform the Dermatology Data Set. The minority class is identified as the positive label and the majority class as the negative label.
- fetch_ecoli()[source]¶
Download and transform the Ecoli Data Set. The minority class is identified as the pp label and the majority class as the rest of the labels.
- fetch_eucalyptus()[source]¶
Download and transform the Eucalyptus Data Set. The minority class is identified as the best label and the majority class as the rest of the labels.
- fetch_glass()[source]¶
Download and transform the Glass Identification Data Set. The minority class is identified as the 1 label and the majority class as the rest of the labels.
https://archive.ics.uci.edu/ml/datasets/glass+identification
- fetch_haberman()[source]¶
Download and transform the Haberman’s Survival Data Set. The minority class is identified as the 1 label and the majority class as the 0 label.
- fetch_heart()[source]¶
Download and transform the Heart Data Set. The minority class is identified as the 2 label and the majority class as the 1 label.
- fetch_iris()[source]¶
Download and transform the Iris Data Set. The minority class is identified as the 1 label and the majority class as the rest of the labels.
- fetch_led()[source]¶
Download and transform the LED Display Domain Data Set. The minority class is identified as the positive label and the majority class as the negative label.
- fetch_libras()[source]¶
Download and transform the Libras Movement Data Set. The minority class is identified as the 1 label and the majority class as the rest of the labels.
- fetch_liver()[source]¶
Download and transform the Liver Disorders Data Set. The minority class is identified as the 1 label and the majority class as the ‘2’ label.
- fetch_new_thyroid_1()[source]¶
Download and transform the Thyroid 1 Disease Data Set. The minority class is identified as the positive label and the majority class as the negative label.
- fetch_new_thyroid_2()[source]¶
Download and transform the Thyroid 2 Disease Data Set. The minority class is identified as the positive label and the majority class as the negative label.
- fetch_page_blocks_1_3()[source]¶
Download and transform the Page Blocks 1-3 Data Set. The minority class is identified as the positive label and the majority class as the negative label.
- fetch_pima()[source]¶
Download and transform the Pima Indians Diabetes Data Set. The minority class is identified as the 1 label and the majority class as the ‘0’ label.
- fetch_vehicle()[source]¶
Download and transform the Vehicle Silhouettes Data Set. The minority class is identified as the 1 label and the majority class as the rest of the labels.
https://archive.ics.uci.edu/ml/datasets/Statlog+(Vehicle+Silhouettes)
- fetch_vowel()[source]¶
Download and transform the Vowel Recognition Data Set. The minority class is identified as the positive label and the majority class as the negative label.
- fetch_wine()[source]¶
Download and transform the Wine Data Set. The minority class is identified as the 2 label and the majority class as the rest of the labels.
- fetch_yeast_1()[source]¶
Download and transform the Yeast 1 Data Set. The minority class is identified as the positive label and the majority class as the negative label.
- save(path, db_name)¶
Save datasets.