Imbalanced data is part of life! With a proper knowledge of the data set and appropriate techniques imbalanced data can be easily managed.
This video uses the Indian Liver Disease data (link below) to demonstrate the use of data up-scaling (and SMOTE) for minority class to improve accuracy.
References:
Dataset:
[ Ссылка ]
ROCAUC using Yellowbrick:
[ Ссылка ]
Code generated in the video can be downloaded from here: [ Ссылка ]
![](https://s2.save4k.ru/pic/VQuJvGTzBgw/maxresdefault.jpg)