Alleviating Class Imbalance with Data Augmentation

  • Data Augmentation falls under a Data-level solution to class imbalance and there are many different strategies for implementation.
  • A naive solution to oversampling with Data Augmentation would be a simple random oversampling with small Geometric Transformations such as a 30° rotation
  • One problem of oversampling with basic image transformations is that it could cause overfitting on the minority class which is being oversampled
  • The biases present in the minority class are more prevalent post-sampling with these techniques.
  • Neural Style Transfer is an interesting way to create new images. These new images can be created either through extrapolating style with a foreign style or by interpolating styles amongst instances within the dataset.
  • Oversampling with GANs can be done using the entire minority class as ‘real’ examples, or by using subsets of the minority class as inputs to GANs
  • The use of evolutionary sampling to find these subsets to input to GANs for class sampling is a promising area for future work.