SaliencyMix

  • @uddinSaliencyMixSaliencyGuided2021
  • extracts salient regions and pastes them on the corresponding location in the target image
  • The salient region is extracted around the maximum intensity pixel location in the saliency map
  • Regional dropout is one of the popular solutions that guides the model to focus on less discriminative parts by randomly removing image regions, resulting in improved regularization
  • however, such information removal is undesirable.
  • On the other hand, recent strategies suggest to randomly cut and mix patches and their labels among training images, to enjoy the advantages of regional dropout without having any pointless pixel in the augmented images.
  • selects a representative image patch with the help of a saliency map and mixes this indicative patch with the target image
  • Furthermore, models that are trained with SaliencyMix help to improve the object detection performance
  • we cut a source patch and mix it to the target image and also mix their labels proportionally to the size of the mixed patches
  • But in order to prevent the model from learning any irrelevant feature representation, the proposed method enforces to select a source patch in a way so that it must contains information about the source object
  • It first extracts a saliency map of the source image to highlight the objects of interest and then selects a patch surrounding the peak salient region to mix with the target image.

Algorithm

  • If is randomly selected training source with label . Saliency map is where is the visual saliency. is a saliency model.
  • Search for a pixel with maximum intensity
  • Patch selected by centering on the pixel or with this pixel in the patch
  • Patch determined on a ration chosen from [Uniform Distribution](Uniform Distribution.md)

Mixing Patches and Labels

  • Another image
  • labels

DIFFERENT WAYS OF SELECTING AND MIXING THE SOURCE PATCH

  • schemes: (i) Salient to Corresponding, that selects the source patch from the most salient region and mix it to the corresponding location of the target image; (ii) Salient to Salient, that selects the source patch from the most salient region and mix it to the salient region of the target image; (iii) Salient to Non-Salient, that selects the source patch from the most salient region but mix it to the non-salient region of the target image; (iv) Non-Salient to Salient, that selects the source patch from the non-salient region of the source image but mix it to the salient region of the target image; and (v) Non-Salient to NonSalient, that selects the source patch from the non-salient region of the source image and also mix it to the non-salient region of the target image.
  • To find out the non-salient region, we use the least important pixel of an image.

Images

  • images/Pasted%20image%2020230508113658.png
  • images/Pasted%20image%2020230508113709.png
  • images/Pasted%20image%2020230508113721.png
  • images/Pasted%20image%2020230508113734.png
  • images/Pasted%20image%2020230508113752.png