Proxy Attention Let Is∈RW×H×C be a random source image applied saliency map Ivs Ivs=f(Is)∈RW×H Augmented image Ia ⊙ elementwise multi Mask M∈{0,1}W×H