toc: true title: BYOL Loss tags: [‘temp’] BYOL Loss Similarity loss between qθ(zθ) and sg(zξ′) θ is trained weights ξ is exponentially moving average of θ and sg is stop gradient fθ is discarded, yθ is used as image representation