Residual Flows

 

Inverting Residual Network Layers

  • form , if we ensure that is a contraction mapping (Banach fixed point theorem)
  • lipschitz constant must be less than 1
  • Assuming that the slope of the activation functions is not greater than one, this is equivalent to ensuring the largest eigenvalue of each weight matrix must be less than 1