Phase Transition Model Zoo

 
  • konstantin schurholt
  • transfer learning, model averaging, scaling models does not always work

Phase Transition in Physical Systems

  • between phases is an interpolation
  • order params with some control params
    • eg : temp and pressure in water
  • are there phases in NN?

Regimes in NN

Statistical Mechanics

Control Params in NN

  • temperature and load are relative

Temp

Load

Emperical Phases in NN Performance

  • taxonomizing local vs global
  • expected phases emerge on the load temperature landscape
  • load : complexity/capactity
  • temperature : noisiness of training
  • 5 phases, phase IV-B is the best
  • ~5k models trained from scratch
  • hessian : minima
    • flatness of the minima
      • flatter minima generalizes better : depending on where you are
  • mode connectivity
    • models trained with same config + different seed
    • if so, find a path where the loss is the same

Phase Transition in Model Zoo

  • phase transitions are a general phenomenon in NNs
  • all of these requires datasets of models that cover phase transitions
  • since temp/load are basically have the same effects, didnt train every possibility
    • Vision Transformer are very different
      • big optimal phase
      • for CIFAR, bad performance without data augmentation
        • (could also be because the dataset has a lot of noise)

Applications

  • research on transitions
    • might identify root causes for performance