Matrix notation for NNs

 

Example for simple case

  • A NN consists of linear transformations alternating with loss functions.
  • or , \begin{align*} \mathbf{h = a[\theta_{0}+ \theta_{x}]} \\ \mathbf{h' = a[\psi_{0} + \Psi h]}\\ \mathbf{y' = \phi'_{0}+ \phi'h'} \end{align*}
    • where

General case

  • Vector of hidden units at layer is
  • Vector of biases that contribute to hidden layer is
  • Weights that are applied to layer and contribute to hidden layer is
  • General network with layers is now