Sigmoid: $\sigma(x) = \frac{1}{1 + \exp(-x)}$; $\frac{d\sigma(x)}{dx} = \sigma(x)\,(1 - \sigma(x))$ (max: 0.25); Logistic; Xavier/Glorot init; RNN: Hidden; Bernoulli distribution over a binary variable
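The sigmoid and its derivative can be checked numerically with a short sketch. The function names `sigmoid` and `sigmoid_grad` are illustrative, not from the source. Branching on the sign of `x` is an added assumption to avoid overflow in `exp` for large negative inputs. It is not required by the formula itself.

```python
import math


def sigmoid(x: float) -> float:
    """Logistic sigmoid 1 / (1 + exp(-x)), evaluated without overflow."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    # For negative x, exp(-x) can overflow, so use the algebraically
    # equivalent form exp(x) / (1 + exp(x)) instead.
    z = math.exp(x)
    return z / (1.0 + z)


def sigmoid_grad(x: float) -> float:
    """Derivative of the sigmoid: sigma(x) * (1 - sigma(x))."""
    s = sigmoid(x)
    return s * (1.0 - s)


if __name__ == "__main__":
    # Print a few values; the derivative is largest at x = 0.
    for x in (-4.0, -1.0, 0.0, 1.0, 4.0):
        print(f"x={x:+.1f}  sigmoid={sigmoid(x):.4f}  grad={sigmoid_grad(x):.4f}")
```

Because the output lies in (0, 1), `sigmoid(z)` can be read as the parameter p = P(y = 1) of a Bernoulli distribution over a binary variable y. The 0.25 ceiling on the derivative is reached only at x = 0, and the derivative shrinks toward zero as |x| grows.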