Soft Attention For a simple Seq2Seq, all hidden state vectors ht across timesteps are linearly combined ci=Σj=1Tαijhj aij=Σk=1Texp(eij)exp(eij)