Speech Emotion Recognition

  • GAN-based Data Generation for Speech Emotion Recognition
  • Generates synthetic data in the form of speech emotion spectrograms
  • The generated spectrograms can be used for training speech emotion recognition networks
  • Investigates the use of GANs for capturing the data manifold when the data is eyes-off, i.e., where the authors can train networks using the data but cannot copy it from the clients
  • CNN-based GAN with spectral normalization on both the generator and discriminator, both of which are pre-trained on large unlabeled speech corpora
  • Even after the data on the client is lost, their model can generate similar data that can be used for model bootstrapping in the future
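
The notes mention spectral normalization on the generator and discriminator without saying how it works. As a rough illustration only (not the authors' implementation), the sketch below estimates the largest singular value of a weight matrix by power iteration and divides the weights by it, so the normalized matrix has a spectral norm of about 1. The function names, the iteration count, and the toy matrix are all assumptions made for this example. In a CNN, a convolution kernel would typically be reshaped to a 2D matrix first, and a deep learning framework would usually run a single power-iteration step per training update rather than many.

```python
import math
import random


def matvec(W, v):
    """Multiply matrix W (a list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def matvec_t(W, u):
    """Multiply the transpose of W by vector u."""
    return [sum(W[i][j] * u[i] for i in range(len(W))) for j in range(len(W[0]))]


def l2_normalize(x, eps=1e-12):
    """Scale x to (approximately) unit Euclidean length."""
    norm = math.sqrt(sum(x_i * x_i for x_i in x))
    return [x_i / (norm + eps) for x_i in x]


def spectral_norm(W, n_iters=50, seed=0):
    """Estimate the largest singular value of W by power iteration."""
    rng = random.Random(seed)
    # Hypothetical choice: random Gaussian start vector with a fixed seed.
    u = l2_normalize([rng.gauss(0.0, 1.0) for _ in range(len(W))])
    v = l2_normalize(matvec_t(W, u))
    for _ in range(n_iters):
        v = l2_normalize(matvec_t(W, u))  # right singular vector estimate
        u = l2_normalize(matvec(W, v))    # left singular vector estimate
    # sigma is approximately u^T W v
    return sum(u_i * wv_i for u_i, wv_i in zip(u, matvec(W, v)))


def spectrally_normalize(W, n_iters=50, seed=0):
    """Return W divided by its estimated spectral norm."""
    sigma = spectral_norm(W, n_iters, seed)
    return [[w / sigma for w in row] for row in W]


# Toy weight matrix whose singular values are 3 and 1.
W = [[3.0, 0.0], [0.0, 1.0]]
W_sn = spectrally_normalize(W)
```

Dividing by the largest singular value bounds how much the layer can stretch its input, which is the property spectral normalization relies on to stabilize GAN training.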