Speech Emotion Recognition
- GAN-based Data Generation for Speech Emotion Recognition
- generates synthetic data in the form of speech emotion spectrograms
- the generated spectrograms are used for training speech emotion recognition networks
- investigates the usage of GANs for capturing the data manifold when the data is eyes-off, i.e., where the authors can train networks using the data but cannot copy it from the clients
- CNN-based GAN with spectral normalization on both the generator and discriminator, both of which are pre-trained on large unlabeled speech corpora
- even after the data on the client is lost, their model can generate similar data that can be used for model bootstrapping in the future
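
Spectral normalization, mentioned above for both the generator and the discriminator, rescales each weight matrix by an estimate of its largest singular value so that the layer's spectral norm is about 1. The following is a minimal sketch of that idea only, not the authors' implementation: it is pure Python on a toy 2x2 matrix, the function names (`spectral_norm`, `spectrally_normalize`) are hypothetical, and it assumes a non-zero weight matrix. In a CNN, the convolution kernel would typically be reshaped to a 2-D matrix first, and a framework utility would be used instead.

```python
import math
import random


def matvec(w, v):
    """Multiply matrix w (a list of rows) by vector v."""
    return [sum(a * b for a, b in zip(row, v)) for row in w]


def transpose(w):
    """Return the transpose of matrix w as a list of rows."""
    return [list(col) for col in zip(*w)]


def l2_normalize(v, eps=1e-12):
    """Scale v to (approximately) unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / (norm + eps) for x in v]


def spectral_norm(w, n_iters=50, seed=0):
    """Estimate the largest singular value of w by power iteration."""
    rng = random.Random(seed)
    w_t = transpose(w)
    # Random starting vector with one entry per row of w.
    u = l2_normalize([rng.gauss(0.0, 1.0) for _ in range(len(w))])
    for _ in range(n_iters):
        v = l2_normalize(matvec(w_t, u))
        u = l2_normalize(matvec(w, v))
    # sigma is approximated by u^T W v.
    return sum(a * b for a, b in zip(u, matvec(w, v)))


def spectrally_normalize(w, n_iters=50):
    """Return w divided by its estimated spectral norm (assumes w != 0)."""
    sigma = spectral_norm(w, n_iters)
    return [[x / sigma for x in row] for row in w]


# Toy example: the singular values of this matrix are 3 and 1.
weights = [[3.0, 0.0], [0.0, 1.0]]
normalized = spectrally_normalize(weights)
print(spectral_norm(weights))     # close to 3.0
print(spectral_norm(normalized))  # close to 1.0
```

Bounding the spectral norm of every layer limits how sharply the network's output can change with its input, which is the usual motivation for applying it in GAN training. Practical implementations commonly run a single power-iteration step per training update and reuse the vector `u` across updates, rather than iterating to convergence as this sketch does.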