Europarl-ST

  • multilingual speech translation corpus
  • based on speeches and debates in the European parliament between 2008-2013
  • Creative Common Non-Commercial license
  • data belongs to the European Union
  • by releasing, authors want to improve speech translation
  • consists of audio, transcript and translation of the transcript
  • 72 translation directions
  • includes also noisy samples