Towards efficient self-supervised representation learning in speech processing

Speech processing models are computationally expensive, generating environmental concerns because of their high energy consumption. ESSL (Efficient Self-Supervised Learning) addresses this issue, enabling pretraining with a single GPU for only 28 hours. The reduction in computational costs represents up to two orders of magnitude improvement against existing speech models. Its source code is available on GitHub under an MIT license.

Liveradio

Orange Radio

Full Content

Fast Point

Le Switch Tuner

TV d’Orange

Livebox

Set Top Box

Djingo

La Clé TV

Live Button

Internet Facile

Livebox Tools

Livebox

Towards efficient self-supervised representation learning in speech processing