WORKSHOP: MY AI IS BETTER THAN YOURS WS25

MODELS: RAVE

Lecturers: Kim Albrecht, Lars Christian Schmidt, Yağmur Uçkunkaya

Winter 2025

What it does: Learns compact features of sound from your dataset and re-generates sound from them in real time.
Media: Sound

RAVE (Realtime Audio Variational autoEncoder) is an autoencoder for sound: it takes audio as input and is trained to reconstruct that same audio as output. Along the way it learns a compact latent space (a small set of parameters that describe the audio) and decodes those parameters back into a waveform. Because it runs efficiently, it is suitable for live performance, realtime sound generation and transformation, and interactive installations.
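To get a feel for what "compact" means here, the following sketch counts how many numbers a latent representation might hold for one second of audio. It is not RAVE code. The compression ratio of 2048 samples per latent frame and the latent size of 16 are assumptions chosen for illustration; the real values depend on the model configuration you train with, and `latent_shape` is a hypothetical helper.

```python
import math


def latent_shape(n_samples: int, compression_ratio: int = 2048,
                 latent_dim: int = 16) -> tuple[int, int]:
    """Return (latent_dim, n_frames) for a mono signal of n_samples.

    compression_ratio and latent_dim are illustrative assumptions,
    not values read from a trained model.
    """
    # Each latent frame summarises one block of `compression_ratio` samples;
    # a partial last block still needs its own frame, hence the ceiling.
    n_frames = math.ceil(n_samples / compression_ratio)
    return latent_dim, n_frames


sample_rate = 48_000                      # one second of mono audio at 48 kHz
dims, frames = latent_shape(sample_rate)
latent_values = dims * frames

print(f"waveform values per second: {sample_rate}")
print(f"latent shape: ({dims}, {frames}) = {latent_values} values")
print(f"reduction: about {sample_rate / latent_values:.0f}x")
```

Under these assumptions, one second of audio becomes a 16 x 24 grid of numbers instead of 48,000 samples. Those few numbers are what you steer when you play or transform sound with the model.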

Workings

1. Collect audio

2. Prep

3. Train your RAVE model

4. Export & use
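The four steps above can be sketched as three command-line calls (prep, train, export) once the audio is collected in a folder. The script below only builds and prints the commands; it does not run them. The subcommands and flags follow the README of the open-source RAVE package (acids-ircam/RAVE) as far as we know it, so treat them as assumptions and check them against `rave --help` for your installed version. All folder names, the model name, and the helper `rave_commands` are hypothetical placeholders.

```python
import shlex


def rave_commands(audio_dir: str, dataset_dir: str, runs_dir: str,
                  run_dir: str, name: str, channels: int = 1) -> list[list[str]]:
    """Build (but do not execute) the commands for prep, training, and export.

    Flags are assumed from the RAVE README; verify them for your version.
    """
    # Step 2 (prep): convert the collected audio into a training dataset.
    preprocess = ["rave", "preprocess",
                  "--input_path", audio_dir,
                  "--output_path", dataset_dir,
                  "--channels", str(channels)]
    # Step 3 (train): "v2" is one of the configurations named in the README.
    train = ["rave", "train",
             "--config", "v2",
             "--db_path", dataset_dir,
             "--out_path", runs_dir,
             "--name", name,
             "--channels", str(channels)]
    # Step 4 (export): training creates the run folder; look up its actual
    # name inside runs_dir and pass it here. --streaming targets realtime use.
    export = ["rave", "export", "--run", run_dir, "--streaming"]
    return [preprocess, train, export]


commands = rave_commands(
    audio_dir="audio/raw",            # hypothetical: your collected recordings
    dataset_dir="audio/dataset",      # hypothetical: preprocessed output
    runs_dir="runs",                  # hypothetical: where training writes
    run_dir="runs/my_model_run",      # hypothetical: replace with the real run folder
    name="my_model",                  # hypothetical model name
)

for cmd in commands:
    print(shlex.join(cmd))
```

Printing the commands first makes it easy to review paths before starting a training run, which can take many hours. To actually execute a command, it could be passed to `subprocess.run(cmd, check=True)`.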

Why Try RAVE?

Resources