NearestNeighbor Audio Demo

Data from AE-W/batch_outputs

View
Noise (ID)

How to read the IDs

  • Numeric IDs (e.g. 00_000357) come from the SONYC dataset.
  • IDs starting with fold come from the UrbanSound8k dataset.

Audio labels: BG = background noise | FG = generated foreground | Mix = BG + FG

Nearest Neighbor: Baseline outputs (top 10 prompts)