Name: A perceptual evaluation of various commercial models of music source separation, with a focus on model performance against non-traditional source material
Start: 2026-05-29T09:00:00+0200
End: 2026-05-29T11:00:00+0200

Schedule as of May 16, 2022 - subject to change

Default Time Zone is CEST - Central European Summer Time
You can change your view to your time zone (look for "Timezone" on the right)

LIVESTREAMS : A and B

ON DEMAND VIDEOS (previous days)

A perceptual evaluation of various commercial models of music source separation, with a focus on model performance against non-traditional source material

Friday May 29, 2026 9:00am - 11:00am CEST

Foyer Building 303A

Music source separation (MSS) systems are commonly used in
production, remixing,; audio analysis work, yet
questions arise regarding the extent that objective
evaluations of model performance align with human
perceptual evaluations, particularly when tasked with
non-traditional source material (in this case, heavily
processed electronic music). This study seeks to set a
framework for an evaluation of 3 machine learning
approaches to MSS: a spectrogram-domain model (spleeter), a
waveform-domain model (Demucs v2),; a hybrid-domain
model (HTDemucs). Subjective evaluations of model
performance were accumulated via a MUSHRA-style listening
test, while objective evaluations were assessed using
signal-to-distortion ratio (SDR); Frechet Audio Distance
(FAD). Results showed consistent agreement across objective
metrics, with the hybrid-domain model outperforming the
other singular-domain models. Perceptual ratings also
favored the hybrid model, with listeners occasionally
rating the model output as equal or better quality than the
original reference, interestingly. Preliminary analysis
indicates some moderate but insignificant correlations
between the two assessment paths, reinforcing concerns
about relying solely on numerical evaluations when
discussing MSS model performance. Implications for model
design; future evaluation procedures are discussed.

Authors

Sahan Wijewardane

University of Miami

Friday May 29, 2026 9:00am - 11:00am CEST
Foyer Building 303A Technical University of Denmark Asmussens Alle, Building 303A DK-2800 Kgs. Lyngby Denmark

AI and Machine Learning in Audio, Poster | Perception, Poster

Presentation Type Poster

AES Europe 2026

Sahan Wijewardane

Attendees (10)

Get help with the event

AES Europe 2026

Sahan Wijewardane

Attendees (10)

Log in to save this to your schedule, view media, leave feedback and see who's attending!

Get help with the event