Name: Flow-HOA: Generative Joint Optimization for Ambisonics Encoding via Flow Matching
Start: 2026-05-28T16:00:00+0200
End: 2026-05-28T16:30:00+0200

Default Time Zone is CEST - Central European Summer Time

Flow-HOA: Generative Joint Optimization for Ambisonics Encoding via Flow Matching

Thursday May 28, 2026 4:00pm - 4:30pm CEST

Aud 42

Higher-Order Ambisonics (HOA) encoding from sparse,
irregular microphone arrays remains a critical challenge
for consumer spatial audio capture in immersive
communication and XR. We propose Flow-HOA, a generative
framework that jointly optimizes a multi-dimensional
perceptual objective while producing a deployable,
time-invariant bank of Finite Impulse Response (FIR)
encoding filters. Using conditional flow matching, the
model learns to map a simple prior distribution to the
target distribution of FIR filter coefficients. Training is
guided by a composite loss that balances time-domain
waveform fidelity, multi-resolution spectral consistency,
sub-band energy preservation, and spatial directivity
constraints. Objective evaluations demonstrate improved
performance over strong model-based baselines in both
signal fidelity and spatial accuracy metrics. Subjective
listening tests further confirm that Flow-HOA yields higher
overall sound quality with reduced artifacts.
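
As a rough illustration of the conditional flow matching idea mentioned in the abstract, the sketch below builds one training pair and a regression loss. It is not the authors' implementation: the abstract does not state the probability path or the prior, so a straight-line path and a standard Gaussian prior are assumed here, and the conditioning input (for example the microphone signals or array geometry) is omitted. The function names cfm_training_pair, cfm_loss, and sample_prior are hypothetical.

```python
import random


def sample_prior(n, rng):
    """Draw n coefficients from a standard Gaussian prior (an assumption)."""
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


def cfm_training_pair(x0, x1, t):
    """Build one flow matching training pair on a straight-line path.

    x0: sample from the prior, x1: target FIR coefficients, t in [0, 1].
    Returns the interpolated point x_t and the target velocity x1 - x0.
    The linear path is an assumed choice, not taken from the abstract.
    """
    x_t = [(1.0 - t) * a + t * b for a, b in zip(x0, x1)]
    v_target = [b - a for a, b in zip(x0, x1)]
    return x_t, v_target


def cfm_loss(v_pred, v_target):
    """Mean squared error between predicted and target velocities."""
    return sum((p - q) ** 2 for p, q in zip(v_pred, v_target)) / len(v_target)


if __name__ == "__main__":
    rng = random.Random(0)
    x1 = [0.5, -0.25, 0.1, 0.0]      # stand-in for target FIR coefficients
    x0 = sample_prior(len(x1), rng)  # prior sample of the same length
    t = rng.random()                 # time drawn uniformly from [0, 1)
    x_t, v_target = cfm_training_pair(x0, x1, t)
    # A real model would predict the velocity from (x_t, t, conditioning);
    # a zero prediction is used here only to exercise the loss.
    print(cfm_loss([0.0] * len(x1), v_target))
```

In a full system the velocity prediction would come from a trained network, and this regression term would presumably sit alongside the perceptual loss terms the abstract lists; how the authors combine them is not specified here.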

Authors

Tianshu Qu

Xueyang Lv

Yufan Qian

Yuhuan You

AI and Machine Learning in Audio, Lecture | Recording Production and Reproduction, Lecture

Presentation Type: Lecture

AES Europe 2026
