Abstract:
Directional Audio Coding (DirAC) is a proven method for
parametrically representing a 3D audio scene in B-format and is capable
of reproducing it on arbitrary loudspeaker layouts. Although such a
method seems well suited for low bitrate Ambisonic transmission, little
work has been done on the feasibility of building a real system upon
it. In this paper, we present a DirAC-based coding for Higher-Order
Ambisonics (HOA), developed as part of a standardisation effort to
extend the 3GPP EVS codec to immersive communications. Starting from
the first-order DirAC model, we show how to reduce algorithmic delay,
the bitrate required for the parameters and complexity by bringing the
full synthesis in the spherical harmonic domain. The evaluation of the
proposed technique for coding 3rd order Ambisonics at bitrates from 32
to 128 kbps shows the relevance of the parametric approach compared
with existing solutions.
Accepted at ICASSP 2025
Paper