This page contains some examples of parametric SRIR processing. All examples are based on measurements of two variable
acoustics rooms. Room Impulse Responses were measured using different microphone arrays, including the GRAS VI-50 intensity probe, the open array used in [5],
the Eigenmike em32, and the tetrahedral cardioid array Rode NT-SF1. This allows for comparison between some of the methods that are summarized in the table below.
Note that these renderings can only be regarded as snapshots; they are examples that we have produced while reviewing different methods.
It is certainly possible that better parameters or other measurements give better results. Also, as methods inevitably use different microphone arrays
and rendering methods, one can not make general statements by directly comparing these renderings in all regards.
Different Approaches
Here we provide renderings of the same room, created with different parametric SRIR workflows. HO-SIRR, SDM and ASDM can be used for loudspeaker rendering.
To take the full reproduction chain into account, the responses were rendered to a 37 loudspeaker setup placed in an anechoic room at the Aalto Acoustics lab.
In order to present results online, the loudspeaker responses were then convolved with a set of BRIRs, measured at the center of the array.
SDM uses the SDM toolbox, following the demoCostum script. With the settings suggested there, some artefacts can occur due to the equalization, see next section.
Measurement: Open array of omnidirectional microphones (incl. central microphone)
Analysis: Broadband TDoA estimation, equalization
Rendering: NLS to 37 loudspeaker, binauralization using KEMAR HATS BRIR measured in the loudspeaker array
Rendering: Ambisonics encoding to 4th order, mode-matching decoder to 37 loudspeakers, binauralization using KEMAR HATS BRIR measured in the loudspeaker array
[5] desribes a binaural method based on SDM. For this, the responses were not rendered to the loudspeaker array,
but HRIR measurements of the KEMAR HATS were used directly.
Measurement: Open array of omnidirectional microphones (with central microphone)
Analysis: Broadband TDoA estimation
Rendering: Direct binaural rendering using KEMAR HATS HRIR, slope correction (RTMod) in fractional octave bands and Allpass (AP) roughness compensation
The Binaural condition is a reference measurement using the KEMAR HATS. The same diffuse-field EQ as for the virtualized loudspeaker playback was used.
Scroll sideways to find more samples.
Signal
Dry Signal
SDM Toolbox
HO-SIRR
ASDM
RTMod+AP
Omnidirectional
Binaural Reference
Cello
Kickdrum
Specific Aspects
Equalization for Broadband Methods
These examples were created using TDoA-based broadband directional estimation,
nearest loudspeaker rendering to 37 speakers and subsequent binauralization, such as typically done in SDM [1].
Also the measurements used here were done in a variable acoustics room, however taken from a different dataset than the examples above.
In such a workflow, coloration compensation should be applied. These examples demonstrate different coloration compensation methods.
Using the settings suggested in the demo script found in the SDM toolbox can sometimes lead to artefacts, which might explain some of the problems other researchers have observed.
Different Equalization methods were tested, such as filterbank-based equalization, similar to [2]. It was applied to loudspeaker responses instead of Ambisonics channels.
The spectral differences are perceived also in case of the continuous cello signal. Roughness is only heard in case of the transient kickdrum sample (which was also used in [4]).
Scroll sideways to find more samples.
Reverberant Room
Signal
No Equalization
SDM Toolbox Default
Filterbank
Cello
Kickdrum
Dry Room
Signal
No Equalization
SDM Toolbox Default
Filterbank
Cello
Kickdrum
HO-SIRR with varying input order [4]
The following examples were rendered using the complete HO-SIRR [4] workflow. The same variable acoustics room as before was used,
but this time the Eigenmike em32 was used instead of the GRAS intensity probe. With a measurement from a real array, some spectral differences are immanent between varying input orders.
Reverberant Room
Scroll sideways to find more samples.
Signal
Reverberant Room. Input Order = 1
Reverberant Room. Input Order = 4
Cello
Kickdrum
Dry Room
Scroll sideways to find more samples.
Signal
Dry Room Room. InputOrder = 1
Dry Room Room. InputOrder = 2
Dry Room Room. InputOrder = 3
Dry Room Room. In Order = 4
Cello
Kickdrum
Examples from other studies
The signals used for the listening test described in [3] can be found here
here
More examples of HO-SIRR can be found along with the sourcecode
References
[1] S. Tervo, J. P. Tynen, A. Kuusinen, T. Lokki, "Spatial Decomposition Method for Room Impulse Responses,"
J. Audio Eng. Soc., vol. 61, no. 1, pp. 17-28 (2013 Mar.).
[2] S. Tervo, J. Pätynen, N. Kaplanis, "Spatial Analysis and Synthesis of Car Audio System and Car Cabin Acoustics with a Compact Microphone Array,"
J.Audio Eng. Soc, vol. 63, no. 11, pp. 914-925 (2015 Feb.), https://doi.org/10.17743/jaes.2015.0080
[3] M. Zaunschirm, M. Frank, F. Zotter, "Binaural Rendering with Measured Room Responses:
First-Order Ambisonic Microphone vs. Dummy Head,” Applied Sciences, vol. 10, no. 5, p. 1631 (2020 Feb.),
https://doi.org/10.3390/app10051631
[4] L. McCormack, V. Pulkki, A. Politis, O. Scheuregger, M. Marschall, "Higher-order Spatial Impulse Response Rendering:
Investigating the perceived effects of spherical order, dedicated diffuse rendering, and frequency resolution,"
J. Audio Eng. Soc, vol. 68, no. 5, pp. 368-354 (2020 May), https://doi.org/10.17743/jaes.2020.0026.
[5] S. V. A. Gari, P. T. Calamia, and P. W. Robinson, "Optimizations of the Spatial Decomposition Method for Binaural Reproduction,"
J. Audio Eng. Soc., vol. 68, no. 12, p. 18, 2020.