Sensory Profiling of Individual and Non-individual Dynamic Binaural Synthesis Using the Spatial Audio Quality Inventory
Alexander Lindau, Fabian Brinkmann, Stefan Weinzierl (2014): "Sensory Profiling of Individual and Non-individual Dynamic Binaural Synthesis Using the Spatial Audio Quality Inventory" , talk given at the Forum Acusticum, Krakow, Poland, 2014
and Non-individual Dynamic Binaural Synthesis Using the Spatial Audio Quality Inventory Alexander Lindau1, Fabian Brinkmann1, Stefan Weinzierl1 1Audio Communication Group, Technische Universität Berlin 7th Forum Acusticum of the European Acoustics Association Kraków, 2014 Structured Session on “Applied Binaural Signal Processing”
group BRIR measurement moving artificial head anechoic audio BRIR data sets Filtering Fast Convolution Playback listener w. headphones head tracking data Time-variant Fast Convolution
(1995): "Head-Related Transfer Functions of Human Subjects", in: J. Aud. Eng. Soc., 43(5), pp. 300-321 HRTFs from several individuals ITDs from several individuals Data-based Dynamic Binaural Synthesis (DDBS)
on overall perceptual performance of DDBS conducted before Qualify and quantify remaining deviations in a comparative listening test Use Spatial Audio Quality Inventory (SAQI)1 to create questionnaire SAQI: Consensus vocabulary for perceptual evaluation of Spatial Audio technologies Developed during Focus Group of 21 German VA experts Available resources: SAQI Test Manual2 (incl. illustrative stimuli, analysis tools …) Implementation in free Matlab toolbox WhisPER3 Fachgebiet Audiokommunikation Audio communication group 1 Lindau, A. et al. (2014): “A Spatial Audio Quality Inventory”, in: Acta Acustica united with Acustica, 100(5): 984–994 2 DOI: 10.14279/depositonce-1 3 DOI: 10.14279/depositonce-31
[1] Lindau, Alexander; Weinzierl, Stefan (2006): "FABIAN - An instrument for software-based measurement of binaural room impulse responses in multiple degrees of freedom", in: Proc. of the 24th Tonmeister Covention. Leipzig, pp. 621-625 [2] Lindau, Alexander; Brinkmann, Fabian (2012): "Perceptual Evaluation of Headphone Compensation in Binaural Synthesis Based on Non-Individual Recordings", in: J. Audio Eng. Soc., 60(1/2): 54-62 [3] Lindau, A.; Estrella, J.; Weinzierl, S. (2010): “Individualization of Dynamic Binaural Synthesis by Real Time Manipulation of the ITD”, in: Proc. of the 128th AES Convention. London, preprint no. 8088 [4] Erbes, V.; Schultz, F.; Lindau, A.; Weinzierl, S. (2012): „An extraaural headphone system for optimized binaural reproduction“, in: Proc. Of the 38th DAGA. Darmstadt, pp. 313-314 fWonder package for fast dynamic convolution Non-individual DDBS with ITD-individualization[3] BK211 transaural headphones with LMS equalization[4] [1] [2]
Ø 30 yrs.) Task: rate 45 items from German SAQI 1) Non-individual DDBS vs. Reality 2) Individual DDBS vs. Reality 1 s pink noise bursts (0.75 s ISI) Written instructions with oral feedback Fachgebiet Audiokommunikation Audio communication group Example for a SAQI Instruction Example for a SAQI rating scale
Test on “Difference”, p .012 (> for N-DDBS) Some SAQI qualities not or only little perceived (> for I-DDBS) Inter-individual variation, but no overall differences (> for N-DDBS) Inter-individual variation, and overall differences, too (> for N-DDBS)
DDBS Non-individual DDBS: Inter-individual deviations due to differing pinna cues Overall differences: comb-filtered, closer, larger, less well localizable, less compact ¾ Probably due headphone calibration and interaural coherence Individual DDBS: Overall differences: increased LF, less sharp ¾ Probably due to limits of measurement accuracy Fachgebiet Audiokommunikation Audio communication group
temporal variation in binaural information (content, movement) Achieve true 3D localization in anechoic and echoic spaces Improve distance estimation (Georganti et al., IEEE, 2013) Identify coloration, specific timbres (MIR, Psy.-Ac., v. d. Par/SEACEN) Identify acoustic artifacts ... without access to references Bottom-up models of room acoustic qualities (van Dorp-S., JASA, 2013) Model higher order constructs by experience, schema-based memory representations, abstraction … support change over time and based on input received: learning and forgetting Exploit knowledge on interplay of perception and psych. states/traits Support multimodal input Fachgebiet Audiokommunikation Audio communication group