Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Sensory Profiling of Individual and Non-individual Dynamic Binaural Synthesis Using the Spatial Audio Quality Inventory

Sensory Profiling of Individual and Non-individual Dynamic Binaural Synthesis Using the Spatial Audio Quality Inventory

Alexander Lindau, Fabian Brinkmann, Stefan Weinzierl (2014): "Sensory Profiling of Individual and Non-individual Dynamic Binaural Synthesis Using the Spatial Audio Quality Inventory" , talk given at the Forum Acusticum, Krakow, Poland, 2014

Dr. Alexander Lindau

September 11, 2014
Tweet

More Decks by Dr. Alexander Lindau

Other Decks in Science

Transcript

  1. 1 Fachgebiet Audiokommunikation Audio communication group Sensory Profiling of Individual

    and Non-individual Dynamic Binaural Synthesis Using the Spatial Audio Quality Inventory Alexander Lindau1, Fabian Brinkmann1, Stefan Weinzierl1 1Audio Communication Group, Technische Universität Berlin 7th Forum Acusticum of the European Acoustics Association Kraków, 2014 Structured Session on “Applied Binaural Signal Processing”
  2. 2 Fachgebiet Audiokommunikation Audio communication group Møller, H. (1992): "Fundamentals

    of Binaural Technology", in: Applied Acoustics, 36(3/4): 171-218 The ‘Theorem’ of Binaural Technology
  3. 3 Data-based Dynamic Binaural Synthesis (DDBS) Fachgebiet Audiokommunikation Audio communication

    group BRIR measurement moving artificial head anechoic audio BRIR data sets Filtering Fast Convolution Playback listener w. headphones head tracking data Time-variant Fast Convolution
  4. 4 Fachgebiet Audiokommunikation Audio communication group Møller, H. et al.

    (1995): "Head-Related Transfer Functions of Human Subjects", in: J. Aud. Eng. Soc., 43(5), pp. 300-321 HRTFs from several individuals ITDs from several individuals Data-based Dynamic Binaural Synthesis (DDBS)
  5. 6 Profiling Individual and Non-individual DDBS with SAQI ƒ Studies

    on overall perceptual performance of DDBS conducted before ƒ Qualify and quantify remaining deviations in a comparative listening test ƒ Use Spatial Audio Quality Inventory (SAQI)1 to create questionnaire ƒ SAQI: Consensus vocabulary for perceptual evaluation of Spatial Audio technologies ƒ Developed during Focus Group of 21 German VA experts ƒ Available resources: ƒ SAQI Test Manual2 (incl. illustrative stimuli, analysis tools …) ƒ Implementation in free Matlab toolbox WhisPER3 Fachgebiet Audiokommunikation Audio communication group 1 Lindau, A. et al. (2014): “A Spatial Audio Quality Inventory”, in: Acta Acustica united with Acustica, 100(5): 984–994 2 DOI: 10.14279/depositonce-1 3 DOI: 10.14279/depositonce-31
  6. 7 Fachgebiet Audiokommunikation Audio communication group Category Auditory Quality Tone

    Color (8) Tone color bright-dark, High-/Mid-/Low-frequency tone color, Sharpness, Roughness*, Comb filter coloration*, Metallic tone color Tonalness (3) Tonalness, Pitch, Doppler effect Geometry (10) Horizontal/Vertical direction, Front-back position, Distance, Depth, Width, Height, Externalization, Localizability, Spatial disintegration Room (3) Level of reverberation, Duration of reverberation, Envelopment by reverberation Time behavior (7) Pre-/Post-echoes, Temporal disintegration, Crispness, Speed, Sequence of events, Responsiveness Dynamics (3) Loudness, Dynamic range, Dynamic compression effects* Artifacts (7) Pitched/Impulsive/Noise-like artifact, Alien source, Ghost source, Distortion, Tactile vibration General (7) (Overall) Difference, (Overall) Clarity, Speech intelligibility, Naturalness, (Spatial) Presence, Degree-of-Liking, Other * Illustrative audio examples have been created. SAQI: What’s inside the box?
  7. 8 Method: Listening Test Setup Fachgebiet Audiokommunikation Audio communication group

    [1] Lindau, Alexander; Weinzierl, Stefan (2006): "FABIAN - An instrument for software-based measurement of binaural room impulse responses in multiple degrees of freedom", in: Proc. of the 24th Tonmeister Covention. Leipzig, pp. 621-625 [2] Lindau, Alexander; Brinkmann, Fabian (2012): "Perceptual Evaluation of Headphone Compensation in Binaural Synthesis Based on Non-Individual Recordings", in: J. Audio Eng. Soc., 60(1/2): 54-62 [3] Lindau, A.; Estrella, J.; Weinzierl, S. (2010): “Individualization of Dynamic Binaural Synthesis by Real Time Manipulation of the ITD”, in: Proc. of the 128th AES Convention. London, preprint no. 8088 [4] Erbes, V.; Schultz, F.; Lindau, A.; Weinzierl, S. (2012): „An extraaural headphone system for optimized binaural reproduction“, in: Proc. Of the 38th DAGA. Darmstadt, pp. 313-314 ƒ fWonder package for fast dynamic convolution ƒ Non-individual DDBS with ITD-individualization[3] ƒ BK211 transaural headphones with LMS equalization[4] [1] [2]
  8. 9 Method: Listening Test Task ƒ Nine subjects (3 female,

    Ø 30 yrs.) ƒ Task: rate 45 items from German SAQI 1) Non-individual DDBS vs. Reality 2) Individual DDBS vs. Reality ƒ 1 s pink noise bursts (0.75 s ISI) ƒ Written instructions with oral feedback Fachgebiet Audiokommunikation Audio communication group Example for a SAQI Instruction Example for a SAQI rating scale
  9. 12 Fachgebiet Audiokommunikation Audio communication group Results ƒ Wilcoxon Signed-Rank

    Test on “Difference”, p .012 (> for N-DDBS) ƒ Some SAQI qualities not or only little perceived (> for I-DDBS) ƒ Inter-individual variation, but no overall differences (> for N-DDBS) ƒ Inter-individual variation, and overall differences, too (> for N-DDBS)
  10. 13 Conclusions ƒ More and larger deviations found for non-individual

    DDBS Non-individual DDBS: ƒ Inter-individual deviations due to differing pinna cues ƒ Overall differences: comb-filtered, closer, larger, less well localizable, less compact ¾ Probably due headphone calibration and interaural coherence Individual DDBS: ƒ Overall differences: increased LF, less sharp ¾ Probably due to limits of measurement accuracy Fachgebiet Audiokommunikation Audio communication group
  11. 14 Outlook – Implications for future binaural models ƒ Exploit

    temporal variation in binaural information (content, movement) ƒ Achieve true 3D localization in anechoic and echoic spaces ƒ Improve distance estimation (Georganti et al., IEEE, 2013) ƒ Identify coloration, specific timbres (MIR, Psy.-Ac., v. d. Par/SEACEN) ƒ Identify acoustic artifacts ... without access to references ƒ Bottom-up models of room acoustic qualities (van Dorp-S., JASA, 2013) ƒ Model higher order constructs by experience, schema-based memory representations, abstraction … support change over time and based on input received: learning and forgetting ƒ Exploit knowledge on interplay of perception and psych. states/traits ƒ Support multimodal input Fachgebiet Audiokommunikation Audio communication group