Slide 1

pyroomacoustics: Room Simulation / Multichannel Audio Processing
Robin Scheibler, LINE Corporation, Speech Team
Tokyo BISH Bash #1 — 2020/04/07

Slide 2

Self-Introduction: Robin Scheibler
Role: Senior Researcher @ LINE (since 2020/03/01)
Education: Ph.D. in Signal Processing from EPFL (Switzerland)
Previously:
• Postdoc at Tokyo Metropolitan University
• Intern/Researcher at NEC, IBM
• Built mobile Geiger counters for Safecast
• Since 2014, developer of pyroomacoustics
Research:
• Fast transforms (Fourier, Hadamard, sparse, etc.)
• Multi-channel audio processing
• Reproducible research
Hobbies: ski, DIY electronics, fermentation
Homepage: http://www.robinscheibler.org
GitHub: @fakufaku
Twitter: @fakufakurevenge

Slide 3

Outline
1. Pyroomacoustics General
2. Room Simulation
3. Blind Source Separation

Slide 4

Pyroomacoustics General

Slide 5

Pyroomacoustics: Motivation
[Diagram: smart speaker scenario. A target speaker says "Hello!" while a noise source interferes; the device front-end (denoising, beamforming, DOA, separation) cleans the signals before sending them over the internet to services (personal assistant, speech-to-text, search, ...).]

Slide 9

Pyroomacoustics Python Package Summary
Content: room acoustics simulator (C/C++), multi-channel audio processing algorithms
Install: $ pip install pyroomacoustics (binary wheels for Mac and Windows)
Python: 3.7, 3.6, 3.5, (2.7)
Requires: numpy, scipy
Optional: matplotlib, sounddevice, samplerate
Doc: https://pyroomacoustics.readthedocs.io
GitHub: https://github.com/LCAV/pyroomacoustics

Slide 10

Motivation
Development loop: TRY to run an algorithm → LISTEN to its output → REASON and modify it.
Prototyping of multichannel algorithms:
• Without pyroomacoustics: real experiments → time consuming
• With pyroomacoustics: simulation → fast → short development cycle
Data augmentation:
• Without pyroomacoustics: few example RIRs, difficult to collect
• With pyroomacoustics: easy to generate lots of examples

Slide 11

Room Simulation

Slide 12

Sound Propagation in a Room
• Described by the wave equation: $\left( \nabla^2 - \frac{1}{c^2} \frac{\partial^2}{\partial t^2} \right) u(\mathbf{r}, t) = 0$
• Point source in free space: $u(\mathbf{r}, t) = \frac{1}{4\pi \|\mathbf{r} - \mathbf{r}_0\|} \, \delta\!\left( t - \frac{\|\mathbf{r} - \mathbf{r}_0\|}{c} \right)$
• Difficult to solve for arbitrary boundaries (i.e., rooms)
• Precise simulation ⇒ finite element methods (FEM)
• Approximate simulation ⇒ image source model

Slide 13

The Image Source Model
• Walls are perfect reflectors
• The impulse response from an image source is an impulse
• Simple to implement
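To illustrate the last point, here is a minimal sketch (my own illustration, not pyroomacoustics internals) of the first reflection order in a shoebox room: each wall mirrors the source, and each image contributes a delayed, attenuated impulse to the RIR.

import numpy as np

def first_order_images(source, room_dim):
    # Mirror the source across each of the six walls of a shoebox room.
    images = []
    for axis in range(3):
        for wall in (0.0, room_dim[axis]):
            img = np.array(source, dtype=float)
            img[axis] = 2.0 * wall - img[axis]  # reflect across the wall plane
            images.append(img)
    return images

# Each image adds an impulse to the RIR with delay d / c and amplitude
# proportional to beta / (4 * pi * d), d being the image-microphone distance.
source, mic = [2.5, 1.7, 1.69], [5.7, 2.3, 1.4]
for img in first_order_images(source, [10.0, 5.0, 3.2]):
    d = np.linalg.norm(img - np.array(mic))
    print(f"image at {np.round(img, 2)}, delay {1e3 * d / 343.0:.1f} ms")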

Slide 14

Image Source Model: Example
[Figure: animated example of the image source model.]

Slide 19

Implementation Concept
[Diagram: the inputs (Room, Mics, Sources) feed the image source model, whose outputs are the image sources and the RIRs.]
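In code, this pipeline is a short sketch (same room as on the next slide); `compute_rir` runs the image source model and fills `room.rir`, indexed as [microphone][source]:

import numpy as np
import pyroomacoustics as pra

room = pra.ShoeBox([10, 5, 3.2], fs=16000, absorption=0.25, max_order=17)
room.add_source([2.5, 1.7, 1.69])       # no signal needed to compute RIRs
R = np.array([[5.71], [2.31], [1.4]])   # one microphone, shape (3, 1)
room.add_microphone_array(pra.MicrophoneArray(R, fs=room.fs))

room.compute_rir()     # run the image source model
rir = room.rir[0][0]   # RIR from source 0 to microphone 0, a 1-D array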

Slide 20

Pyroomacoustics Example

import numpy as np
import pyroomacoustics as pra

room = pra.ShoeBox(
    [10, 5, 3.2], fs=16000, absorption=0.25, max_order=17
)

# add one source at a time, with its source signal
room.add_source([2.5, 1.7, 1.69], signal=my_signal)

# add microphone array, R.shape == (3, n_mics)
R = np.array([[5.71, 2.31, 1.4], [5.72, 2.32, 1.4]]).T
room.add_microphone_array(pra.MicrophoneArray(R, fs=room.fs))

room.simulate()
output_signal = room.mic_array.signals  # (n_mics, n_samples)

room.plot(img_order=2)  # show room
room.plot_rir()         # show RIR
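Here `my_signal` is the dry source signal as a 1-D array; it can be loaded, for instance (hypothetical path), from a mono WAV file matching the room's sampling rate:

from scipy.io import wavfile

fs, my_signal = wavfile.read("path/to/dry_speech.wav")  # fs should equal room.fs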

Slide 21

Image Sources and Impulse Response
[Figure: image sources and the simulated impulse response as the maximum reflection order grows; orders 0, 1, 2, 3, 10, and 30 yield t60 values of roughly 2, 25, 45, 65, 245, and 712 ms.]
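Such t60 values can be checked directly on a simulated RIR; a sketch assuming the `measure_rt60` helper from `pyroomacoustics.experimental.rt60` (present in recent versions):

import numpy as np
import pyroomacoustics as pra
from pyroomacoustics.experimental.rt60 import measure_rt60

room = pra.ShoeBox([10, 5, 3.2], fs=16000, absorption=0.25, max_order=30)
room.add_source([2.5, 1.7, 1.69])
R = np.array([[5.71], [2.31], [1.4]])
room.add_microphone_array(pra.MicrophoneArray(R, fs=room.fs))
room.compute_rir()

rt60 = measure_rt60(room.rir[0][0], fs=room.fs)  # in seconds
print(f"measured t60: {1e3 * rt60:.0f} ms")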

Slide 27

Choosing Parameters for a Desired T60

room = pra.ShoeBox(
    [10, 5, 3.2], fs=16000, absorption=0.25, max_order=17
)

absorption: use Sabine's formula $T_{60} = \frac{24 \ln 10}{c} \cdot \frac{V}{S a}$ (V: volume, S: surface, c: speed of sound) and solve for the absorption a.
max_order: the image sources of a given order are contained in a diamond; pick the smallest integer such that the sphere of radius $c \cdot T_{60}$ is enclosed.
Code ref: https://github.com/fakufaku/bss_speech_dataset/blob/master/room_builder.py#L12
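A minimal sketch of this recipe (a hypothetical helper following the formula above; recent pyroomacoustics versions ship a similar `inverse_sabine` function):

import numpy as np

def absorption_and_max_order(t60, room_dim, c=343.0):
    L = np.array(room_dim, dtype=float)
    V = np.prod(L)                                # room volume
    S = 2 * (L[0]*L[1] + L[1]*L[2] + L[0]*L[2])   # total wall surface
    a = 24 * np.log(10) / c * V / (S * t60)       # Sabine, solved for a
    # order-n images extend roughly n * min(L) from the room, so pick the
    # smallest order whose diamond encloses a sphere of radius c * t60
    max_order = int(np.ceil(c * t60 / np.min(L)))
    return a, max_order

a, max_order = absorption_and_max_order(0.3, [10, 5, 3.2])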

Slide 28

Samples
Coming soon [branch next_gen_simulator]:
• Ray tracing (complex geometries, scattering)
• Frequency-dependent absorption
• Air absorption
[Audio demo: a dry sound rendered in a small bedroom, a medium office, and a large hall, simulated with plain ISM, a hybrid method with scattering, and a hybrid method with scattering and air absorption.]

Slide 29

Data Augmentation for Training a Keyword Spotter
Courtesy of Eric Bezzam, Snips (now part of Sonos)
Task: keyword spotting, i.e., recognize "Hey Snips!"
Clean samples: recordings of the keyword ("Hey Snips!")
Noise samples: MUSAN (sounds) and Librispeech (speech)
Test samples: hold-out set of "Hey Snips!" re-recorded
Prior art [1]: ISM with randomly sampled T60 (ISM T60)

[1] Chanwoo Kim et al., "Generation of large-scale simulated utterances in virtual rooms to train deep-neural networks for far-field speech recognition in Google Home," Interspeech, 2017.
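A minimal sketch of this kind of augmentation (file paths, sampling ranges, and SNRs are illustrative assumptions, not the study's actual settings):

import numpy as np
import pyroomacoustics as pra
from scipy.io import wavfile

fs, keyword = wavfile.read("hey_snips.wav")  # clean keyword recording
_, noise = wavfile.read("musan_noise.wav")   # noise, at least as long as the output

rng = np.random.default_rng(0)
room_dim = rng.uniform([3.0, 3.0, 2.5], [10.0, 8.0, 4.0])  # random room size
absorption = rng.uniform(0.1, 0.5)                         # random wall absorption

room = pra.ShoeBox(room_dim, fs=fs, absorption=absorption, max_order=17)
room.add_source(rng.uniform(0.5, room_dim - 0.5), signal=keyword.astype(float))
mic = rng.uniform(0.5, room_dim - 0.5)[:, None]            # shape (3, 1)
room.add_microphone_array(pra.MicrophoneArray(mic, fs=fs))
room.simulate()

reverberant = room.mic_array.signals[0]
# mix with noise at a random SNR
snr_db = rng.uniform(2.0, 5.0)
n = noise[: len(reverberant)].astype(float)
g = np.sqrt(np.var(reverberant) / (np.var(n) * 10 ** (snr_db / 10)))
augmented = reverberant + g * n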

Slide 30

Results
Configurations: ISM T60 (prior art, ISM only), ISM MAT, HYB MAT, HYB FREQ, HYB FREQ AIR; the features compared are ISM-only vs. hybrid simulation, random materials, scattering, multi-frequency absorption, and air absorption.

SNR    Noise   | ISM T60  ISM MAT  HYB MAT  HYB FREQ  HYB FREQ AIR
clean  -       | 0.92%    0.58%    0.53%    0.46%     0.42%
5 dB   sounds  | 9.42%    7.14%    7.25%    6.04%     5.42%
5 dB   speech  | 16.0%    13.1%    14.7%    12.5%     12.5%
2 dB   sounds  | 16.8%    14.6%    14.2%    12.3%     11.2%
2 dB   speech  | 30.4%    27.1%    29.9%    26.0%     26.6%
Avg. rel. improv. | -     20.8%    18.2%    29.9%     33.0%

Table: false rejection rates (in percent) at a false alarm rate of 0.125 per hour (three false alarms per day).

Slide 31

Blind Source Separation

Slide 32

Background: Frequency-Domain Blind Source Separation
Regimes (M microphones, K sources): underdetermined (M < K), determined (M = K), overdetermined (M > K).
[Diagram: the microphone spectrograms (time × frequency) are unmixed into the separated source spectrograms.]
Advantages of BSS:
• No prior information required, only signals!
• Reliable enhancement via separation
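In the determined case (M = K), the model underlying these methods is, per frequency bin f and time frame t (standard notation, not from the slides):

$\mathbf{x}(f, t) = \mathbf{A}(f)\, \mathbf{s}(f, t), \qquad \hat{\mathbf{s}}(f, t) = \mathbf{W}(f)\, \mathbf{x}(f, t)$

where $\mathbf{x}(f, t)$ stacks the M microphone STFT coefficients, $\mathbf{A}(f)$ is the unknown mixing matrix, and BSS estimates the demixing matrices $\mathbf{W}(f) \approx \mathbf{A}(f)^{-1}$, up to per-source scaling and permutation, from the observed signals alone.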

Slide 39

BSS Algorithms in pyroomacoustics

Algorithm                 | Source model
AuxIVA [1] / OverIVA [2]  | spherical
SparseAuxIVA [3]          | spherical
ILRMA [4]                 | low-rank
FastMNMF [5]              | low-rank

[1] N. Ono, "Stable and fast update rules for independent vector analysis based on auxiliary function technique," WASPAA, 2011.
[2] R. Scheibler and N. Ono, "Independent vector analysis with more microphones than sources," WASPAA, 2019.
[3] J. Janský et al., "A computationally cheaper method for blind speech separation based on AuxIVA and incomplete demixing transform," IWAENC, 2016.
[4] D. Kitamura et al., "Determined blind source separation unifying independent vector analysis and nonnegative matrix factorization," IEEE/ACM Trans. ASLP, 2016.
[5] K. Sekiguchi et al., "Fast multichannel source separation based on jointly diagonalizable spatial covariance matrices," EUSIPCO, 2019.
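These all expose a similar interface on the STFT-domain tensor (the full pipeline is on the next slide), so swapping separators is a one-line change; a sketch with a dummy input, assuming the `auxiva`, `ilrma`, and `fastmnmf` entry points of `pra.bss`:

import numpy as np
import pyroomacoustics as pra

# dummy STFT tensor of shape (n_frames, n_freq, n_channels); in practice,
# obtain it with stft.analysis as on the next slide
X = np.random.randn(200, 257, 2) + 1j * np.random.randn(200, 257, 2)

Y = pra.bss.auxiva(X, n_iter=20)    # spherical source model
Y = pra.bss.ilrma(X, n_iter=20)     # low-rank (NMF) source model
Y = pra.bss.fastmnmf(X, n_iter=20)  # low-rank, jointly diagonalizable covariances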

Slide 40

Example in Pyroomacoustics

import pyroomacoustics as pra
from pyroomacoustics.transform import stft
from scipy.io import wavfile

fs, audio = wavfile.read("path/to/multichannel_audio.wav")

# STFT parameters
nfft = 4096                # 256 ms frames @ 16 kHz
hop = nfft // 4            # 64 ms shifts
win_a = pra.hamming(nfft)  # analysis window function
win_s = stft.compute_synthesis_window(win_a, hop)

# X.shape == (n_frames, n_freq, n_channels)
X = stft.analysis(audio, nfft, hop, win=win_a)

# separation, n_iter ~ 10 times n_channels
Y = pra.bss.auxiva(X, n_iter=30)

audio_output = stft.synthesis(Y, nfft, hop, win=win_s)
wavfile.write("path/to/output/file.wav", fs, audio_output)

Slide 41

Example of Separated Outputs

Method    | Time    | SIR (dB): source 1 | source 2 | source 3
Clean     | -       | ∞     | ∞     | ∞
Mix       | -       | -2.8  | -2.89 | -2.75
AuxIVA    | 6.33 s  | 10.13 | 15.95 | 11.56
ILRMA     | 8.84 s  | 10.48 | 16.08 | 12.03
FastMNMF  | 35.9 s  | 11.38 | 17.12 | 10.60
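SIR tables like this one can be computed from reference and separated time-domain signals; a sketch using the external mir_eval package (not part of pyroomacoustics; the arrays here are placeholders):

import numpy as np
from mir_eval.separation import bss_eval_sources

refs = np.random.randn(3, 16000)       # reference sources, (n_sources, n_samples)
estimates = np.random.randn(3, 16000)  # separated outputs, same shape

sdr, sir, sar, perm = bss_eval_sources(refs, estimates)
print("SIR per source (dB):", sir)     # `perm` gives the matching permutation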

Slide 42

Conclusion
pyroomacoustics:
• Simulation of room acoustics
• Reference implementations of multichannel processing algorithms
• Data augmentation effective for ASR/KWS systems
• Rapid prototyping and a faster experiment cycle
What's next?
• Release next_gen_simulator (ray tracing, air absorption)
• Desired: directional microphones and sources
• Help is very welcome! https://github.com/LCAV/pyroomacoustics