1	IMPULSE RESPONSE MEASUREMENTS BY EXPONENTIAL SINE SWEEPS
2	Time Line
3	Starting point: room impulse response
4	The Past
5	Traditional measurement methods Pulsive sources: ballons, blank pistol
6	Example of a pulsive impulse response
7	Loudspeaker as sound source A loudspeaker is fed with a special test signal x(t), while a microphone records the room response A proper deconvolution technique is required for retrieving the impulse response h(t) from the recorded signal y(t)
8	Measurement process The desidered result is the linear impulse response of the acoustic propagation h(t). It can be recovered by knowing the test signal x(t) and the measured system output y(t). It is necessary to exclude the effect of the not-linear part K and of the background noise n(t).
9	Electroacoustical methods Different types of test signals have been developed, providing good immunity to background noise and easy deconvolution of the impulse response: MLS (Maximum Lenght Sequence, pseudo-random white noise) TDS (Time Delay Spectrometry, which basically is simply a linear sine sweep, also known in Japan as “stretched pulse” and in Europe as “chirp”) ESS (Exponential Sine Sweep) Each of these test signals can be employed with different deconvolution techniques, resulting in a number of “different” measurement methods Due to theoretical and practical considerations, the preference is nowadays generally oriented for the usage of ESS with not-circular deconvolution
10	The first MLS apparatus - MLSSA MLSSA was the first apparatus for measuring impulse responses with MLS
11	More recently - the CLIO system The Italian-made CLIO system has superseded MLSSA for most low-cost electroacoustics applications (measurement of loudspeakers, quality control)
12	The first TDS apparatus - TEF Techron TEF 10 was the first apparatus for measuring impulse responses with TDS Subsequent versions (TEF 20, TEF 25) also support MLS
13	The Present
14	Today’s Hardware: PC and audio interface
15	Hardware: loudspeaker & microphone
16	The first ESS system - AURORA
17	Exponential Sine Sweep method x(t) is a band-limited sinusoidal sweep signal, which frequency is varied exponentially with time, starting at f₁ and ending at f₂.
18	Test Signal – x(t)
19	Measured signal - y(t) The not-linear behaviour of the loudspeaker causes many harmonics to appear
20	Inverse Filter – z(t) The deconvolution of the IR is obtained convolving the measured signal y(t) with the inverse filter z(t) [equalized, time-reversed x(t)]
21	Deconvolution of Exponential Sine Sweep The “time reversal mirror” technique is employed: the system’s impulse response is obtained by convolving the measured signal y(t) with the time-reversal of the test signal x(-t). As the log sine sweep does not have a “white” spectrum, proper equalization is required
22	Result of the deconvolution The last impulse response is the linear one, the preceding are the harmonics distortion products of various orders
23	Maximum Length Sequence vs. Exp. Sine Sweep
24	Distortion measurements A headphone was driven with a 1 V RMS signal, causing severe distortion in the small loudspeaker. The measurement was made placing the headphone on a dummy head. Measurements: ESS and traditional sine at 1 kHz
25	Distortion measurements Comparison between: traditional distortion measurement with fixed-frequency sine (the black histogram) the new exponential sweep (the 4 narrow, coloured lines)
26	Spatial analysis by directive impulse responses The initial approach was to use directive microphones for gathering some information about the spatial properties of the sound field “as perceived by the listener” Two apparently different approaches emerged: binaural dummy heads and pressure-velocity microphones:
27	IACC “objective” spatial parameter It was attempted to “quantify” the “spatiality” of a room by means of “objective” parameters, based on 2-channels impulse responses measured with directive microphones The most famous “spatial” parameter is IACC (Inter Aural Cross Correlation), based on binaural IR measurements
28	Lateral Fraction (LF) spatial parameter Another “spatial” parameter is the Lateral Fraction LF This is defined from a 2-channels impulse response, the first channel is a standard omni microphone, the second channel is a “figure-of-eight” microphone:
29	Are binaural measurents reproducible? Experiment performed in anechoic room - same loudspeaker, same source and receiver positions, 5 binaural dummy heads
30	Are IACC measurents reproducible? Diffuse field - huge difference among the 4 dummy heads
31	Are LF measurents reproducible? Experiment performed in the Auditorium of Parma - same loudspeaker, same source and receiver positions, 4 pressure-velocity microphones
32	Are LF measurents reproducible? At 25 m distance, the scatter is really big
33	3D Impulse Response (Gerzon, 1975)
34	3D extension of the pressure-velocity measurements The Soundfield microphone allows for simultaneous measurements of the omnidirectional pressure and of the three cartesian components of particle velocity (figure-of-8 patterns)
35	Directivity of transducers
36	A-format microphone arrays Today several alternatives to Soundfield microphones do exists. All of them are providing “raw” signals from the 4 capsules, and the conversion from these signals (A-format) to the standard Ambisonic signals (B-format) is performed digitally by means of software running on the computer
37	The Waves project (2003) The original idea of Michael Gerzon was finally put in practice in 2003, thanks to the Israeli-based company WAVES More than 50 theatres all around the world were measured, capturing 3D IRs (4-channels B-format with a Soundfield microphone) The measurments did also include binaural impulse responses, and a circular-array of microphone positions More details on WWW.ACOUSTICS.NET
38	Problems with ESS measurements
39	Pre-ringing at high and low frequency Pre-ringing at high frequency due to improper fade-out
40	Pre-ringing at high and low frequency Perfect Dirac’s delta after removing the fade-out
41	Pre-ringing at high and low frequency Pre-ringing at low frequency due to a bad sound card featuring frequency-dependent latency
42	Kirkeby inverse filter The Kirkeby inverse filter is computed inverting the measured IR
43	Pre-ringing at high and low frequency Convolving the time-smeared IR with the Kirkeby compacting filter, a very sharp IR is obtained
44	Problems with ESS measurements
45	Equalization of the whole system An anechoic measurement is first performed
46	Equalization of the whole system A suitable inverse filter is generated with the Kirkeby method by inverting the anechoic measurement
47	Equalization of the whole system The inverse filter can be either pre-convolved with the test signal or post-convolved with the result of the measurement Pre-convolution usually reduces the SPL being generated by the loudspeaker, resulting in worst S/N ratio On the other hand, post-convolution can make the background noise to become “coloured”, and hence more perciptible The resulting anechoic IR becomes almost perfectly a Dirac’s Delta function:
48	Problems with ESS measurements
49	Sensitivity to abrupt pulsive noises Often a pulsive noise occurs during a sine sweep measurement
50	Sensitivity to abrupt pulsive noises After deconvolution, the pulsive sound causes untolerable artifacts in the impulse response
51	Sensitivity to abrupt pulsive noises Several denoising techniques can be employed: Brutely silencing the transient noise Employing the specific “click-pop eliminator” plugin of Adobe Audition Applying a narrow-passband filter around the frequency which was being generated in the moment in which the pulsive noise occurred The third approach provides the better results:
52	Problems with ESS measurements
53	Clock mismatch When the measurement is performed employing devices which exhibit signifcant clock mismatch between playback and recording, the resulting impulse response is “skewed” (stretched in time):
54	Clock mismatch It is possible to re-pack the impulse response employing the already-described approach based on the usage of a Kirkeby inverse filter:
55	Clock mismatch However, it is always possible to generate a pre-stretched inverse filter, which is longer or shorter than the “theoretical” one - by proper selection of the lenght of the inverse filter, it is possible to deconvolve impulse responses which are almost perfectly “unskewed”:
56	Problems with ESS measurements
57	High-frequency cancellation due to averaging When several impulse response measurements are synchronously-averaged for improving the S/N ratio, the late part of the tail cancels out, particularly at high frequency, due to slight time variance of the system
58	High-frequency cancellation due to averaging However, if averagaing is performed properly in spectral domain, and a single conversion to time domain is performed after averaging, this artifact is significantly reduced The new “cross Functions” plugin can be used for computing H1:
59	The Future
60	The Future 1 : better spatial information Microphone arrays capable of synthesizing aribitrary directivity patterns Advanced spatial analysis of the sound field employing spherical harmonics (Ambisonics - 1° order or higher) Loudspeaker arrays capable of synthesizing arbitrary directivity patterns Generalized solution in which both the directivities of the source and of the receiver are represented as a spherical harmonics expansion
61	How to get better spatial resolution? The answer is simple: analyze the spatial distribution of both source and receiver by means of higher-order spherical harmonics expansion Spherical harmonics analysis is the equivalent, in space domain, of the Fourier analysis in time domain As a complex time-domain waveform can be though as the sum of a number of sinusoidal and cosinusoidal functions, so a complex spatial distribution around a given notional point can be expressed as the sum of a number of spherical harmonic functions
62	Higher-order spherical harmonics expansion
63	3°-order microphone (Trinnov - France) Arnoud Laborie developed a 24-capsule compact microphone array - by means of advanced digital filtering, spherical ahrmonic signals up to 3° order are obtained (16 channels)
64	4°-order microphone (France Telecom) Jerome Daniel and Sebastien Moreau built samples of 32-capsules spherical arrays - these allow for extractions of microphone signals up to 4° order (25 channels)
65
66	Verification of high-order patterns Sebastien Moreau and Olivier Warusfel verified the directivity patterns of the 4°-order microphone array in the anechoic room of IRCAM (Paris)
67	What about source directivity ? Current 3D IR sampling is still based on the usage of an “omnidirectional” source The knowledge of the 3D IR measured in this way provide no information about the soundfield generated inside the room from a directive source (i.e., a musical instrument, a singer, etc.) Dave Malham suggested to represent also the source directivity with a set of spherical harmonics, called O-format - this is perfectly reciprocal to the representation of the microphone directivity with the B-format signals (Soundfield microphone). Consequently, a complete and reciprocal spatial transfer function can be defined, employing a 4-channels O-format source and a 4-channels B-format receiver:
68	Directivity of transducers LookLine D200 dodechaedron
69	High-order sound source Adrian Freed, Peter Kassakian, and David Wessel (CNMAT) developed a new 120-loudspeakers, digitally controlled sound source, capable of synthesizing sound emission according to spherical harmonics patterns up to 5° order.
70	Technical details of high-order source Class-D embedded amplifiers
71	Accuracy of spatial synthesis The spatial reconstruction error of a 120-loudspeakers array is frequency dependant, as shown here:
72	Complete high-order MIMO method Employing massive arrays of transducers, it will be feasible to sample the acoustical temporal-spatial transfer function of a room Currently available hardware and software tools make this practical only up to 4° order, which means 25 inputs and 25 outputs A complete measurement for a given source-receiver position pair takes approximately 10 minutes (25 sine sweeps of 15s each are generated one after the other, while all the microphone signals are sampled simultaneously) However, it has been seen that real-world sources can be already approximated quite well with 2°-order functions, and even the human HRTF directivites are reasonally approximated with 3°-order functions.
73	The Future 2 : not linear systems Often impulse responses are measured for being employed in auralization systems (i.e. Waves) Linear convolution is employed for this This method indeed does not sound realistic, as it removes any not-linear effect We can now exploy the results of an ESS measurement for performing a not-linear convolution For this, indeed, the measured “harmonic orders IRs” have to be transformed into corresponding Volterra kernels
74	Theory of nonlinear convolution The basic approach is to convolve separately, and then add the result, the linear IR, the second order IR, the third order IR, and so on. Each order IR is convolved with the input signal raised at the corresponding power:
75	From measured IRS to Volterra Kernels A simple linear system allows for computation of Volterra Kernels starting from the measured “harmonic orders” IRs
76	Efficient non-linear convolution As we have got the Volterra kernels already in frequency domain, we can efficiently use them in a multiple convolution algorithm implemented by overlap-and-save of the partitioned input signal:
77	Software implementation A small Italian startup company, Acustica Audio, developed a VST plugin based on the Diagonal Volterra Kernel method, named Nebula
78	Software implementation Nebula is also equipped with a companion application, Nebula Sampler, designed for automatizing the measurement of a not linear system with the Exponential Sine Sweep method:
79	Time-variant systems Nebula can sample also time-variant systems, such as flangers or compressors, by repeating the sine sweep measurement several times, along a repetition cycle or changing the signal amplitude
80	Reconstruction accuracy Nebula is actually limited to Volterra kernels up to 5^th order, and consequently does not emulates high-frequency harmonics:
81	Audible evaluation of the performance Original signal
82	Subjective listening test A/B comparison Live recording & non-linear auralization 12 selected subjects 4 music samples 9 questions 5-dots horizontal scale Simple statistical analysis of the results A was the live recording, B was the auralization, but the listener did not know this
83	Results Statistical parameters – more advanced statistical methods would be advisable for getting more significant results
84	Another example Original signal
85	Conclusions The sine sweep method revealed to be systematically superior to the MLS & TDS methods for measuring electroacoustical impulse responses The ESS method also allows for measurement of not-linear devices and to assess harmonic distortion Current limitation for spatial analysis in room acoustis is due to transducers (loudspeakers and microphones) A new generation of loudspeakers and microphones, made of massive arrays, is under development. The “harmonic orders” impulse responses obtained by the exponential sine sweep method can be used for not-linear convolution, which yields more realistic auralization