Audio Measurement Basics

Why Measure Audio?

It is virtually impossible to get the most out of your sound system without a way to objectively measure its perfomance. It used to be that the knowledge and expensive hardware required to make such measurements were out of the reach of most home audio enthusiasts.

Now, with ETF, these sophisticated measurements can be taken by anyone with a computer! Many audiophiles are finding that a few hours of acoustic measurement result in greater improvement sound quality than could be obtained by spending thousands of dollars upgrading their equipment.

Measurement Fundamentals

The first, most important measurement of a given speakers performance in a given room is that of impulse response, also referred to as transient response in some books and magazines.

"Impulse" refers to a very short, very loud sound, such as a gunshot. The ideal impulse requires an infinite amount of energy delivered to a system over a time period approaching 0 seconds. In the real world, of course, this is not possible, so an impulse is approximated by a very short time duration high amplitude sound.

"Response" refers to the way the impulse sounds in the room. The response will depend both on the speaker producing the impulse, and the sound qualities of the room itself. In the case of loudspeaker measurement, the impulse can take the form of a very high voltage placed across the loudspeaker terminals for a very short period of time, resulting in a loud "click" from the loudspeaker. The speaker/room response to this impulse is recorded on a chart recorder with a microphone. An example of an impulse response chart is shown below


Fig 1 An example of an Impulse Response curve


The vertical axis of the chart shows the amplitude of the sound waves. You can think of this axis measuring sound volume (technically, this is not 100% true, but it's a good enough description for our purposes). On the horizontal axis is time, measured in Milliseconds (1/1000 of a second). The initial "spike" on the graph, at 0 ms, is caused by the sound of the impulse itself reaching the recording microphone. The oscillations and spikes that follow are echoes as this sound reverberates through the room and, to a lesser extent, within the speaker itself.

The problem is, impulse response measurements are very difficult to interpret. In the case of loudspeakers and rooms, it is difficult to differentiate "good sound" from sound quality that may not be considered as good just by looking at the graph. For this reason, other methods have been developed to display this sound data in more informative ways. It should be stressed that all the different types of charts and graphs people have devised to look at sound data are simply different ways of looking at the same information. No one representation is inherently superior to another; they simply tend to emphasise different aspects of the sound.


For example, the impulse response shown above shows us primarily when sounds from the impulse arrive at the recording microphone. It tells us nothing about the relative strengths of the various frequencies in the sound.

For frequency information, we turn to what is far and away the most popular way of presenting audio data: frequency response, also known sometimes as the "steady state response". The frequency response curve is generated by applying a mathematical operation known as the Fourier Transform to the impulse response. ETF does this for you automatically, so you really have no need to understand Fourier transforms. However, if you're interested in learning more, our list of reference materials will point you in the right direction.

Frequency response is the undisputed king of audio measurement. Without good frequency response, high fidelity sound is not possible. Below you will see a frequency response chart taken from the demonstration room project you will find at the Acoustisoft support page. It shows the low frequency response delivered by a particular subwoofer at 8 different positions in a room.

      


Fig 2: Low frequency response for a subwoofer measured in 8 positions around a room

   

You will notice two obvious differences between this and the impulse response graph. First, the horizontal axis now represents frequency, not time. This representation tells us nothing of when the sounds arrived. Instead, it adds up the amount of sound energy of each frequency that arrived at the microphone over a given period of time, known as the gate time. Usually, this gate time is set long enough to include the first big spike you see in the impulse response (the sound coming directly from the speaker), but short enough that all the subsequent spikes (the echoes from around the room) are ignored. For this reason, the frequency response as shown above is primarily the response of the speaker, with room effects ignored. This is useful in that it allows comparisons of different speakers independent of the rooms they happen to be measured in. < /p>

So what do we look for in a frequency response curve? Well, ideally, we want the curve that is the flattest (we want the speaker deliver equal amounts of sound energy at all frequencies), and the highest (delivering the most sound energy).

In this case, the flattest curve seems to be number 5, while the highest is number 1. We could decide to accept speaker position 5, as it demonstrates the best natural response, or start playing around with an equalizer, turning various frequencies up or down to try to "flatten" curve 1. In either case, our end goal is to have the flattest frequency response possible, as that will deliver the best sound.

The other thing you will notice is that frequency response doesn't oscillate back and forth from positive to negative the way the impulse response does. It is in the nature of waves to oscillate. As a more direct measurement of the sound waves, this oscillation is seen in the impulse response. As already mentioned, however, the frequency response sums the amount of energy of each frequency that arrives at the microphone over a given time interval, rather than displaying the state of the sound wave at a given time. For this reason, no oscillation is observed.

Many examples of impulse response and frequency response measurements are included in various places in ETF literature.


Fig 3: The steps in converting an impulse response to a frequency response

MLS Measurement

Earlier, we stated that placing a high voltage across the speakers' terminals for a short period of time could approximate an impulse. Unfortunately, this is easier said than done. The practical problems in applying a short duration high voltage signal to a loudspeaker are that it is inconvenient and requires specialize (and possibly dangerous) hardware. It is also necessary to take several such measurements and average them to compensate for background noise. Fortunately, the science of audio measurement has advanced to the point that test signals such as this are no longer required.

Over the years, different types of test signal have come into popularity, such as pink noise excitation, time delay spectroscopy, and periodic impulse excitation among others. Each in turn has made improvements measurement quality and ease of use.

ETF employs he current "state of the art" measurement technique, called Maximum Length Sequence, or MLS.

This is the test signal type of choice for many instrumentation manufacturers because of its relative ease of implementation, low cost of required hardware, and its high immunity to measurement artefacts such as noise or distortion. It also provides more actual information about system response than any previous measurement technique.

An MLS measurement begins with several seconds of white noise (that is, noise of equal energy at all audible frequencies). The response to this signal is recorded, and converted it to an impulse response by means of a bit of mathematical magic known as the Hadamard transform. The MLS method is very convenient in that the MLS signal can be accurately applied with very low cost hardware such as in a low cost computer "game" type sound card, or even single bit devices in specialized hardware.

MLS makes a very accurate recording of speaker / room impulse response. This allows the response to any form of room excitation to be determined.

Spectral Shaping

MLS signals are a form of "white noise". White noise contains a signal level that is identical for all frequencies just as the colour white contains all other colours in the visible spectrum. The fact that the MLS test signal contains energy at all of the possible frequencies of interest makes it possible to calculate an equivalent impulse response of a measured system.

A test signal such as this can damage loudspeaker drivers if not used carefully. Excessive high frequency energy can damage high frequency drivers. This is the reason for the existence of pink noise. Pink noise has decreasing energy levels with as frequency is increased, making it possible to play the signal louder through the audio system before damage to high frequency units occurs.

This disadvantage of MLS is easily compensated for when using a two-channel measurement system such as ETF. ETF measures a system response through one channel and measures only the sound card response through the other channel. This allows ETF to have knowledge of the sound card response and therefore correct the measurement for sound card anomalies. This also means that sound card tone controls may be adjusted to boost low frequencies and attenuate high frequencies. This technique is known as spectral shaping. The test signal may be shaped with an equalizer or tone control to emphasise or de-emphasise any frequency region. The advantages of the other system types are therefore available in ETF through this technique.


HOME - DOWNLOAD - PURCHASE - SUPPORT - DEMO ROOM - AUDIO LINKS - CONTACT US

Copyright (c) 2000 - Acoustisoft Incorporated - All Rights Reserved