B&O Tech: Naked Truth IV

#29 in a series of articles about the technology behind Bang & Olufsen loudspeakers

 

Sorry – I’ve been busy lately, so I haven’t been too active on the blog.

 

Here are some internal shots of the BeoLab 17 and BeoLab 20 loudspeakers. As you can see in the shot of the back of the BeoLab 17, the entire case is the enclosure is for the woofer. The tweeter has its own enclosure which seals it from the woofer cabinet.

 

GGM0798

 

 

 

GGM0793

 

GGM0789

 

GGM0803

 

What’s not obvious in the photos of the BeoLab 20 is that the midrange and woofer cabinets are separate sealed boxes. There is a bulkhead that separates the two enclosures cutting across the loudspeaker just below the midrange driver.

 

 

GGM0806

 

GGM0814

 

GGM0837

 

GGM0827

 

GGM0822

B&O Tech: Reading Spec’s – Part 1

#28 in a series of articles about the technology behind Bang & Olufsen loudspeakers

 

Introduction

Occasionally, I read other people’s blogs and forum postings to see what’s happening outside my little world. This week, I came across this page in which one of the contributors made some comments about B&O’s loudspeaker specifications – or, more precisely, the lack of them – or the lack of precision in them – particularly with respect to the Frequency Range specifications.

So, this posting will be an attempt to explain how we determine the frequency range of our loudspeakers.

 

Bandwidth (also known as “Frequency Range”)

Ask any first-year electrical engineering student to explain how to find the “bandwidth” of an audio product and they’ll probably all tell you the same thing – which will be something like the following:

  1. Measure the magnitude response (what many people call the “frequency response”) of the product.
  2. Find the peak in the magnitude response
  3. Going upwards in frequency, find the point where the magnitude drops by 3.01 dB relative to the peak.
  4. Going downwards in frequency, find the point where the magnitude drops by 3.01 dB relative to the peak.
  5. Subtract the lower frequency from the upper frequency and you get the bandwidth.

(If you’re curious, the 3.01 dB threshold is chosen because -3.01 dB is equivalent to one-half of the power of the peak. This is why the -3.01 dB points are also known as a the “half-power points”)

Fig 1: An example of how bandwidth is measured on a typical audio device. See text for more information
Fig 1: An example of how bandwidth is measured on a typical audio device. See text above for more information

 

Figure 1 shows a pretty typical looking curve for an audio device (admittedly, not a very good one…). The magnitude response is flat enough, and it extends down to 34 Hz and up to 15.6 kHz.

This same technique can be used to find the bandwidth of an audio processing device, as is shown in Figure 2.

 

Fig 2: An example of how bandwidth is measured on a filter - in this case a peaking filter.
Fig 2: An example of how bandwidth is measured on a filter – in this case a peaking filter. The Bandwidth of this filter ranges from 770 Hz to 1300 Hz – a total bandwidth of 530 Hz.

Loudspeaker Frequency Range

Let’s try applying that same method used for audio “black boxes” on a loudspeaker. We’ll measure the on-axis magnitude response of the loudspeaker in a free field (one without reflections), and find the frequencies where the magnitude drops 3.01 dB below the peak value. An example of this is shown below.

Fig 3: An unsmoothed on-axis magnitude response of a real loudspeaker. If we use the same technique to measure the bandwidth on this loudspeaker as we did in Fig 2, the bandwidth will range from 7.3 kHz to 13.9 kHz.
Fig 3: An unsmoothed on-axis magnitude response of Loudspeaker #1. If we use the same technique to measure the bandwidth on this loudspeaker as we did in Fig 2, the bandwidth will range from 7.3 kHz to 13.9 kHz.

 

Hmmmm… that didn’t turn out as nicely as I had hoped. It seems that (using this definition) the loudspeaker whose response is shown in Figure 3 has a Frequency Range of 7.3 kHz to 13.9 kHz. This is unfortunate, since it is not a tweeter – it’s a rather large, commerically-available, floor-standing loudspeaker with a rather good reputation.

Okay, maybe we’re being too stringent. Let’s say that, instead of defining the Frequency Range as the area between the – 3 / + 0 dB points, we’ll make it ± 3 dB instead. Figure 4 shows the same loudspeaker with that version of Frequency Range.

 

Fig 4: An unsmoothed on-axis magnitude response of a real loudspeaker. We now define the Frequency Range as being within the ± 3 dB points, which results in a range of 2.2 kHz to 15.9 kHz.
Fig 4: An unsmoothed on-axis magnitude response of Loudspeaker #1. We now define the Frequency Range as being within the ± 3 dB points, which results in a range of 2.2 kHz to 15.9 kHz.

 

Great- it got better! Now the Frequency Range of this loudspeaker (under the new ±3 dB definition) is from 2.2 kHz to 15.9 kHz. On paper, that still makes it a tweeter – so we’re still in trouble here. Note that I have scaled the magnitude response here to “help” the loudspeaker as much as I can by putting the peak in the magnitude response on the + 3 dB line. If I had not done this (for example, if I had said “±3 dB relative to the magnitude at 1 kHz” the numbers would certainly not get better…)

Let’s try the same definition on a different loudspeaker – shown in Figure 5.

 

Fig 4: An unsmoothed on-axis magnitude response of a real loudspeaker. Using a Frequency Range  defined by the ± 3 dB points, the range is 5.4 kHz to 18.0 kHz.
Fig 4: An unsmoothed on-axis magnitude response of Loudspeaker #2. Using a Frequency Range defined by the ± 3 dB points, the range is 5.4 kHz to 18.0 kHz.

 

This loudspeaker (also a commerically-available floor-standing model with a good reputation) has a Frequency Range (using the ± 3 dB points) of 5.4 kHz to 18.0 kHz. This ±3 dB definition isn’t working out very well. Let’s try one more loudspeaker to see what happens.

 

Fig 6: An unsmoothed on-axis magnitude response of a real loudspeaker. Using a Frequency Range  defined by the ± 3 dB points, the range is 5.4 kHz to 18.0 kHz.
Fig 5: An unsmoothed on-axis magnitude response of Loudspeaker #3. Using a Frequency Range defined by the ± 3 dB points, the range is 83 Hz to 20.9 kHz.

 

Yay! It worked! For loudspeaker #3, shown in Figure 5, the Frequency Range is from 83 Hz to 20.9 kHz. Ummmm… except that, of the 3 loudspeakers I measured for this experiment, this is, by far the smallest. It’s a little 2-way loudspeaker that I can easily lift with one hand whilst sipping a cup of coffee from the other (and I don’t work out – ever!)

So, what we have learned so far is that it’s better to buy a small loudspeaker than a big one, since it has a much wider frequency range. No, wait. that can’t be right…

Hmmmm… what if we were to smooth the magnitude responses? Maybe that would help…

 

Fig 6: The on-axis magnitude responses of the 3 loudspeakers (Black=#1, Blue=#2, Red =#3), 1/3 octave smoothed.
Fig 6: The on-axis magnitude responses of the 3 loudspeakers (Black=#1, Blue=#2, Red =#3), 1/3 octave smoothed.

 

That’s a little better, but Loudspeaker #1 (the black curve), still bottoms out in the midrange, causing its frequency range to resemble that of a tweeter. And Loudspeaker #2 loses out on the high end due to the peaks. (Note that, in this plot, I’ve scaled them all to have the same magnitude at 1 kHz – just trying something out to see if that helps. It didn’t.) How about more smoothing?

 

Fig 7: The on-axis magnitude responses of the 3 loudspeakers (Black=#1, Blue=#2, Red =#3), 1 octave smoothed.
Fig 7: The on-axis magnitude responses of the 3 loudspeakers (Black=#1, Blue=#2, Red =#3), 1 octave smoothed.

 

Got it! Now all of the loudspeakers’ responses have been smeared out enough… uh… adequately smoothed… to make our definition of Frequency Range have a little meaning. The “big” loudspeakers have wider Frequency Ranges than the “little” loudspeaker. So, we’ll just octave-smooth all of our measurements. Well…  at least until we find another loudspeaker that needs even more smoothing…

 

Sidebar: In case you’re wondering: the three loudspeakers I’m talking about above are all commercially available products. One of the three is a Bang & Olufsen loudspeaker. Don’t bother asking which is which (or which B&O loudspeaker it is) – I’m not telling – mostly because it doesn’t matter.

The moral thus far…

Of course, the point that I’m trying to make here is that Frequency Range, like any specification for any audio device, needs to make sense. If we arbitrarily set some test method (i.e. “measure the on-axis magnitude response – and then smooth it”) and apply arbitrary criteria (i.e. ± 3 dB) then we may not get a useful description of the device’s behaviour.

So, at this point, you’re probably asking “Well… how does B&O measure Frequency Range?” Well, I’m glad you asked!

After we’re done with the sound design of the loudspeaker, we have to make sure it has the correct sensitivity. This is a measure of how loud it is for a given voltage at its input.  So, we put the loudspeaker in the Cube and measure its final on-axis magnitude response.

I’ll illustrate this using a magnitude response that I invented, shown in Fig 8. Note that this response is not a real loudspeaker – it’s one that I invented just for the purposes of this discussion.

 

Step 1: We measure the on-axis magnitude response of the loudspeaker at 1 m after the sound design is finished. We then look at the average level of the response between 200 Hz and 2 kHz.
Fig 8: Step 1: We measure the on-axis magnitude response of the loudspeaker at 1 m after the sound design is finished. We then look at the average level of the response between 200 Hz and 2 kHz. (Note that the response shown here is NOT a measurement of a real loudspeaker.)

 

 

We then look at the average level of the magnitude response between 200 Hz and 2 kHz and adjust the gain in the signal processing of the loudspeaker to make the sensitivity what we want it to be. For almost all B&O loudspeakers, that sensitivity corresponds to an output level of 88 dB SPL for an input with a level of 125 mV RMS. (The only exceptions are BeoLab 1, BeoLab 5, and BeoLab 9 which produce 91 dB SPL for a 125 mV RMS input.)

So, after the gain has been adjusted, the magnitude response looks like Figure 9, below.

 

Step 2: The gain of the loudspeaker has been adjusted so that the average output between 200 Hz and 2 kHz is 88 dB SPL for a 125 mV RMS input.
Fig 9: Step 2: The gain of the loudspeaker has been adjusted so that the average output between 200 Hz and 2 kHz is 88 dB SPL for a 125 mV RMS input.

 

We then look for the frequencies that have a magnitude that are 10 dB lower than the average level between 200 Hz and 2 kHz. This is illustrated in Figure 10.

 

Step 3: The frequencies with magnitudes that are 10 dB lower than the average between 200 Hz and 2 kHz are found. These are the values used in the Frequency Range specification.
Step 3: The frequencies with magnitudes that are 10 dB lower than the average between 200 Hz and 2 kHz are found. These are the values used in the Frequency Range specification.

 

The values that correspond to the -10 dB points (relative to the average level between 200 Hz and 2 kHz) are the frequencies stated in the Frequency Range specification.

This is how B&O specifies Frequency Range for all of its loudspeakers. That way, you (and we) can directly compare their specifications to each other. Of course, some other manufacturer may (or probably will) use a different method – so you cannot use B&O’s Frequency Range specifications to compare to another company’s products. We don’t use a ±3 dB threshold, not only because this would require arbitrary smoothing in order to prevent weird things from happening (as I showed above) but also because the on-axis magnitude response of B&O loudspeakers is a result of the loudspeakers’ sound design (which includes a consideration of its power response) which means that, if you just look at the on-axis response, it might not be as flat as a magazine would lead you to believe it should be.

 

 

The Fine Print

1. The method I described above is a slightly simplified explanation of what we actually do – but the difference between what I said and the truth is irrelevant. The details are in the method we use to do the averaging – so it’s not really a big deal unless you’re actually writing the software that has to do the work.

2. We do this measurement of the Frequency Range using a signal with a level of 125 mV at the input of the loudspeaker. So, if your music is playing with a similar level (or lower) then you will have a loudspeaker that is performing as specified. However, if you play the music louder, the frequency range will change. In most cases, the low frequency limit will increase due to the ABL and thermal protection algorithms. The details of (1) how much it will increase, (2) what level of music will cause it to increase, and (3) what frequency content in the music will cause it to increase, are different from loudspeaker model to loudspeaker model.  This was the root of some confusion for some people when they compare the frequency range of the BeoLab 12-3 to the BeoLab 12-2. These two loudspeakers have almost identical low frequency cutoffs, despite the fact that one of them has 2 woofers and the other has only 1. At “normal” listening levels, they have both been tuned to have similar magnitude responses – however, as you turn up the volume, the BeoLab 12-2 will lose bass earlier than the BeoLab 12-3.

3. Subwoofers are different – since it doesn’t make sense to try and find the average magnitude response of a subwoofer between 200 Hz and 2 kHz.

 

B&O Tech: Near… Far…

#27 in a series of articles about the technology behind Bang & Olufsen loudspeakers

Introduction

To begin with, please watch the following video.

One thing to notice is how they made Grover sound near and far. Two things change in his voice (yes, yes, I know. It’s not ACTUALLY Grover’s voice. It’s really Yoda’s). The first change is the level – but if you’re focus on only that you’ll notice that it doesn’t really change so much. Grover is a little louder when he’s near than when he’s far. However, there’s another change that’s more important – the level of the reverberation relative to the level of the “dry” voice (what recording engineers sometimes call the “wet/dry mix”). When Grover is near, the sound is quite “dry” – there’s very little reverberation. When Grover is far, you hear much more of the room (more likely actually a spring or a plate reverb unit, given that this was made in the 1970’s).

This is a trick that has been used by recording engineers for decades. You can simulate distance in a mix by adding reverb to the sound. For example, listen to the drums and horns in the studio version of Penguins by Lyle Lovett. Then listen to the live version of the same people playing the same tune. Of course, there are lots of things (other than reverb) that are different between these two recordings – but it’s a good start for a comparison. As another example, compare this recording to this recording. Of course, these are different recordings of different people singing different songs – but the thing to listen for is the wet/dry mix and the perception of distance in the mix. Another example is this recording compared to this recording.

So, why does this trick work? The answer lies inside your brain – so we’ll have to look there first.

Distance Perception in the Mix

If you’re in a room with your eyes closed, and someone in the room starts talking to you, you’ll be pretty good at estimating where they are in the room – both in terms of angular location (you can point at them) and distance. This is true, even if you’ve never been in the room before. Very generally speaking, what’s going on here is that your brain is automatically comparing:

  • the two sounds coming into your two ears – the difference between these two signals tells you a lot about which direction the sound is coming from, AND
  • the direct sound from the source to the reflected sound coming from the room. This comparison gives you lots of information about a sound source’s distance and the size and acoustical characteristics of the room itself.

If we do the same thing in an anechoic chamber (a room where there are no echoes, because the walls absorb all sound) you will still be good at estimating the angle to the sound source (because you still have two ears), but you will fail miserably at the distance estimation (because there are no reflections to help you figure this out).

If you want to try this in real life, go outside (away from any big walls), close your eyes, and try to focus on how far away the sound sources appear to be. You have to work a little to force yourself to ignore the fact that you know where they really are  – but when you do, you’ll find that things sound much closer than they are. This is because outdoors is relatively anechoic. If you go to the middle of a frozen lake that’s covered in fluffy snow, you’ll come as close as you’ll probably get to an anechoic environment in real life. (unless you do this as a hobby)

So, the moral of the story here is that, if you’re doing a recording and you want to make things sound far away, add reflections and reverberation – or at least make them louder and the direct sound quieter.

Distance Perception in the Listening Room

Let’s go back to that example of the studio recording of Lyle Lovett recording of Penguins. If you sit in your listening room and play that recording out of a pair of loudspeakers, how far away do the drums and horns sound relative to you? Now we’re not talking about whether one sounds further away than the other within the mix. I’m asking, “If you close your eyes and try to guess how far away the snare drum is from your listening position – what would you guess?”

For many people, the answer will be approximately as far away as the loudspeakers. So, if your loudspeakers are 3 m from the listening position, the horns (in that recording) will sound about 3 m away as well. However, this is not necessarily the case. Remember that the perception of distance is dependent on the relative levels of the direct and reflected sounds at your ears. So, if you listen to that recording in an anechoic chamber, the horns will sound closer than the loudspeakers (because there are no reflections to tell you how far away things are). The more reflective the room’s surfaces, the more the horns will sound further away (but probably no further than the loudspeakers, since the recording is quite dry).

This effect can also be the result of the width of the loudspeaker’s directivity. For example, a loudspeaker that emits a very narrow beam (like a laser, assuming that were possible) would not send any sound towards the walls – only towards the listening position. So, this would have the same effect as having no reflection (because there is no sound going towards the sidewalls to reflect). In other words, the wider the dispersion of the sound from the loudspeaker (in a reflective room) the greater the apparent distance to the sound (but no greater than the distance to the loudspeakers, assuming that the recording is “dry”).

Loudspeaker directivity

So, we’ve established that the apparent distance to a phantom image in a recording is, in part, and in some (perhaps most) cases, dependent on the loudspeaker’s directivity. So, let’s concentrate on that for a bit.

Let’s build a very simple loudspeaker. It’s a model that has been used to simulate the behaviour of a real loudspeaker, so I don’t feel too bad about over-simplifying too much here. We’ll build an infinite wall with a piston in it that moves in and out. For example:

Here, you can see the piston (in red) moving in and out of the wall (in grey) with the resulting sound waves (the expanding curves) moving outwards in the air (in white).

The problem with this video is that it’s a little too simple. We also have to consider how the sound radiation off the front of the piston will be different at different frequencies. Without getting into the physics of “why” (if you’re interested in that, you can look here or here or here for an explanation) a piston has a general behaviour with repeat to the radiation patten of the sound wave it generates. Generally, the higher the frequency, the narrower the “beam” of sound. At low frequencies, there is basically no beam – the sound is emitted in all directions equally. At high frequencies, the beam to be very narrow.

The question then is “how high a frequency is ‘high’?” The answer to that lies in the diameter of the piston (or the diameter of the loudspeaker driver, if we’re interested in real life). For example, take a look at Figure 1, below.

Radiation of 100 Hz (blue) and 1.5 kHz (green) from a 10" diameter piston (i.e. a woofer).
Fig 1: Radiation of 100 Hz (blue) and 1.5 kHz (green) from a 10″ diameter piston (i.e. a woofer).

Figure 1 shows how loud a signal will be if you measure it at different directions relative to the face of a piston that is 10″ (25.4 cm) in diameter. Two frequencies are shown – 100 Hz (the blue curve) and 1.5 kHz (the green curve). Both curves have been normalised to be the same level (100 dB SPL – although the actual value really doesn’t matter) on axis (at 0°). As you can see in the plot, as you move off to the side (either to 90° or 270°) the blue curve stays at 100 dB SPL. So, no matter what your angle relative to on-axis to the woofer, 100 Hz will be the same level (assuming that you maintain your distance). However, look at the green curve in comparison. As you move off to the side, the 1.5 kHz tone drops by more than 20 dB. Remember that this also means that (if the loudspeaker is pointing at you and the sidewall is to the side of the loudspeaker) then 100 Hz and 1.5 kHz will both get to you at the same level. However, the reflection off the wall will have 20 dB more level at 100 Hz than at 1.5 kHz. This also means, generally, that there is more energy in the room at 100 Hz than there is at 1.5 kHz because, if you consider the entire radiation of the loudspeaker averaged over all directions at the same time the lower frequency is louder in more places.

This, in turn, means that, if all you have is a 10″ woofer and you play music, you’ll notice that the high frequency content sounds closer to you in the room than the low frequency content.

If the loudspeaker driver is smaller, the effect is the same, the only difference is that the effect happens at a higher frequency. For example, Figure 2, below shows the off-axis response for two frequencies emitted by a 1″ (2.54 cm) diameter piston (i.e. a tweeter).

Radiation of 1.5 kHz (blue) and 15 kHz (green) from a 1" diameter piston (i.e. a tweeter).
Radiation of 1.5 kHz (blue) and 15 kHz (green) from a 1″ diameter piston (i.e. a tweeter).

Notice that the effect is identical, however, now, 1.5 kHz is the “low frequency region for the small piston, so it radiates in all directions equally (seen as the blue curve). The high frequency (now 15 kHz) becomes lower and lower in level as you move off to the side of the driver, going as low as -20 dB at 90°.

So, again, if you’re listening to music through that tweeter, you’ll notice that the frequency content at 1.5 kHz sounds further away from the listening position than the content at 15 kHz. Again, the higher the frequency, the closer the image.

Same information, shown differently

If you trust me, figures 1 and 2, above, show you that the sound radiating off the front of a loudspeaker driver gets narrower with increasing frequency. If you don’t trust me (and you shouldn’t – I’m very untrustworthy…) then you’ll be saying “but you only showed me the behaviour at two frequencies… what about the others?” Well, let’s plot the same basic info differently, so that we can see more data.

Figure 3, below, shows the same 10″ woofer, although now showing all frequencies from 20 Hz to 20 kHz, and all angles from -90° to +90°. However, now, instead of showing all levels (in dB) we’re only showing 3 values, at -1 dB, -3 dB, and -10 dB. ( These plots are a little tougher to read until you get used to them. However, if you’re used to looking at topographical maps, these are the same.)

A contour plot showing the directivity of a 10" piston (i.e. a woofer). The red area has a magnitude between 0 and -1 dB. The orange area has a magnitude of -1 down to -3 dB. The blue area has a magnitude of -3 down to -10 dB. The white area is lower than -10 dB.
Fig 3: A contour plot showing the directivity of a 10″ piston (i.e. a woofer). The red area has a magnitude between 0 and -1 dB. The orange area has a magnitude of -1 down to -3 dB. The blue area has a magnitude of -3 down to -10 dB. The white area is lower than -10 dB.

Now you can see that, as you get higher in frequency, the angles where you are within 1 dB of the on-axis response gets narrower, starting at about 400 Hz. This means that a 10″ diameter piston (which we are pretending to be a woofer) is “omnidirectional” up to 400 Hz, and then gets increasingly more directional as you go up.

Figure 4 shows the same information for a 1″ diameter piston. Now you can see that the driver is omnidirectional up to about 4 kHz. (This is not a coincidence – the frequency is 10 times that of the woofer because the diameter is one tenth.)

A contour plot showing the directivity of a 1" piston (i.e. a tweeter). The red area has a magnitude between 0 and -1 dB. The orange area has a magnitude of -1 down to -3 dB. The blue area has a magnitude of -3 down to -10 dB. The white area is lower than -10 dB.
Fig 4: A contour plot showing the directivity of a 1″ piston (i.e. a tweeter). The red area has a magnitude between 0 and -1 dB. The orange area has a magnitude of -1 down to -3 dB. The blue area has a magnitude of -3 down to -10 dB. The white area is lower than -10 dB.

Normally, however, you do not make a loudspeaker out of either a woofer or a tweeter – you put them together to cover the entire frequency range. So, let’s look at a plot of that behaviour. I’ve put together our two pistons using a 4th-order Linkwitz-Riley crossover at 1.5 kHz. I have also not included any weirdness caused by the separation of the drivers in space. This is theoretical world where the tweeter and the woofer are in the same place  – an impossible coaxial loudspeaker.

A contour plot showing the directivity of a two-way loudspeaker made of a 1" and a 10" piston. The red area has a magnitude between 0 and -1 dB. The orange area has a magnitude of -1 down to -3 dB. The blue area has a magnitude of -3 down to -10 dB. The white area is lower than -10 dB.
Fig 5: A contour plot showing the directivity of a two-way loudspeaker made of a 1″ and a 10″ piston. The red area has a magnitude between 0 and -1 dB. The orange area has a magnitude of -1 down to -3 dB. The blue area has a magnitude of -3 down to -10 dB. The white area is lower than -10 dB.

In Figure 5 you can see the effects of the woofer’s directivity starting to beam below the crossover, and then the tweeter takes over and spreads the radiation wide again before it also narrows.

So what?

Why should you care about understanding the plot in Figure 5? Well, remember that the narrower the radiation of a loudspeaker, the closer the sound will appear to be to you. This means that, for the imaginary loudspeaker shown in Figure 5, if you’re playing a recording without additional reverberation, the low frequency stuff will sound far away (the same distance as the loudspeakers), So will a narrow band between 3 kHz and 4 kHz (where the tweeter pulls the radiation wider). However, the materials in the band around 700 Hz – 2 kHz and in the band above 7 kHz will sound much closer to you.

Another way to express this is to show a graph of the resulting level of the reverberant energy in the listening room relative to the direct sound, an example of which is shown in Figure 6. (This is a plot copied from “Acoustics and Psychoacoustics” by David Howard and Jamie Angus).

Reverberant energy from the room relative to the direct sound from a two-way loudspeaker. (X and Angus, 2000)
Fig 6: Reverberant energy from the room relative to the direct sound from a two-way loudspeaker. (from Howard and Angus, 2000)

This shows a slightly different loudspeaker with a crossover just under 3 kHz. This is easy to see in the plot, since it’s where the tweeter starts putting more sound into the room, thus increasing the amount of reverberant energy.

What does all of this mean? Well, if we simplify a little, it means that things like voices will pull apart in terms of apparent distance. Consonant sounds like “s” and “t” will appear to be closer than vowels like “ooh”.

So, whaddya gonna do about it?

All of this is why one of the really important concerns of the acoustical engineers at Bang & Olufsen is the directivity of the loudspeakers. In a previous posting, I mentioned this a little – but then it was with regards to identifying issues related to diffraction. In that case, directivity is more of a method of identifying a basic problem. In this posting, however, I’m talking about a fundamental goal in the acoustical design of the loudspeaker.

For example, take a look at Figures 7 and 8 and compare them to Figure 9. It’s important to note here that these three plots show the directivities of three different loudspeakers with respect to their on-axis response. The way this is done is to measure the on-axis magnitude response, and call that the reference. Then you measure the magnitude response at a different angle, and then calculate the difference between that and the reference. In essence, you’re pretending that the on-axis response is flat. This is not to be interpreted that the three loudspeakers shown here have the same on-axis response. They don’t. Each is normalised to its own on-axis response. So we’re only considering how the loudspeaker compares to itself.

A contour plot showing the directivity of a commercially available 3-way loudspeaker. The red area has a magnitude between 0 and -1 dB. The orange area has a magnitude of -1 down to -3 dB. The blue area has a magnitude of -3 down to -10 dB. The white area is lower than -10 dB.
Fig 7: A contour plot showing the directivity of a commercially available 3-way loudspeaker. The red area has a magnitude between 0 and -1 dB. The orange area has a magnitude of -1 down to -3 dB. The blue area has a magnitude of -3 down to -10 dB. The white area is lower than -10 dB.

Figure 7, above, shows the directivity behaviour of a commercially-available 3-way loudspeaker (not from Bang & Olufsen). You can see that the woofer is increasingly beaming (the directivity gets narrow)  up to the 3 – 5 kHz area. The midrange is beaming up above 10 kHz or so. So, a full band signal will sound distant in the low end, in the 6-7 kHz range and around 15 kHz. By comparison, signals at 2-4 kHz and 10-15 kHz will sound quite close.

A contour plot showing the directivity of traditionally designed 4-way loudspeaker. The red area has a magnitude between 0 and -1 dB. The orange area has a magnitude of -1 down to -3 dB. The blue area has a magnitude of -3 down to -10 dB. The white area is lower than -10 dB.
Fig 8: A contour plot showing the directivity of traditionally designed 4-way loudspeaker. The red area has a magnitude between 0 and -1 dB. The orange area has a magnitude of -1 down to -3 dB. The blue area has a magnitude of -3 down to -10 dB. The white area is lower than -10 dB.

Figure 8, above, shows the directivity behaviour of a 3-way loudspeaker we made as a rough prototype. This is just a woofer, midrange and tweeter, each in its own MDF box – nothing fancy – except that the tweeter box is not as wide as the midrange box which is narrower than the woofer box. You can see that the woofer is beaming (the directivity gets narrow)  just above 1 kHz – although it has a very weird wide directivity at around 650 Hz for some reason. The midrange is beaming up from 5kHz to 10 kHz, and then the tweeter gets wide. So, this loudspeaker will have the same problem as the commercial loudspeaker

A contour plot showing the directivity of a BeoLab 5. The red area has a magnitude between 0 and -1 dB. The orange area has a magnitude of -1 down to -3 dB. The blue area has a magnitude of -3 down to -10 dB. The white area is lower than -10 dB.
Fig 9: A contour plot showing the directivity of a BeoLab 5. The red area has a magnitude between 0 and -1 dB. The orange area has a magnitude of -1 down to -3 dB. The blue area has a magnitude of -3 down to -10 dB. The white area is lower than -10 dB.

As you can see, the loudspeaker with the directivity shown in Figure 9 (the BeoLab 5) is much more constant as you change frequency (in other words, the lines are more parallel). It’s not perfect, but it’s a lot better than the other two – assuming that constant directivity is your goal. You can also see that the level of the signal that is within 1 dB of the on-axis response is quite wide compared with the loudspeakers in Figures 7 and 8. The loudspeaker in Figure 7 not only beams in the high frequencies, but also has some strange “lobes” where things are louder off-axis than they are on-axis (the red lines).

When you read B&O’s marketing materials about the reason why we use Acoustic Lenses in our loudspeakers, the main message is that it’s designed to spread the sound – especially the high frequencies – wider than a normal tweeter, so that everyone on the sofa can hear the high hat. This is true. However, if you ask one of the acoustical engineers who worked on the project, they’ll tell you that the real reason is to maintain constant directivity as well as possible in order to ensure that the direct-to-reverberant ratio in your listening room does not vary with frequency. However, that’s a difficult concept to explain in 1 or 2 sentences, so you won’t hear it mentioned often. However, if you read this paper (which was published just after the release of the BeoLab 5), for example, you’ll see that it was part of the original thinking behind the engineers on the project.

 Addendum 1.

I’ve been thinking more about this since I wrote it. One thing that I realised that I should add was to draw a comparison to timbre. When you listen to music on your loudspeakers in your living room, in a best-case scenario, you hear the same timbral balance that the recording engineer and the mastering engineer heard when they worked on the recording. In theory, you should not hear more bass or less midrange or more treble than they heard. The directivity of the loudspeaker has a similar influence – but on the spatial performance of the loudspeakers instead of the timbral performance. You want a loudspeaker that doesn’t alter the relative apparent distances to sources in the mix – just like you don’t want the loudspeakers to alter the timbre by delivering too much high frequency content.

Addendum 2.

One more thing… I made the plot below to help simplify the connection between directivity and Grover. Hope this helps.

A contour plot showing the directivity of a commercially available 3-way loudspeaker. The wider the plot (vertically), the farther the image.
A contour plot showing the directivity of a commercially available 3-way loudspeaker. The wider the plot (vertically), the farther the image.

Bang & Olufsen BeoVision Avant Reviews

bvavant

 

 

I was responsible for the final sound design (aka tonal balance) of the loudspeakers built into the BeoVision Avant. So, I’m happy to share some of the blame for some of the comments (at least on the sound quality) from the reviews.

 

from Home Cinema Choice magazine

“Where the Avant really gets space-age, though, is with … its ability to drive 21 audio channels ”

“This effort by the integrated speakers , together with its unprecedented audio flexibility,makes the Avant the finest sounding TV I’ve ever heard.”

 

from TrustedReviews.com

“Even a high-end sound bar would struggle to match the gorgeous finesse the Avant combines with its raw power. The speakers reproduce soundtrack subtleties more precisely and elegantly than any other TV we’ve heard. And they do so no matter how dense the soundstage becomes, and without so much as a hint of treble harshness.”

“Then there’s that rear-mounted subwoofer. We had worried that the way this angled subwoofer fires up and out through an actually quite narrow vent could cause boominess or distortion, but not a bit of it. Instead very impressive and well-rounded amounts of bass meld immaculately into the bottom end of the wide mid-range delivered by those terrific left, right and centre speakers.”

 

from flatpanelshd.com

Compared to all other TVs on the market (non-B&O) there is no competition. Sound is so much better. However, we also have to point out that the TV did not receive the best conditions for a proper audio demonstration.

Bang & Olufsen BeoLab 18 Reviews

xl_B&O-BeoLab-17-Close-624

 

I was part of the development team, and one of the two persons who decided on the final sound design (aka tonal balance) of the B&O BeoLab 18 loudspeakers. So, I’m happy to share some of the blame for some of the comments (at least on the sound quality) from the reviews.

 

Bernard Dickinson at Live Magazines said:

The sound reproduction is flawless”

 

Lyd & Billede’s August 2014 review said

“Lydkvaliteten er rigtig god med en åben, distinkt og fyldig gengivelse, som ikke gør højopløste lydformater til skamme.” (The sound quality is very good with an open, clear and detailed reproduction, which do not put high-resolution audio formats to shame.)

and ”Stemmerne er lige klare og tydelige, hvad enten vi sidder lige i smørhullet eller befinder os langt ude i siden. Det er faktisk ret usædvanligt og gør, at BeoLab 18 egner sig lige godt til både baggrundsmusik og aktiv lytning” (The voices are crisp and clear, whether we are sitting right in the sweet spot or far off to the side. It’s actually quite unusual and makes the BeoLab 18 equally suited for both background music and active listening)

B&O Tech: Combinatorics

#26 in a series of articles about the technology behind Bang & Olufsen loudspeakers

 

I occasionally drop in to read and comment on the fora at www.beoworld.org. This is a group of Bang & Olufsen enthusiasts who, for the most part, are a great bunch of people, are very supportive of the brand (and yet, like any good family member, are not afraid to offer constructive criticism when it’s warranted…) Anyways, during one discussion about Speaker Groups in the BeoVision Avant, BeoVision 11, BeoPLay V1 and BeoSystem 4, the following question came up:

 

Speaker Groups

Would be interesting to know how many different ‘Speaker Group constellations’ you actually could make with the new engine. Of course you are limited to 10, if you want to save them. But I guess that should be enough for most of us.

 

This got me thinking about exactly how many possible combinations of parameters there are in a single Speaker Group in our video products. As a result, I answered with the response copied-and-pasted below:

Multiply the number of audio channels you have (internal + external) by 17 (the total number of possible speaker roles not including subwoofers, but including NONE as an option in in case you want to use a subset of your loudspeakers) or 22 (the total number of possible speaker roles including subwoofers)  to get the total number of Loudspeaker Constellations.

If you want to include Speaker Level and Speaker Distance, then you will have to multiply the previous result by 301 (possible distances for each loudspeaker) and multiply again by 61 (the total number of Speaker Levels) to get the total number of possible Configurations.

This means:

If you have a BeoPlay V1 (with 2 internal + 6 external outputs) the answers are

  • 136 constellations without a subwoofer, or
  • 176 constellations with subwoofers, and
  • a maximum of 3,231,536 possible Speaker Groups Configurations (including Levels and Distances)

If you have a BeoVision 11 without Wireless (2+10 channels), then the totals are

  • 204 constellations without a subwoofer, or
  • 264 constellations with subwoofers, and
  • 4,847,304 possible Speaker Groups Configurations (including Levels and Distances)

If you have a BeoVision 11 with Wireless (2+10+8 channels), then the totals are

  • 340 constellations without a subwoofer, or
  • 440 constellations with subwoofers, and
  • 8,078,840 possible Speaker Groups Configurations (including Levels and Distances)

If you have a BeoVision Avant (3+10+8 channels), then the totals are

  • 357 constellations without a subwoofer, or
  • 462 constellations with subwoofers, and
  • 8,482,782 possible Speaker Groups Configurations (including Levels and Distances)

Note that these numbers are FOR EACH SPEAKER GROUP. So you can multiply each of those by 10 (for the number of Speaker Groups you have available in your TV). The reader is left to do this math on his/her own.

Note as well that I have not included the Speaker Roles MIX LEFT, MIX RIGHT – for those of you who are using your Speaker Groups to make headphone outputs – you know who you are… ;-)

Note as well that I have not included the possibilities for the Bass Management control.

 

 Sound Modes

 

This also got me thinking about the total number of possible combinations of settings there are for the Sound Modes in the same products. In order to calculate this, you start with the  list of the parameters and their possible values which is listed below:

  • Frequency Tilt: 21
  • Sound Enhance: 21
  • Speech Enhance: 11
  • Loudness On/Off: 2
  • Bass Mgt On/Off: 2
  • Balance: 21
  • Fader: 21
  • Dynamics (off, med, max): 3
  • Listening Style: 2
  • LFE Input on/off: 2
  • Loudness Bass: 13
  • Loudness Treble: 13
  • Spatial Processing: 3
  • Spatial Surround: 11
  • Spatial Height: 11
  • Spatial Stage Width: 11
  • Spatial Envelopment: 11
  • Clip Protection On/Off: 2

Multiply all those together and you get 1,524,473,211,092,832 different possible combinations for the parameters in a Sound Mode.

Note that this does not include the global Bass and Treble controls which are not part of the Sound Mode parameters.

However, it’s slightly misleading, since some parameters don’t work in some settings of other parameters. For example:

All four Spatial Controls are disabled when the Spatial Processing is set to either “1:1” or “downmix” – so that takes away 29282 combinations.

If Loudness is set to Off, then the Loudness Bass and Loudness Treble are irrelevant – so that takes away 169 combinations.

So, that reduces the total to only 1,524,473,211,063,381 total possible parameter configurations for a single Sound Mode.

Finally, this calculation assumes that you have all output Speaker Roles in use. For example, if you don’t have any height loudspeakers, then the Spatial Height control won’t do anything useful.

 

 

If you’d like more information on what these mean, please check out the Technical Sound Guide for Bang & Olufsen video products downloadable from this page.

B&O Tech: Where great sound starts

#25 in a series of articles about the technology behind Bang & Olufsen loudspeakers

 

You’ve bought your loudspeakers, you’ve connected your player, your listening chair is in exactly the right place. You sit down, put on a new recording, and you don’t like how it sounds. So, the first question is “who can I blame!?”

Of course, you can blame your loudspeakers (or at least, the people that made them). You could blame the acoustical behaviour of your listening room (that could be expensive). You could blame the format that you chose when you bought the recording (was it 128 kbps MP3 or a CD?). Or, if you’re one of those kinds of people, you could blame the quality of the AC mains cable that provides the last meter of electrical current supply to your amplifier from the hydroelectric dam 3000 km away Or you could blame the people who made the recording.

In fact, if the recording quality is poor (whatever that might mean) then you can stop worrying about your loudspeakers and your room and everything else – they are not the weakest link in the chain.

So, this week, we’ll talk about who those people are that made your recording, how they did it, and what each of them was supposed to look after before someone put a CD on a shelf (or, if you’re a little more current, put a file on a website).

 

Recording Engineer

The recording engineer is the person you picture of when you think about a recording session. You have the musicians in the studio or the concert hall, singing and playing music. That sound travels to microphones that were set up by a Recording Engineer who then sits behind a mixing console (if you’re American – a “mixing desk” if you’re British) and fiddles with knobs obsessively.

Fig 1. A recording engineer (the gentleman on the right) engineering a recording. This is actually probably a staged shot – but it could easily have been taken either during the tracking or mixing part of the process.

There’s a small detail here that we should not overlook. Generally speaking, a “recording engineer” has to do two things that happen at different times in the process of making a recording. The first is called “tracking” and the second is called “mixing”.

 

Tracking

Normally, bands don’t like playing together – sometimes because they don’t even like to be in the same room as each other.  Sometimes schedules just don’t work out. Sometimes the orchestra and the soloist can’t be in the same city at the same time.

In order to circumvent this problem, the musicians are recorded separately in a process called “tracking”. During tracking, each musician plays their part, with or without other members of the band or ensemble. For example, if you’re a rock band, the bass and the drummer usually arrive first, and they play their parts. In the old days, they would have been recorded to seaport tracks on a very wide (2″!) magnetic tape (hence the term “tracking”) where each instrument is recorded on a separate track. That way, the engineer has a separate recording of the kick drum and the snare drum and each tom-tom and each cymbal, and so on and so on. Nowadays, most people don’t record to magnetic tape because it’s too expensive. Instead, the tracks are recorded on a hard disc on a computer. However, the process is basically the same.

Once the bass player and the drummer are done, then the guitarist comes into the studio to record his or her parts while listening to the previously-recorded bass and drum parts over a pair of headphones. Then the singer comes in and listens to the bass, drums and guitar and sings along. Then the backup vocalists come in, and so on and so on, until everyone has recorded their part.

During the tracking, the recording engineer sets up and positions the microphones to get the optimal sound for each instrument. He or she will make sure that the gain that is applied to each of those microphones is correct – meaning that it’s recorded at a level that is high enough be mask the noise floor of the electronics and the recording medium, but not so high that it distorts. In the old days, this was difficult because the dynamic range of the recording system was quite small – so they had to stay quite close to the ceiling all the time – sometimes hitting it. Nowadays, it’s much easier, since the signal paths have much wider dynamic ranges so there’s more room for error.

 

In the case of a classical recording, it might be a little different for the musicians, but the technical side is essentially the same. For example, an orchestra will play (so you don’t bring in the trombone section first – everyone plays together) with a lot of microphones in the room. Each microphone will be recorded on its own individual track, just like with the rock band. The only difference is that everyone is playing at the same time.

Fig 2. A typical orchestra recording session. Note that all the musicians are there, and there are a lot of microphones in the room. Each of those microphones is probably being recorded on its own independent track on a hard disc somewhere so that they can be mixed together later.

 

Once all the tracking is done the musicians are finished. They’ve all been captured, each on their own track that can be played back later in isolation (for example, you can listen to just the snare drum, or just the microphone above the woodwind section). Sometimes, they will even have played or sung their part more than once – so we have different versions or “takes” to choose from later. This means that there may be hundreds of tracks that all need to be mixed together (or perhaps just left out…) in order to make something that normal people can play on their stereo.

 

Mixing

Now that all the individual tracks are recorded, they have to be combined into a pretty package that can be easily delivered to the customers. This means that all of those individual tracks that have been recorded have to be assembled or “mixed” together into a version that has, say, only two channels – one for the left loudspeaker and one for the right loudspeaker. This is done by feeding each individual track to its own input on a mixing console and listening to them individually to see how they best fit together. This is called the “mixing” process. During this stage, basic decisions are made like “how loud should the vocals be relative to the guitars (and everything else)”. However, it’s a little more detailed than that. Each track will need its own processing or correction (maybe changing the equalisation on the snare drum – or altering the attack and decay of the bass guitar using a dynamic range compressor – or the level of the vocal recording is changed throughout the tune to compensate for the fact that the singer couldn’t stay the same distance from the microphone whilst singing…) that helps it to better fit into the final mix.

Fig 3. A mixing console that has been labelled with the various tracks for the input strips. This is a very typical look for a console during a mixing session – although the surroundings are not.

 

If you walk into the control room of a recording studio during a mixing session, you’d see that it looks almost exactly like a recording session – except that there are no musicians playing in the studio. This is because what you usually see on videos like this one is a tracking session – but the recording engineer usually does a “rough mix” during tracking – just to get a preliminary idea of how the puzzle will fit together during mixing.

Once the mixing session for the tune is finished, then you have a nearly-finished product. You at least have something that the musicians can take home to have a listen to see if they’re satisfied with the overall product so far.

 

Editing

In classical music there is an extra step that happens here. As I said above, with classical recordings, it’s not unusual for all the musicians to play in the same room at the same time when the tracking is happening. However, it is unusual that they are able to play all the way through the piece without making any mistakes or having some small issues that they want to fix. So, usually, in a classical recording, the musicians will play through the piece (or the movement) all the way through 2 or 3 times. While that happens, a Recording Producer is sitting in the control room, listening and making notes on a copy of the score. Each time there is a mistake, the producer makes a note of it – usually with a red make indicating the Take Number in which the mistake was made. If, after 2 or 3 full takes of the piece, there are points in the piece that have not been played correctly, then they go back and fix small bits. The ensemble will be asked to play, say 5 bars leading up to the point that needs fixing – and to continue playing for another 5 bars or so.

Later, those different takes (either full recordings, or bits and pieces) will be cut and spliced together in a process called editing. In the old days, this was done using a razor blade to cut the magnetic tape and stick it back together. For example, if you listen to some of Glen Gould’s recordings, you can hear the piano playing along, but the tape hiss changes suddenly in the background noise. This is the result of a splice between two different recordings – probably made on different days or with different brands of tape. Nowadays, the “splicing” is done on a computer where you fade out of one take and fade into another gradually over 10 ms or so.

Fig 4. A “crossfade” on a modern digital audio workstation. The edit point (what used to be a “tape splice”) is where the recording on the top right is faded out and a different recording (bottom right) of the same music is faded in.

 

If the editing was perfect, then you’ll never hear that it happened. Sometimes, however, it’s possible to hear the splice. For example, listen to this recording and  listen to the overall level and general timbre of the piano. It changes to a quieter, duller sound from about 0′ 27″ to about 0´ 31″. This is a rather obvious tape splice to a different recording than the rest of the track.

 

Mastering Engineer

The final stage of creating a recording is performed by a Mastering Engineer in a mastering studio. This person gets the (theoretically…) “finished” product and makes it better. He or she will sit in a room that has very little gear in it, listening to the mixed song to hear if there are any small things that need fixing. For example, perhaps the overall timbre of the tune needs a little brightening or some control of the dynamic range.

Another basic role of the mastering engineer is to make sure that all of the tracks on a single album sound about the same level – since you don’t want people sitting at home fiddling with the volume knob from tune to tune.

When the mastering engineer is done, and the various other people have approved the final product, then the recording is finished. All that is left to do is to send the master to a plant to be pressed as a CD – or uploaded to the iTunes server – or whatever.

 

Fig 5. A mastering engineer sitting at a mastering console. Notice that, unlike a mixing console, a mastering console does not have a massive number of faders and knobs because it doesn’t have a lot of inputs. Also note that the mastering engineer looks better rested and more cleanly shaven than the recording engineer (above) because he doesn’t have to talk to musicians every day at work. Okay, okay, I’m joking… sort of…

 

In other words the Mastering Engineer is the last person to make decisions about how a recording should sound before you get it.

This is why, when I’m talking to visitors, I say that our goal at Bang & Olufsen is to build loudspeakers that perform so that you, in your listening room, hear what the mastering engineer heard – because the ultimate reference of how the recording should sound is what it sounded like in the mastering studio.

 

Appendicies

What’s a producer?

The title of Recording Producer means different things for different projects. Sometimes, it’s the person with the money who hires everyone for the recording.

Sometimes (usually in a pop recording) it’s the person sitting in the control room next to the recording engineer who helps the band with the arrangement – suggesting where to put a guitar solo or where to add backup vocals. Some pop producers will even do good ol’ fashioned music arrangements.

A producer for a classical recording usually acts as an extra set of ears for the musicians through the recording process. This person will also sit with the recording engineer in the control room, following the score to ensure that all sections of the piece have been captured to the satisfaction of the performers. He or she may also make suggestions about overall musical issues like tempi, phrasing, interpretation and so on.

 

But what about film?

The basic procedure for film mixing is the same – however, the “mixing engineer” in a film world is called a “re-recording engineer”. The work is similar, but the name is changed.

 

So that’s a “Tonmeister”?

A tonmeister is a person who can act simultaneously as a Recording Engineer and a Recording Producer. It’s a person who has been trained to be equally competent in issues about music (typically, tonmeisters are also musicians), acoustics, electronics, as well as recording and studio techniques.