Localisation cues in recordings
Posted: Sat Sep 21, 2013 9:19 pm
A lot of people know that my focus on analysing audio for a while now has been in the area of timing. There are a number of reasons for this focus but one of the main ones being that sound stage is a noticeably better focused in the better sounding systems. I attribute this to better reproduced timing in the reproduced sound field.
The foundation for this sound stage premise is based on psychoacoustic underpinnings - in nature, we localise sounds based on the difference in the timing of the signal reaching the two ears (interaural time difference - ITD) & also on the difference of the intensity of the sound between the ears. Here's an interesting, hands-on demo of just this effect with ITD & IID http://auditoryneuroscience.com/topics/ ... ty-trading
Note: the lowest values used in this demo are ITD of 0.22mS & IID of 3dB - very much bigger than the just noticeable hearing limits for ITD of 2us & IID of 0.05dB
Now the question that always bugged me was that an awful lot of recordings are not live recordings - they are products of a studio engineer - the question being do these engineers observe psychoacoustic rules when creating a sound stage on the recording? Not being intimate with the process I just didn't know - I suspected that just volume differences were used to create this sound stage.
The question then becomes how we deal with this anomaly when it occurs in playback i.e the timing & volume aren't in accordance with natural localisation of sounds. It seems we accommodate to this in our listening?
I came across this recently when doing some further reading of Jneutron's comments on group delay (timing differences across frequencies) in speaker cables. He stated this which makes some sense http://www.audiocircle.com/index.php?to ... #msg479123
The foundation for this sound stage premise is based on psychoacoustic underpinnings - in nature, we localise sounds based on the difference in the timing of the signal reaching the two ears (interaural time difference - ITD) & also on the difference of the intensity of the sound between the ears. Here's an interesting, hands-on demo of just this effect with ITD & IID http://auditoryneuroscience.com/topics/ ... ty-trading
Note: the lowest values used in this demo are ITD of 0.22mS & IID of 3dB - very much bigger than the just noticeable hearing limits for ITD of 2us & IID of 0.05dB
Now the question that always bugged me was that an awful lot of recordings are not live recordings - they are products of a studio engineer - the question being do these engineers observe psychoacoustic rules when creating a sound stage on the recording? Not being intimate with the process I just didn't know - I suspected that just volume differences were used to create this sound stage.
The question then becomes how we deal with this anomaly when it occurs in playback i.e the timing & volume aren't in accordance with natural localisation of sounds. It seems we accommodate to this in our listening?
I came across this recently when doing some further reading of Jneutron's comments on group delay (timing differences across frequencies) in speaker cables. He stated this which makes some sense http://www.audiocircle.com/index.php?to ... #msg479123
Unfortunately, the bulk of the music heard today is of studio origion. All the artists don't even have to be on the same continent.
Given say, 24 individual tracks that are mixed down to two, ITD loses. The mixdown engineer turns a knob to position a track in space.
Music content which has been "panned" for position, is a forced response media. In other words, we humans are forced to interpret the inaccurate localization parameters to determine a position in space.
Unforced, or natural localization, uses both intensity difference as well as time difference. If you get a chance, try altering one or the other, and listen to how your mind reacts. If you reduce one ear's intensity (say with a towel, or a plugged ear due to water), note that you can still determine the source location fairly well...this is because you are sensitive to the time difference. In fact, as you lower one channel intensity, you can "see" that the source direction is "fighting" the intensity error, trying to remain centered. This is ITD at work.
Conversely, if you have a multiplexed dac feeding two channels into headphones, and the S/H system is not syncronous, you get an 11 uSec interchannel delay (typical of really cheap sound cards). With a built in 11 uSec delay, there is nothing you can do with the balance control to get the image field correct... It is really weird.
Nobody measures the ITD information which is presented to our ears to determine if the system has altered it. Since half the information we use to distinguish direction is ignored in the measurements, how then do we determine the accuracy of the delivery system?
Human ITD sensitivity is in the 2 to 5 uSec range. Very small number, eh?
Cheers, John