Jul 18, 2012
Are high sample rates making your music sound worse ?

Yes, you read that right. There’s a real possibility that using high sample rates could actually be reducing the quality of your audio, not making it better.
How is this possible ?
In a nutshell, many “affordable” soundcards have a non-linear response to high frequency content.
Meaning that even though they are technically capable of recording at 96 kHz and above, the small benefits of the higher sample rate are completely outweighed by unwanted “intermodulation distortion” in the analogue stages.
Further down I link to some test files so you can test your own system for this problem, but let’s just slow down for a minute and add a little background information.
What’s the point of high sample rates anyway ?
The sample rate determines how many samples per second a digital audio system uses to record the audio signal. The higher the sample rate, the higher frequencies a system can record. CDs, most mp3s and the AAC files sold by the iTunes store all use a sample rate of 44.1 kHz, which means they can reproduce frequencies up to roughly 20 kHz.
Testing shows that most adults can’t hear much above 16 kHz, so on the face of it, this seems sensible enough. Some can, but not the majority. And examples of people who can hear above 20 kHz are few and far between. And to accurately reproduce everything below 20 kHz, a digital audio system removes everything above 20 kHz – this is the job of the anti-aliasing filter.
But a fair few musical instruments produce sound well above these frequencies – muted trumpet and percussion instruments like cymbals or chime bars are clear examples.
This leads to two potential objections to a 44.1 kHz sample rate – first, that in order to reproduce a sound accurately we should capture as much of it as possible, including frequencies we probably can’t hear. There are various suggestions that we may be able to somehow perceive these sounds, even if we can’t actually hear them. And secondly that depending on the design, the anti-aliasing filter may have an effect at frequencies well below the 20 kHz cut-off point.
Whether these arguments stand up to scrutiny or not, the solution is obvious – record at higher sample rates. The filters can work higher up and be more gentle, and all the high-frequency content can be recorded accurately. Simple, right ?
Well, not entirely. In fact, these arguments don’t really make sense – for an excellent and detailed discussion of why not, check out this article.
So why NOT use higher sample rates, then ?
Back when CD was released, recording at 96 kHz or above simply wasn’t viable at a reasonable price, especially not in consumer audio. Times have moved on though, and these days almost any off-the-peg digital audio chip is capable of at least 96 kHz processing, if not higher.
Now these files take up much more space than simple 44.1 kHz audio, but hard drive space is cheap, and getting cheaper all the time – why not record at 96 kHz or higher, just in case either of those hotly debated arguments really does carry some weight ?
The answer lies in the analogue circuitry of the equipment we use. Just because the digital hardware in an interface is capable of 96 kHz or higher audio processing, doesn’t mean the analogue stages will record or play the signal cleanly.
It’s quite common for ultrasonic content to cause intermodulation distortion right down into the audible range. Or in simple English, the inaudible high-frequency content actually makes the audio you can hear sound worse.
You can read all the gory details in a the same excellent article I linked to above here, but here’s the summary:
…it’s not certain that intermodulation from ultrasonics will be audible on a given system. The added distortion could be insignificant or it could be noticable. Either way, ultrasonic content is never a benefit, and on plenty of systems it will audibly hurt fidelity. On the systems it doesn’t hurt, the cost and complexity of handling ultrasonics could have been saved, or spent on improved audible range performance instead.
Check your own system
If you want to test your own playback system, there are files included in the article – to download them, click here. The files contain various types of test material, but all of it is at frequencies that is completely inaudible to humans.
So interpreting the results is simple:
Assuming your system is actually capable of full 96kHz playback, the above files should be completely silent with no audible noises, tones, whistles, clicks, or other sounds.
For what it’s worth, the onboard sound of my Mac Pro fails all these tests – make sure you’ve set your system to 96 kHz output, if you want to try them yourself.
So what does this mean ?
If you can hear audible sound when you play these test files, then you may be making your audio quality worse by choosing to use 96 kHz or higher sample frequencies !
If that’s the case, then you face the difficult choice – do you spend time and money upgrading to handle these very high frequencies, even though they probably aren’t audible ? Or just optimise for 44.1 kHz, which is still the most common playback frequency ?
Notice I said you may be making things worse – even if your system fails these tests, the music you record may not have ultrasonic content that causes audible problems.
Another test would be to apply a phase-linear high-pass filter to your music at 25 kHz (say) and listen to the result – again, you shouldn’t be able to hear anything. If you can’t, then the high sample rates probably aren’t recording any information which will cause you a problem – but in that case, are you actually getting any benefit from them ?
A controversial question
Finally, the fact that ultrasonic content can potentially cause intermodulation distortion and make things sound different even when they shouldn’t raises a tough question.
Are all the people who claim to be hearing improved quality at 96 kHz and above really hearing what they think they are ? Or are they just hearing intermodulation distortion ?
My experience
It’s been over 6 years since I first tested myself with 96 kHz audio – I compared the SACD of Pink Floyd’s “Dark Side Of The Moon” and a pure DSD live jazz recording – I down-sampled both to 48 kHz and 44.1 kHz and blind tested myself switching between the three.
In each case, I could reliably hear a difference between 44.1 kHz and 48 kHz, but not between 48 kHz and 96 kHz. And, I was left with the distinct feeling that I could compensate for the difference I did hear with a very small EQ tweak on the 44.1 kHz version…
Now, that was just on those two recordings, at that time, using that system – maybe those recordings don’t have anything in them that gets the “benefit” of 96 kHz playback.
Or maybe 48 kHz is actually good enough ?
Leave a comment and let me know what your experiences are with high sample rate recordings.
If you found this post useful, you might like to check out my recent eBook,
“The Best Of Production Advice” – it has all the best content from this site site to help you start improving your recording, mixing and production skills today. For more information, click here.
Related posts:


.gif)





Ian,
First, thanks for focusing our attention on this extremely interesting and thought-provoking topic. I do have a thought or two on the matter.
While I agree with the data presented in the Xiph.org article you’re using as the basis of this post, I do not necessarily agree with one of its seemingly-tacit key premises: that ultrasonic modulation is synonymous with distortion and unwanted aberrations.
Ultrasonic modulation happens all the time in the real (analog) world. Any time you mix two signals containing ultrasonic content, this process will result in additive and subtractive frequencies which can then color frequencies down into the audible range. The mere inclusion of ultrasonic modulation has nothing to do with a value judgment that this is distortion or unwanted. It can just as well present as a character or added dimension of the source audio.
The exception here, of course, is where low-quality electronic components themselves are generating noise at an ultrasonic level, which then cross-modulates and produces extra frequencies in the audible spectra that would not naturally occur. In that case, I would usually agree with the subjective judgment of distortion and noise.
However, at one point the article states, “If the same transducer *reproduces* ultrasonics along with audible content…” (note: emphasis is mine). Observe that the author chose the word ‘reproduces’ rather than ‘generates’, leading me to believe that he is also considering naturally-occurring ultrasonic modulation to be as unwanted as additional component noise.
That nit-pick aside, it’s an interesting article and a good summary analysis on your part. Thanks for bringing that one to our attention, Ian.
_
Thanks for the comment. I read the article to relate only to ultrasonic content generated by the analogue gear.
The bottom line is, there are no intermodulation distortion signals in the test files. If you hear them when you play back your signal, they are being “generated” by the playback system…
@ Dustbunnies.
When you mix “ultrasonic” signals they just mix together. It’s only in the presence of non-linearity that the sum and difference frequencies are created.
If you generate, say, 19kHz and 20kHz if the system is linear you won’t hear any 1kHz.
It’s called “Linear Superposition” In order for the air to do this you need *very* high SPL’s.
Competently designed analogue electronics, will have these distortions in the <-100dB levels.
DC
What Dave said.
Hi Dave, great to see you here ! [Waves]
The real culprit here is not higher sample rates, but bad analog. Bad analog always sounds, well… bad. Higher sample rates simply reveal it more fully.
The xiph.org article is laughable: “None of that is relevant to playback; here 24 bit audio is as useless as 192kHz sampling. The good news is that at least 24 bit depth doesn’t harm fidelity. It just doesn’t help, and also wastes space.”
24bit audio is a significant improvement and if author Monty can’t hear the difference he probably lacks a reasonable monitor setup. I wonder if he advocates cars being engineered to go no faster than the speed limit so as not to waste capacity.
I missed Randy’s comment here, but we bumped into each other on Twitter.
A spirited debate followed – you can read the whole thing here, if you like !
http://storify.com/Ian_Shepherd/are-high-sample-rates-making-your-music-sound-wors
Do we have any idea how much kit produces these errors?
Tried the samples out on my Prism Orpheus and a TC Electronic StudioKonnect 48 I am reviewing with no issues. If its not wide spread is this not simular to saying:
“Some HP laptop speakers fart when bass notes are played, maybe we shouldn’t use 30hz in recordings as few systems can reproduce it”?
Ok obviously its not that extreme but do you see what I am getting at?
Honest answer – no. But there are so many systems advertising “support” which will certainly fail (every Mac’s built-in audio, for example) I think it’s important to make people aware that the numbers don’t tell the same story, just as not all HD TVs give a better picture.
I totally agree. I think the 44.1Khz reproduces a total range of 22.05Khz Stereo, as the frequency number is split between left and right.
I don’t think there is anymore to it then that. I think a lot of audiophiles get caught up in the bigger is better idea. I guess they believe it makes them sound more intellectual then they actually are.
In the digital age of being able to record a professional sounding album in your basement with not much money, they like to pretend they can hear something the average person can’t.
Maintaining the idea they have control over something mystical and mysterious that noob engineers can’t hear or perceive because they just don’t have the ear for it.
Thanks Ian, very insightful article, really got me reconsidering things as a musician and home studio engineer. Very glad I found this website. Cheers!
Sidenote: @ Heavy Metal, the reason why the minimum sample rate is required to be at least double the max. audible frequency is explained by the Nyquist-Shannon sampling theorem. In short, no it’s not because it’s stereo. Also, some people certainly can hear things others cannot, just as how some people do not require eyeglasses, while others do. A producer/technician friend of mine has actually been able to hear signals beyond 20khz when tested for them, though this is indeed extremely rare. As for me, I can hear things that are much lower in volume than the average person, again confirmed by audiologists. The problem is people’s attitudes toward such abilities, either for or against, and how this is exploited primarily by the consumer market.