MPEGplus logo


Frank Klemm's Dither and Noise Shaping Page




Dithering

The noise (original analog signal minus quantized signal) is normally uncorrelated in time and uncorrelated with the signal. This is because the value of the error (-0.5 <= error <= +0.5) is not predictable in any way depending on the last error or depending on the signal itself. This means that the noise is constant and »white«, the power density is (nearly) constant, there are no peaks in it. This is very important, because the human ear can very easily detect For especially low level signals the signal is not »white« and you can find »moving« spikes in the spectrum. Searching for such problematic signals shows that there are more general classes: For demonstrating the problem I have choosen a special synthetic sound. It consists of frequencies between 20 and 200 Hz and I called it »Submarine«. At the end of this page there is a piece of music to shown this is a problem in real music. First the signal in the time and in the frequency domain:

[16 bit Amplitude]

As you can see the signal has a maximum level around -1 dB, not something around -90 dB as you have certainly expected. The dips and spikes are not real, this is a graphical subsampling problem of Cooledit.

[16 bit Spectrum, 0...500 Hz]

This is the spectrum from 0 . . . 500 Hz. The full spectrum range (0 . . . 22.05 kHz) doesn't resolve the 5 spectral lines:

[16bit Spectrum]

Now we quantize the signal down to 10 bit (i.e. we are simulating the quantization of a 22 bit DAC downto a 16 bit DAC at levels below -36 dB, so everyone can test this with a normal cheap 16 bit sound card setting the full scale SPL to something around 100 . . . 105 dB - 36 dB.

[16bit Spectrum rounded]

Urghhh, what's that??? This don't look very nice and it even sounds worse . . .
What happened. Lets see the first second of this signal:

[10 bit 1st second]

You can see the noise signal is a triangle signal with rising frequency. While the amplitude arises the frequency arises. Frequency parts above half the sampling frequency are mirrored down and up, so you can hear a lot of tones. Yes, this is digital audio, not short wave radio ;-)

The first PCM ADC and DAC converters had 14 bit and due to there error they have effectively less than 14 bit. So digital records in the late 70's and the early 80's really sound awfully. This is the reason for topics like "Bad Digital Noise", "Records sounds better than CDs", "Consumers need more than 16 bit/44.1 kHz".

Lets add some noise before quantization:

[16bit Spectrum dither]

As you can see adding some noise before quantization removes all ugly quantization effects by making the signal indeterministic.

The disadvantage is that dithering always reduces the Signal-to-Noise-Ratio by about 3 . . . 4 dB.



Enhanced Dithering

Enhanced dithering computes the entropy of the signal and only add noise (entropy) when the signal's entropy falls below a critical value.

The advantage is that this don't reduce the Signal-to-Noise ratio if this is not necessary. Multiple consecutive quanizations only add a dithering signal once, not multiple.



Noise Shaping

The human ear has a different sensitivity for different frequencies (ATH: Absolute Threshold of Hearing). And it's possible to intentionally correlate the noise in time so it has a frequency spectrum which looks like the ATH:

[Noise shaping frequency response]

Noise shaping always increase the absolute unweighted power of the noise, but reduces the audible weighted noise. The effect depends on the ATH in the range from 0 . . . fs/2, typical maximum values you can reach are:

Sample
frequency
Audible SNR
increasing
technical SNR
increasing
 8 kHz  4 dB -20 dB
12 kHz  3 dB -20 dB
16 kHz  3 dB -17 dB
22 kHz  4 dB -10 dB
32 kHz  5 dB  -8 dB
44 kHz 15 dB -29 dB
48 kHz 18 dB -29 dB
56 kHz 23 dB -27 dB
64 kHz 27 dB -25 dB
72 kHz 30 dB -23 dB
96 kHz 36 dB -20 dB


[noise shape]

You can see below 15 kHz the noise is reduced, above 15 kHz noise is enlarged. In the range from 2 . . . 5 kHz the noise is minimal. Using 96 kHz makes this much more interesting, because you have the range from 22 . . . 48 kHz for additional noise, so you can reach a SNR of about 135 dB with 16 bit audio or 98 dB with 10 bit audio:

[noise shape 96 kHz]

How does the signal look like?

[noise shape signal]



Noise shaping and Dithering

Noise shaped signals have also the problem of noise-signal correlation like non-dithered quantization. But you can still combine noise shaping and dithering. So you have the advantage of noise shaping with the properties of dithering: a constant, but 3 . . . 4 dB increased noise.

[dither+noise shape]



Listening Examples:

The first example is the very beginning of Máire Brennan's »Na Paisti« taken from the album »Perfect Time«.

The song begins with a deep growing louder tone which is known as a source of ugly quantization noise.

You can download the quantized WAVE files (10 bit, 48 or 96 kHz) or the result back converted to 44.1 kHz and encoded with MPEGplus.






Last modified:  2001-04-28                                Visitors:  ???
[eMail]      [Addr]