![]() | All Advertisers |
| Member Services Directory | Classifieds | Reviews | Jobs | Deal Zone | Merchandise | Marketplace | Facebook App | Books, DVDs & Gadgets | Video Vault | Tips & Techniques |
| |||||||
| Tags: advice observations enlightenment, digitalicious, dithering heights, technical techiness |
New Reply | Thread Tools | Search this Thread |
| | #121 | |
| Lives for gear Joined: Jun 2007 Location: West Hollywood, USA
Posts: 1,492
| Quote:
| |
| | |
| | #122 | ||
| Lives for gear Joined: Jun 2007 Location: West Hollywood, USA
Posts: 1,492
| Quote:
Quote:
| ||
| | |
| | #123 |
| Lives for gear Joined: Jul 2006 Location: Germany
Posts: 2,420
| I would say the term does apply WRT to the finer "steps", i.e. changes of amplitude will be sampled/digitized more accurately or more precisely. 24 bit sampling does not provide greater accuracy WRT to the sampled waveform's phase (position), though.
|
| | |
| | #124 | |
| Lives for gear Joined: Mar 2008 Location: Sweden
Posts: 3,960
| Quote:
PCM is working on the basis of binary numbers. A 3bit wordlength can have the following combinations: 000 - 100 - 010 - 110 - 001 - 101 - 011 - 111 and can therefore represent eight different values from zero (000) to seven (111). We can give the first position the value of 1, the second position the value of 2 and the third position the value of 4. The 3bit wordlength for a given voltage representing 0dBFS will give 1/7 of full scale on each step and a dynamic range of 18dB If we increase the wordlength to 4bit we can have the following combinations: 0000 - 1000 - 0100 - 1100 - 0010 - 1010 - 0110- 1110 - 0001 - 1001 - 0101 - 1101 - 0011 - 1011 - 0111 - 1111. The 4bit combinations give us sixteen different values from zero to fifteen. Here we build upon the first 3bit series and the rightmost(?) position have the value 8. For the same voltage as the 3bit example representing 0dBFS we have higher resolution (= lower noise) since each step is 1/15. The 4bit wordlength give us 24dB dynamic range, an increase of 6dB/1bit from the first example with 3bit. Confessiontime: I don't know if I have the details correct here but this is my simplified understanding and I do think it serves the purpose of giving a basic view of the subject. If the signal is static/periodic like music is partly we will have correlation of the quantization steps and that means distortion plus correlated/signal dependent noise. If the signal has non static/periodic content (noise) continously that is higher in level than the quantization steps this will decorrelate the quantization distortion and turn it into noise. In 24bit audio we dont have to add dither since first of all, the quantization errors are so small and also because the analog noise from all earlier stages will serve as dither noise. In 16bit audio the analog noise may be lower than the quantization steps and therefore we may need to add dither noise in order to decorrelate the quantization errors. The general view of those that have a grip on this (the way I understand) is that if the noise from the digital system is inaudible, then that resolution is all you need and that means that for many situations 16bit audio can't be bettered by higher resolution by longer wordlength. /Peter | |
| | |
| | #125 | |
| Lives for gear Joined: Oct 2008
Posts: 624
Thread Starter | Quote:
If one examines the difference file at considerable zoom levels (don't try that using the mp3 version I posted though - the mp3 encoding stuffs the detail), it's what you would expect to see if one follows Nishmaster's explanation - you see the quantisation error signal having exactly the visual character of Nishmaster's very handy illustration where the original audio has samples above 96dB below full scale, and where the audio is entirely below 96dB below full scale, it's the normal-looking waveform representing that audio. And as all values in that difference file have had the first 16 bits set to zero by the process of obtaining the difference by inversion, the reproduced level is bound to peak no higher than 96dB below full scale (or thereabouts). It's also uncontestably true that if you mix (add) that very low level 24-bit difference file to the 16 bit file truncated from the original, you get the original 24 bit recording back, bit for bit. You are simply taking a series of values like 100.24 120.56 099.77 000.33 (pretend that's the original 24 bit file) and splitting them into two sets of numbers 100 120 099 000 (as in the 16 bit truncated file) and 0.24 0.56 0.77 0.33 (as in the 24 bit difference file) and then adding them back together again. Even I can get my head around that bit of maths! | |
| | |
| | #126 | |
| Lives for gear | Quote:
ummm. no. his analogy was FLAWLESS. yours was a bunch of geek talk. there is no need to even continue. ![]() ![]()
__________________ Neither the Beatles, nor Stevie Wonder had Protools.... maybe you just suck. .- overheard in a session | |
| | |
| | #127 |
| Super Moderator Joined: Aug 2002 Location: NYC
Posts: 7,405
|
All due respect for everyone involved with this thread, but... I find this thread fascinating and nauseating at the same time. Is this even possible? I want to stop reading it, but I seem to keep coming back to it every chance I get. And, I'm getting dizzy each and every time I come back for more. Man, I'm totally screwed with this. We may need to shut this sucka down in a while; perhaps we should have a mandatory cut-off if the thread hits post number 437. Or should it be post 504? Come on folks - let us get back to work or play or whatever you do when you're not on this thread. Right or wrong?
__________________ Steve Remote AuraSonicLtd.com the home of ASL Mobile & Location Production Remoteness on the Linkedin Network What about my Facebook Profile? Remoteness on Myspace |
| | |
| | #128 |
| Lives for gear Joined: Oct 2008
Posts: 624
Thread Starter |
I really must do something else... but this is now how I see the whole thing in practical terms (and practical terms are what ultimately matters) - When comparing a 16 bit recording vs a 24 bit recording, the level described in every sample in the 24 bit recording will be slightly more accurate than in the 16 bit recording (unless the sample happens to fall on a 16-bit-expressible value in the 24 bit recording). The significance of the improved degree of accuracy can be considered as, or converted to, or heard as, audio or noise peaking to around -96dB below full scale. If the original audio is at a high level, then the significance (audibility) of something expressable as error converted to audio at that -96dB level may be felt to be in practice insignificant - that's a matter for individuals to decide on. As the original audio approaches -96dB, then the proportion of the error becomes more significant (audible), particularly when replayed at higher than normal levels. When the audio is entirely below -96dB then the 16 bit file is silent and therefore totally in error. Using the truncation and inversion technique I described earlier, you can hear for yourself what the error actually sounds like for any given recording as if it had (by some ideal mechanism) been recorded at 16 bits instead of 24. The result is mostly quantisation noise but in a very good system there will be meaningful audio where passages in the original recording have levels below -96dB. However, in real world recording - particularly on location, getting back to our roots - such levels will rarely be encountered unless the recording level is set deliberately very low. |
| | |
| | #129 | |
| Lives for gear Joined: Oct 2008
Posts: 624
Thread Starter | Quote:
| |
| | |
| | #130 |
| Lives for gear Joined: Oct 2008
Posts: 624
Thread Starter |
But we've not yet started on what the D to A converter actually does with these slightly incorrect samples and what happens if you try to scientifically analyse the actually reproduced sound of the 16 and 24 bit recordings... gulp...
|
| | |
| | #131 | |
| Super Moderator Joined: Aug 2002 Location: NYC
Posts: 7,405
| Quote:
I guess we should keep this battle going until the "cops" show up. Then we can scatter and re-group in another thread for one more blow-out-fest. | |
| | |
| | #132 | |
| 3 + infractions, forum membership suspended. Joined: Jan 2009
Posts: 780
| Quote:
hi, well, i have pretty much stayed on the right page. thank you very much for noticing! the increase in bit depth does in fact result in greater accuracy at all levels. the 24 bit system is capable of registering and describing an amplitude value closer to the actual value presented to it than the 16 bit system is. in that sense it is closer to being able to meet the requirements of the nyquist / shannon theorum. as we recall, one of the problems with the nyquist / shannon theorum is that it assumes an "infinite bit quantizer". as stated elsewhere, "The assumptions necessary to prove the theorem form a mathematical model that is only an idealization of any real-world situation. The conclusion that perfect reconstruction is possible is mathematically correct for the model but only an approximation for actual signals and actual sampling techniques." in any event, an increased ability to represent smaller quantization steps necessarily results in greater accuracy throughout the range with respect to amplitudes. the increased accuracy does not manifest itself solely in the area below -96dBfs. this is well-established, and anyone trying [ad nauseum,, in this thread] to say that the increased accuracy in confined to levels below -96dBfs is misleading others, either deliberately or otherwise. there only seems to be one person trying to say that, unless you are also trying to say that. what i am also saying is that this does not really have to be a "math problem". and all the attempted analogies and so forth are confusing everyone even further. and, respectfully, the way you write your posts is unduly difficult to decipher, because you assume that people are going to be able to follow your somewhat arbitrary, unclear manner of presentation. i was unable to locate my secret decoder ring, so i did not actually even respond directly to your post. but to the extent that you may be trying to argue that 24 bit audio does not capture amplitude more accurately than 16 bit audio, i disagree. and i also think that some are kind of trying to turn this into a mathematics forum, rather than simply stating the obvious. i do not mean to complain, but its probably not very helpful to most of the readers to simply show off math skills over and over again, especially when you do not even seem to agree on what terms to use. and bizarre experiments are not going to show anyone anything, particularly where they are undertaken in an attempt to prove someone's point. i'm always interested, but my understanding has always been consistent with the pohlmann book, and this issue is long and very well established. right. | |
| | |
| | #133 |
| 3 + infractions, forum membership suspended. Joined: Jan 2009
Posts: 780
| hi, with regard to a couple of posts talking about "self-dither", i should mention that the prevailing thinking [for whatever that term may be worth to you] seems to be that relying on "self-dither" is a mistake. circuit noise is not of the same nature as tpdf dither, or any other commonly used type of dither. circuit noise has a significant thermal noise component. apparently for dither to really work well at all, it has to be a specific type of noise. i am just reporting what i have heard from designers about this. i personally am not that crazy about the idea of adding a bunch of noise to signals unless it is really serving some purpose. also, remember that you can generally perceive signal down into an analog noise floor [different than a digital noise floor]. right. |
| | |
| | #134 |
| 3 + infractions, forum membership suspended. Joined: Oct 2008
Posts: 1,978
| http://en.wikipedia.org/wiki/Headroom http://en.wikipedia.org/wiki/Dynamic_range The dynamic range of human hearing is roughly 140 dB. The dynamic range is defined as the ratio of the maximum to minimum amplitude a given device can record and is usually expressed in decibels. The 16-bit Compact Disc has a theoretical dynamic range of 96 dB. 20-bit digital audio is theoretically capable of 120 dB dynamic range; similarly, 24-bit digital audio calculates to 144 dB dynamic range. All digital audio recording and playback chains include input and output converters and associated analog circuitry, significantly limiting practical dynamic range. Observed 16-bit digital audio dynamic range is about 90 dB. Dynamic range in analog audio is the difference between low-level thermal noise in the electronic circuitry and high-level signal saturation resulting in increased distortion and, if pushed higher, clipping. In 1981, researchers at Ampex determined that a dynamic range of 118 dB on a dithered digital audio stream was necessary for subjective noise-free playback of music in quiet listening environments. "dBm" indicates that the reference quantity is one milliwatt, while "dBu" is referenced to 0.775 volts RMS. http://en.wikipedia.org/wiki/Decibel http://en.wikipedia.org/wiki/Signal-to-noise_ratio#Digital_signals When using digital storage the number of bits of each value determines the maximum signal-to-noise ratio. In this case the noise is the error signal caused by the quantization of the signal, taking place in the analog-to-digital conversion. http://en.wikipedia.org/wiki/Analog-to-digital_converter ADC is an electronic device that converts an input analog voltage (or current) to a digital number. The resolution of the converter indicates the number of discrete values it can produce over the range of analog values. An ADC has several sources of errors. Quantization error and (assuming the ADC is intended to be linear) non-linearity is intrinsic to any analog-to-digital conversion. There is also a so-called aperture error which is due to a clock jitter and is revealed when digitizing a time-variant signal (not a constant value). These errors are measured in a unit called the LSB, which is an abbreviation for least significant bit. In the above example of an eight-bit ADC, an error of one LSB is 1/256 of the full signal range, or about 0.4%. ADC resolution in bitinput frequency1 Hz44.1 kHz192 kHz1 MHz10 MHz100 MHz1 GHz81243 µs28.2 ns6.48 ns1.24 ns124 ps12.4 ps1.24 ps10311 µs7.05 ns1.62 ns311 ps31.1 ps3.11 ps0.31 ps1277.7 µs1.76 ns405 ps77.7 ps7.77 ps0.78 ps0.08 ps1419.4 µs441 ps101 ps19.4 ps1.94 ps0.19 ps0.02 ps164.86 µs110 ps25.3 ps4.86 ps0.49 ps0.05 ps–181.21 µs27.5 ps6.32 ps1.21 ps0.12 ps––20304 ns6.88 ps1.58 ps0.16 ps–––2419.0 ns0.43 ps0.10 ps––––3274.1 ps–––––– This table shows, for example, that it is not worth using a precise 24-bit ADC for sound recording if we don't have an ultra low jitter clock. One should consider taking this phenomenon into account before choosing an ADC. Since a practical ADC cannot make an instantaneous conversion, the input value must necessarily be held constant during the time that the converter performs a conversion (called the conversion time). An input circuit called a sample and hold performs this task—in most cases by using a capacitor to store the analog voltage at the input, and using an electronic switch or gate to disconnect the capacitor from the input. Many ADC integrated circuits include the sample and hold subsystem internally. All ADCs work by sampling their input at discrete intervals of time. Their output is therefore an incomplete picture of the behaviour of the input. If the input is known to be changing slowly compared to the sampling rate, then it can be assumed that the value of the signal between two sample instants was somewhere between the two sampled values. If, however, the input signal is changing fast compared to the sample rate, then this assumption is not valid. If the input signal is changing much faster than the sample rate, then this will not be the case, and spurious signals called aliases will be produced at the output of the DAC. The frequency of the aliased signal is the difference between the signal frequency and the sampling rate. For example, a 2 kHz sinewave being sampled at 1.5 kHz would be reconstructed as a 500 Hz sinewave. This problem is called aliasing. To avoid aliasing, the input to an ADC must be low-pass filtered to remove frequencies above half the sampling rate. In A to D converters, performance can usually be improved using dither. This is a very small amount of random noise (white noise) which is added to the input before conversion. Its amplitude is set to be about half of the least significant bit. Its effect is to cause the state of the LSB to randomly oscillate between 0 and 1 in the presence of very low levels of input, rather than sticking at a fixed value. Rather than the signal simply getting cut off altogether at this low level (which is only being quantized to a resolution of 1 bit), it extends the effective range of signals that the A to D converter can convert, at the expense of a slight increase in noise If a signal is sampled at a rate much higher than the Nyquist frequency and then digitally filtered to limit it to the signal bandwidth then there are 3 main advantages:
The current crop of AD converters utilized in music can sample at rates up to 192 kilohertz. Many people[citation needed] in the business consider this an overkill and pure marketing hype, due to the Nyquist-Shannon sampling theorem. high-profile recording studios record in 24-bit/192-176.4 kHz PCM or in DSD formats, and then downsample or decimate the signal for Red-Book CD production. http://en.wikipedia.org/wiki/Nyquist-Shannon_sampling_theorem The theorem states:[1] If a function x(t) contains no frequencies higher than B cps, it is completely determined by giving its ordinates at a series of points spaced 1/(2B) seconds apart.The conclusion that perfect reconstruction is possible is mathematically correct for the model but only an approximation for actual signals and actual sampling techniques. practice, neither of the two statements of the sampling theorem described above can be completely satisfied, and neither can the reconstruction formula be precisely implemented. The reconstruction process that involves scaled and delayed sinc functions can be described as ideal. It cannot be realized in practice since it implies that each sample contributes to the reconstructed signal at almost all time points, requiring summing an infinite number of terms. Instead, some type of approximation of the sinc functions, finite in length, has to be used. The error that corresponds to the sinc-function approximation is referred to as interpolation error. Practical digital-to-analog converters produce neither scaled and delayed sinc functions nor ideal impulses (that if ideally low-pass filtered would yield the original signal), but a sequence of scaled and delayed rectangular pulses. This practical piecewise-constant output can be modeled as a zero-order hold filter driven by the sequence of scaled and delayed dirac impulses referred to in the mathematical basis section below. A shaping filter is sometimes used after the DAC with zero-order hold to make a better overall approximation. http://en.wikipedia.org/wiki/Sampling_frequency sample rate, or sampling frequency defines the number of samples per second (or per other unit) taken from a continuous signal to make a discrete signal. The inverse of the sampling frequency is the sampling period or sampling interval, which is the time between samples.[1] http://en.wikipedia.org/wiki/Clock_rate http://en.wikipedia.org/wiki/Crystal_oscillator 12.288Digital audio systems - DAT, MiniDisc, sound cards; 256 × 48 kHz (28 × 48 kHz). 11.2896Used in compact disc digital audio systems and CDROM drives; allows binary division to 44.1 kHz (256 × 44.1 kHz), 22.05 kHz, and 11.025 kHz http://en.wikipedia.org/wiki/Bit_rate http://en.wikipedia.org/wiki/PCM http://en.wikipedia.org/wiki/Audio_bit_depth For a recording with a 44.1 kHz sampling rate, 2 channels (stereo) and a 16 bit depth: 44100 x 2 x 16 = 1411200 bits per second, or, 1411.2 kbit/s dynamic range in dB is equal to bits * 6.02 + 1.76. 'bit' is the abbreviation for a single 'binary digit', represented by a 0 or a 1. Since a digital 'word' is simply a binary number, the number of bits per word is simply the number of digits that make up the corresponding number. Binary numerics are base-2; thus, each digit can only be a '0' or a '1'. In comparison, traditional decimal numerics are base-10, having digits that can only be 0 through 9. For example, the 16-bit binary number '1001011011001010' is equivalent to the 5-digit decimal number 38602. http://en.wikipedia.org/wiki/Digital_audio In an analogue audio system, sounds begin as physical waveforms in the air, are transformed into an electrical representation of the waveform, via a transducer (for example, a microphone), and are stored or transmitted. To be re-created into sound, the process is reversed its fundamental wave-like characteristics remain unchanged All analogue audio signals are susceptible to noise and distortion, due to the inherent noise present in electronic circuits. All analogue audio signals are susceptible to noise and distortion, due to the inherent noise present in electronic circuits. http://en.wikipedia.org/wiki/Digital-to-analog_converter Maximum sampling frequency: This is a measurement of the maximum speed at which the DACs circuitry can operate and still produce the correct output. it is necessary to use DACs that operate at over 40 kHz. The CD standard samples audio at 44.1 kHz, http://en.wikipedia.org/wiki/Sample_rate_conversion http://en.wikipedia.org/wiki/Oversampling http://www.mother-of-tone.com/creation.htm Altmann Creation ADC is the world's first zero-oversampling high sample-rate audio AD converter. No analog or digital filters. -90 dB background hiss Altmann Creation ADC uses a true 16-bit sampling topology without any filtering and without any digital processing. todays analog to digital converters employ 1-bit or low bit sampling. Todays other AD converters with their low-bit topology perform a time-averaging AD conversion which is subject to mathematical interpretation. A REAL AD conversion, as performed by the Altmann Creation ADC is not subject to interpretation, but it is a real precision measurement of the analog input signal. |
| | |
| | #135 |
| 3 + infractions, forum membership suspended. Joined: Oct 2008
Posts: 1,978
| download this 96kHz test-signal, however it will look distorted when played back on your oversampling DAC. ![]() to record with 44.1kHz sample-rate. The Altmann Creation ADC will record this step-function as it occurs, although it contains higher than Nyquist frequencies. When this recording is played back on a non-overampling DA-Converter, like the Altmann Attraction DAC, it looks like this: ![]() This looks pretty much like the input signal, just a little bit more course, because of the slower 44.1 kHz sample rate (remember, the analog input signal was generated with 96kHz). Now, what happens, if we do the same recording and playback, but use any of today's common ADC and DAC topologies with low-bit resolution and oversampling,as they are employed in every recording studio on this planet ? Well, it then looks like this: ![]() The difference you see, is in the ripple that occurs before and after every step-event. This ripple-distortion is caused by the internal digital filtering Whenever a musical transient is recorded, it will be distorted just like shown in the above measurement. Your ears hear this added 'ripple' and it causes the music to sound unnatural or artificial. the following scopeshot. This was recorded with the Altmann Creation ADC, and played back through a common oversampling DA-Converter: ![]() As you see, the Altmann Creation ADC reduced the ripple-distortion significantly, although the playback is performed by a conventional oversampling DAC. But thers no other A/B audio files to hear/download, same source signal vs. other ADC. strange. The Altmann Attraction DAC Digital to analog conversion is performed by a R2R converter operating on true sample-rate (no oversampling) and true sample-values. This means, that the data on the disc (input data) is converted sample-by-sample with high precision and without altering any data. http://en.wikipedia.org/wiki/Oversampling oversampling is the process of sampling a signal with a sampling frequency significantly higher than twice the bandwidth or highest frequency of the signal being sampled. By increasing the bandwidth of the sampled signal, the anti-aliasing filter has less complexity and can be made less expensively by relaxing the requirements of the filter at the cost of a faster sampler. , oversampling is implemented in order to achieve cheaper higher-resolution A/D and D/A conversion. For instance, to implement a 24-bit converter, it is sufficient to use a 20-bit converter that can run at 256 times the target sampling rate. Averaging a group of 256 consecutive 20-bit samples adds 4 bits to the resolution of the average, producing a single sample with 24-bit resolution. averaging is possible only if the signal contains perfect equally distributed noise (i.e. if the A/D is perfect and the signal's deviation from an A/D result step lies below the threshold, the conversion result will be as inaccurate as if it had been measured by the low-resolution core A/D and the oversampling benefits will not take effect).. a signal with a bandwidth or highest frequency of B = 100 Hz. The sampling theorem states that sampling frequency would have to be greater than 200 Hz. Sampling at 200 Hz would result in β = 1. Sampling at four times that rate (β = 4) would result in a sampling rate of 800 Hz. This gives the anti-aliasing filter a transition band of 600 Hz ( (fs-B) - B (800Hz-100Hz) - 100Hz 600 Hz) instead of 0 Hz if the sampling frequency was virtually 200 Hz. An anti-aliasing filter with a transition band of 600 Hz is much more realizable than that of 0 Hz (which would require a perfect filter). If the sampler went to eight times over then the transition band would increase to 1400 Hz, which means the anti-aliasing filter could be less expensive due to relaxation of the transition band requirements. After being sampled at 800 Hz, the signal (ostensibly with a bandwidth of 400 Hz) could be digitally filtered to have a bandwidth of 100 Hz and then further downsampled to closer to 200 Hz. |
| | |
| | #136 | ||||
| Lives for gear Joined: Jul 2006 Location: Germany
Posts: 2,420
|
Sorry Steve, one last try... Quote:
Please forget about dBs. Think voltages. Look at David's math and also forget about noise floors for now: A standard audio ADC requires 2V peak to peak to light up all the bits. 16 bits = .0305 mV = -96 dBFS = lowest resolvable signal component 24 bits = .00012 mV = -144 dBFS = lowest resolvable signal component A 16 bit converter will divide the range from 0 to 2 V into 65536 steps, i.e. equal steps of 0.000030517578125 V or 0.0305 mV. Any level and any amplitude variation below that (e.g. 0.01 mV) will not be "seen", regardless of the absolute amlitude at any given time. A 24 bit converter divides the same range (0 - 2 V) into 16 million "steps", of 0,00000011920928955078125 V or 0.00012 mV. Any amplitude variation at any absolute position can be increased or decreased by a value as small as 0.00012 mV, e.g. from 2 V to 1,99999988079071044921875 V. This is not limited to the range below -96 dBFS. Quote:
Quote:
Quote:
Maybe, but my point was that in a normal live recording, you're not likely to ever encounter quantization noise (unless you set max. levels at -70 dBFS), even with a 16 bit recording. Ambient noise takes care of that, levels won't likely fall anywhere near even -90... | ||||
| | |
| | #137 | |||||||
| Lives for gear Joined: Mar 2008 Location: Sweden
Posts: 3,960
| Quote:
I think the way you put it is confusing and it would be more correct to talk about faster sampling having higher bandwith which means it has better slewrate. It can track the fast signals better. It does not truly alleviate the ringing (for an identical filter with higher samplerate) but put the ringing higher in frequency. What is your source for improved impulse response (as long as the system has the full audio bandwith) giving an advantage of localization? I think it can be worthwhile to point out that transients (pulsive sounds) end up with right timing even with 44.1kS/s.. in other words the peak of the transient does not have to end up at a point of sampling but it can end up between as well. One needs to understand "the sinc" and the nature of lowpass filters for that. Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
/Peter | |||||||
| | |
| | #138 |
| 3 + infractions, forum membership suspended. Joined: Oct 2008
Posts: 1,978
| Anti-aliasing - Wikipedia, the free encyclopedia anti-aliasing is the technique of minimizing the distortion artifacts known as aliasing when representing a high-resolution signal at a lower resolution. Anti-aliasing means removing signal components that have a higher frequency than is able to be properly resolved by the recording (or sampling) device. This removal is done before (re)sampling at a lower resolution. When sampling is performed without removing this part of the signal, it causes undesirable artifacts such as the black-and-white noise near the top of figure 1-a below. anti-aliasing is often done using an analog anti-aliasing filter to remove the out-of-band component of the input signal prior to sampling with an analog-to-digital converter. Figure 1-a illustrates the visual distortion that occurs when anti-aliasing is not used. Notice that near the top of the image, where the checkerboard is very distant, the image is impossible to recognize, and is not aesthetically appealing. By contrast, figure 1-b is anti-aliased. The checkerboard near the top blends into gray, which is usually the desired effect when the resolution is insufficient to show the detail. Even near the bottom of the image, the edges appear much smoother in the anti-aliased image. Figure 1-c shows another anti-aliasing algorithm, based on the sinc filter, which is considered better than the algorithm used in 1-b. Figure 2 shows magnified portions of Figure 1 for comparison. The left half of the image is taken from Figure 1-a, and the right half of the image is taken from Figure 1-c. Observe that the gray pixels help make 1-c much smoother than 1-a, though they are not very attractive at the scale used in Figure 2. Compare the diamond on the left with the anti-aliased one on the right Enlarged viewFigure 3 Fig 3 shows how anti-aliasing smooths the outline. Text is affected in just the same way. (a) (b) (c)Figure 1 Figure 2. Reconstruction filter - Wikipedia, the free encyclopedia a reconstruction filter (or anti-imaging filter) is used to construct a smooth analogue signal from the output of a digital to analogue converter (DAC) or other sampled data output device. The sampling theorem describes why the input of an ADC requires a low-pass analog electronic filter, called the anti-aliasing filter. For the same reason, the output of a DAC requires a low-pass analog filter, called a reconstruction filter. Ideally, both filters should be brickwall filters, constant phase delay in the pass-band with constant flat frequency response, and zero response from the Nyquist frequency. Practical filters have non-flat frequency or phase response in the pass band and incomplete suppression of the signal elsewhere in theory a DAC gives a series of impulses, in practice, the output of a DAC is more typically a series of stair-steps. The low pass reconstruction filter smooths the stair step (removes the harmonics above the Nyquist limit) to (re)construct the analogue signal corresponding to the digital time sequence. Nyquist–Shannon sampling theorem - Wikipedia, the free encyclopedia http://en.wikipedia.org/wiki/Samplin...nal_processing) Digitizing - Wikipedia, the free encyclopedia Anti-aliasing filter - Wikipedia, the free encyclopedia Metric system - Wikipedia, the free encyclopedia 1microvolt is 1⁄1,000,000 of a of a Volt. 0 dBu = 0.775v thermal noise is less than 1microvolt. 16-bits is 65.536 20-Bits is 1.048.576 24-Bits is 16.777.216 audio bit-depth is limited by thermal noise. but... Sampling Rate is not limited by thermal noise. but by wordclock jiter. 192khz at 24-Bits, needs 0.10ps of maxium jitter. Second - Wikipedia, the free encyclopedia 1 picosecond is 10–12 seconds. = 0,00000000001 converter needs to do oversampling. jitter is limited by crystal quality, accuracy, temperature, and electric ripple & noise... also if its an external clock, also limited by cable quality, purity of the materials, size, shield, etc... why record at 24-Bits if thermal noise limits to 20-Bits? easy... becouse at 24-Bits, still theres more resolution even if thermal noise only allows to see 20-Bits, theres less steps in those same "20-Bits of dynamic range & noise floor" at 24-Bits, theres no need for dither, becouse the LSB is not truncated, etc.. sampling rate is two things at same time, limits the maxium frequency range. "vertical", and horizontal, the maxium of splits/steps/sampling points 1 second of audio can have. if you split 1 second in 1trillion, you get verry accurate picture, but assuming theres no jitter, becouse that is limited by clock jitter. 44.100 is needed for 22050hz audio bandwith, 44.100x16x2 = 1.411.200 data rate, 88.200x16x2 = 2.822.400 data rate, 192.000x24x2 = 9.216.000 data rate. transfer rate is more than double, becouse the Biphase Mark Code. Biphase mark code - Wikipedia, the free encyclopedia Acoustics - Wikipedia, the free encyclopedia |
| | |
| | #139 | |
| Lives for gear Joined: Oct 2008
Posts: 624
Thread Starter | Quote:
Let me repeat - you can take any 24 bit file, however it was created, truncate it to 16 bits, invert that against the original, and you will have the difference between the two, which will consist of the content of the least significant 8 bits of the 24 bit file. Where the stripped-off most significant 16 bits were greater than zero, the remaining 8 bits will sound as noise - not surprisingly as they are not meaningful without the original data contained in the original most significant 16 bits. Where the original most significant 16 bits were zero, then the whole of the original signal is preserved in the remaining 8 bits, and whatever audio is present in those samples can be heard because no change was made by the inversion. When the sub-96dB difference file is mixed with the 16 bit file, you get the original 24 bit file back, bit for bit. If you think this is wrong, don't say so till you've tried it yourself, I would respectfully request. So far the test has been attacked on the basis that there's something wrong with the DAW(s) used, and now that there's something wrong with the file used. There is nothing wrong with the DAW or the file, nor the test. I have not the slightest doubt that the use of 24 bits provides a lower noise floor throughout a recording. What I still doubt is whether the benefit of the added resolution at all levels translates to anything other than that lower noise floor, as some appear to imply (and some assert it directly). If terms such as 'more air' or 'less compressed' or 'wider stereo image' (quoting David from memory, and picking on him because he has made the most specific claims, as I recall it!), or 'greater resolution', are synonyms for 'lower noise floor', then I think it's important to just use that description, otherwise there's a danger that some may think there's some other quality which can be heard and described - or even analysed. | |
| | |
| | #140 | |
| Lives for gear Joined: Jun 2007 Location: West Hollywood, USA
Posts: 1,492
| Quote:
If you record 24-bit audio with peaks at -48 dBFS and then boost the levels digitally by 48 dB, you essentially have the equivalent of a 16-bit recording made with peaks at 0 dBFS. The benefit of recording at 24 bits and peaks at -48 is that you have 48 dB of headroom as a safeguard against clipping. | |
| | |
| | #141 | |||||||||
| 3 + infractions, forum membership suspended. Joined: Jan 2009
Posts: 780
| Quote:
i don't think that we are talking about only slew rate here. and i actually think you may be talking about something quite different than i am, or at least a different aspect. see below. faster sampling involves greater bandwidth, which yields better amplitude impulse response. this is separate than time / frequency domain stuff, although they are also improved. you appear to be referring exclusively to time / frequency domain response, and that is not what i was talking about, nor is it what the graph that was posted is intended to convey. there are many kinds of impulse responses that can be tested for a given system, and i believe that they do not all interrelate the way that time / frequency domain does. you are talking about fourier, maybe? Quote:
lavry does not talk about amplitude response. moreover, to my knowledge, he does not deny that impulse response is superior in faster sampling with regard to amplitude. Quote:
and i believe i understand his point. the fact that not every transient falls exactly on a sample point does not necessarily mean it will not be represented upon reconstructionQuote:
![]() all practical, realizable filters relevant to this discussion ring, as far as i have been advised. Quote:
again, this "two sided relationship" is not what is being discussed by the chart, or me, or others. we are talking about amplitude. we mean amplitude as a distinct construct, seperable from frequency or speed [time]. Quote:
something sampled at 192kHz need only be bandlimited to about 96kHz. you don't need much of a filter, if any, to do that. if you are suggesting that ringing does not exist, then you are the lone ranger on that one. david rick suggested an aes paper [a few pages back, i think] that may have some useful information on this. they have been working on this stuff for a while, i think. dcs used to make converters that have numerous different filters. some were better as far as less ringing, some better as far as other stuff. all of these filters, and indeed any dsp process made a part of conversion requires increased latency. latency sucks, and we are at a point where the only option is going to be faster sampling, because the "audiophile" converters are already not practical for tracking and overdubbing from a latency standpoint. but i digress. Quote:
no one has argued that samplerate converting to a lower sample rate would ever yield an improvement in impulse response. Quote:
Quote:
right. | |||||||||
| | |
| | #142 | |
| Lives for gear Joined: Oct 2008
Posts: 624
Thread Starter | Quote:
Here's some actual sample values obtained when truncating a 24 bit recording to 16 bits. These come from Audition which has the occasionally handy ability to save an audio file to text - 24 bit version - 2601.863 2445.293 2279.117 2103.984 1920.586 1729.641 1531.906 1328.145 1119.176 905.8008 688.8711 469.2383 247.7539 25.30859 -197.2461 -419.0312 -639.1562 -856.7773 -1071.035 -1281.082 -1486.09 -1685.27 -1877.824 -2062.996 -2240.074 -2408.344 -2567.152 -2715.875 -2853.934 -2980.77 16 bit version - 2602 2445 2279 2104 1921 1730 1532 1328 1119 906 689 469 248 25 -197 -419 -639 -857 -1071 -1281 -1486 -1685 -1878 -2063 -2240 -2408 -2567 -2716 -2854 -2981 Looking at these values I'm wondering how one would express the error in the 16 bit file as a percentage of the more correct version in the 24 bit file - or otherwise come up with a meaningful way of saying how much more accurate the 24 bit file actually is. Is it enough to consider, sample by sample, the percentage difference between the two figures, or should one somehow take into account the change from one sample to another - in other words, taking into account the rate of change as well as the error? Or maybe it's not worth thinking about. | |
| | |
| | #143 | ||
| 3 + infractions, forum membership suspended. Joined: Jan 2009
Posts: 780
| Quote:
Quote:
hi, d_fu, i am not sure i see what you could find incorrect about ozpeter's statement taken by itself. i think ozpeter may be arguing something slightly different. ozpeter, are you trying to say simply that the magnitude of the increased accuracy in 24 bit recordings represents [or is equal to] the magnitude of the difference in error between a 24 bit recording and a 16 bit recording [same source, obviously]? or are you trying to say that only errors in sounds whose total amplitude [fundamental and harmonic] lie below the -96dBfs point are improved by 24 bit recording? or something else altogether? right. | ||
| | |
| | #144 | |
| 3 + infractions, forum membership suspended. Joined: Jan 2009
Posts: 780
| Quote:
hi, space2019 has definitely taken at least 5 of the red ones in order to accomplish such a feat. not quite sure what it may mean, but there sure is a lot of it. right. | |
| | |
| | #145 | |
| 3 + infractions, forum membership suspended. Joined: Jan 2009
Posts: 780
| Quote:
you may want to consider, for starters, that the error you are showing would be multiplied countless times, as a practical matter, in most recordings. right. | |
| | |
| | #146 | ||
| Lives for gear Joined: Jul 2006 Location: Germany
Posts: 2,420
| Quote:
Quote:
| ||
| | |
| | #147 |
| 3 + infractions, forum membership suspended. Joined: Jan 2009
Posts: 780
|
hi, i want to mention that i don't see analog noise floor of converters as such a deal-breaker as some others seem to feel. you can hear signal mixed in with analog noise. it seems to me that the analog noise floor of converters does not necessarily make resolving low level signals a waste of time, because the signals can still be perceived with and through the analog noise, no? digital quantization noise, on the other hand results from the low level signals not even being quantized at all, at least sometimes [or being quantized completely erroneously]. so there you have distortion without any actual signal at all. right. |
| | |
| | #148 |
| 3 + infractions, forum membership suspended. Joined: Jan 2009
Posts: 780
| |
| | |
| | #149 | ||
| Lives for gear Joined: Jul 2006 Location: Germany
Posts: 2,420
| Quote:
Quote:
Daniel | ||
| | |
| | #150 | |
| 3 + infractions, forum membership suspended. Joined: Oct 2008
Posts: 1,978
| Quote:
easy to analize, if you cannot hear it, use a software like rightmark RMAA http://audio.rightmar.org 24-Bits are not always 24-Bits, 16-Bits are not always 16-Bits. for example: SoundBlaster Live! 5.1 24-Bits pci, sounds like 10-Bits when i hear the same song, with same amplifier, same loudspeakers, same wordclock, "s/pdif out"... vs. Roland MMP-2 a song recorded with Pacific Microsonics ofcourse. sounds like half the song is missing. also all DAW Softwares sounds diferent. Sonar 32-bits with 32.Bits engine, sounds totally diferent than same Sonar with 64-Bit engine active, but not becouse the Bits, its becouse the diferent algorithms needed to process "mix the audio." for example: Sonar 32-Bits at 32-Bit engine, sounds totally diferent than Cubase/Nuendo at 32-Bits. what sounds best? i hear Sonar 32/64eng. sounds 100% transparent, all others dont. | |
| | |
New Reply
Facebook
Twitter
LinkedIn
| Thread Tools | Search this Thread |
| Similar Threads | ||||
| Thread | Thread starter | Forum | Replies | Last Post |
| audio driver bit depth vs. recording bit depth | musaee | So much gear, so little time! | 2 | 31st December 2009 03:15 AM |
| bit rate vs. bit depth | DNR | So much gear, so little time! | 6 | 21st July 2008 06:07 PM |
| Session Bit Depth | QjeffQ | Post Production forum! | 3 | 13th March 2008 09:59 PM |
| Hardware I/O bit-depth vs. DAW bit-depth | strat65 | So much gear, so little time! | 2 | 30th October 2005 04:27 PM |
| two questionsabout bit depth in PT | thenewyear | So much gear, so little time! | 12 | 15th January 2004 02:24 AM |
| |