![]() | All Advertisers |
| Member Services Directory | Classifieds | Reviews | Jobs | Deal Zone | Merchandise | Marketplace | Facebook App | Books, DVDs & Gadgets | Video Vault | Tips & Techniques |
| |||||||
| Tags: advice observations enlightenment, digitalicious, dithering heights, technical techiness |
New Reply | Thread Tools | Search this Thread |
| | #1 | ||
| Lives for gear Joined: Oct 2008
Posts: 624
Thread Starter |
In the 192kHz thread which most here are probably no longer reading, a side issue arose concerning bit depth. In response to an earlier statement I posted - Quote:
Quote:
However, when recording audio and you want to record at conservative levels to avoid digital overs, of course 24 bit recording has its place, and likewise in subsequent post-processing. But the theory set out above would indicate that a 24 bit original recording peaking to full scale will not generally sound any different from a 16 bit original recording under the same conditions, unless you are able to hear recorded components below around -96dB referenced to full scale. To determine that, generate a 24 bit -96dB sine wave. Mix it with the fortissimo end of a symphony peaking to full scale, and have the sine wave extend beyond the end, into what would otherwise be silence. Replay it at a tolerable level. Can you hear the sine wave (a) during the music or (b) afterwards? If you can, then you should be able to hear the benefit of the added dynamic range provided by a 24 bit recording. If you can't, then it would seem unlikely that you can hear the difference between a 24 bit recording and a 16 it recording, as that difference lies purely in the audio at levels 96dB below full scale. | ||
| | |
| | #2 |
| Gear Head Joined: Jan 2006 Location: Washington, DC
Posts: 70
|
I don't think your understanding of analog to digital conversion is accurate. Additional bits add accuracy to the conversion across the entire dynamic range, it doesn't add the ability to capture increasingly softer sounds, as you imply. My best analogy when I'm teaching this is climbing a mountain. The size of the mountain doesn't change, but the number and height of the steps leading to the top do. With 8 bit converters there are 2^n or 256 steps. 16 bit = 65536 steps. 24 bit = 16777216 steps. To put it another way, if the mountain is 1 Volt "tall", each 8 bit step is 3.91 mV high, whereas each 24 bit step is 59.5 nV high. As a result, when recording a symphony orchestra the recordist can capture the actual loudness of the group with much greater accuracy, whether in loud passages or soft, with a 24 bit converter than with an 8 bit converter. kj Edit: I didn't realize this dead horse was so well beaten. Sorry for creating added noise on this off topic post. |
| | |
| | #3 | ||||||||||
| 3 + infractions, forum membership suspended. Joined: Jan 2009
Posts: 780
| Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
hi, you apparently don't want to understand, and your analysis is wildly incorrect. in 24 bit audio, the louder components are also resolved with greater accuracy than they are in 16 bit audio. right. | ||||||||||
| | |
| | #4 | |
| Lives for gear Joined: Oct 2008
Posts: 624
Thread Starter | Quote:
| |
| | |
| | #5 |
| Lives for gear |
I definitely don't have the background to join this debate, and I'll probably end up with a bloody nose, but here is my own thought: most gear in the signal chain before the converter has an audible and measurable noise floor - even good mic's add as much as 18dB of noise, preamps add their own EIN, etc. By the time we reach the converter, we have a "significant" amount of electronic noise that would be greater than the very smallest signal we could generate. Add to that the fact that most gear has a maximum of some sort: mic's have a maximum SPL and are also limited by their own sensitivity, ditto for preamps - most can't take more than 24dB in and then have a limitation on their output before clipping. If I were designing a converter, I would know of these limitations and take them into account. It would make sense that I would not waste my increased resolution (i.e. my last 8 bits) in the range where there would be naught but electronic noise - I would spread it across the usable dynamic range of a system. As far as the numbers go, I would think you can have the numbers represent anything you want them to - I could have my 24-bit steps be just as large as my 16-bit steps and so represent a greater dynamic range (but who needs 144 dB of dynamic range?); or I could spread my 24-bit steps across the same dynamic range as my 16-bit steps and so have less quantization error. I would also imagine that it is possible to do something in between. I wish a designer would chime in here. I am far too limited by own ignorance and would like to know how it works in the practical sense.
__________________ "Everybody gets so much information all day long that they lose their common sense." - G. Stein 1946 The reputation of a thousand years may be determined by the conduct of one hour. - Japanese Proverb "Look into his face and hear the music of the ages. Don't pay too much attention to the sounds--for if you do, you may miss the music." - George Ives http://www.andersonsoundrecording.com |
| | |
| | #6 |
| Lives for gear |
This is all really fascinating to me. I don't have the tech background to make any meaningful posts but I am interested to hear from those that do. One thing I did notice last week while recording a vocal recital - the entire dynamic range of the performance was about 60db. That's well within the limits for even 16bit audio. But depending on really who is "correct" here 24bits would still be an improvement, or it might not be anything different than 16bits. |
| | |
| | #7 |
| Lives for gear Joined: May 2005 Location: EU
Posts: 2,431
|
It would be interesting if someone with a solid design background would chime in and explain what the challenges and limitations are in regards to bit depth, impulse response and the like. I would also gladly take any suggestions as to a solid, yet pedagogical textbook that explains this thouroughly. |
| | |
| | #8 |
| Gear Head Joined: Jan 2006 Location: Washington, DC
Posts: 70
|
OK, because pictures are sometimes more useful than explanations, I threw together very quick and dirty unipolar (not peak to peak, but zero to peak) transfer functions for two ideal, linear ADCs, 3-bit and 4-bit, considering only analog input voltage and output digital code, and neglecting all types of error and filtering. Increasing from 3-bit to 4-bit increases resolution at all input voltage levels, not just at lower input voltage levels. kj The basis for the transfer function came from here: http://focus.ti.com/lit/an/sbaa147a/sbaa147a.pdf. FS = Full Scale |
| | |
| | #9 |
| Lives for gear |
Hi Karl Thanks for the picture. This diagram that you have I would understand to be an example of maintaining the same dynamic range, but spreading more bits across it to reduce your quantization error. This would not be the same if we were using our increased values to represent a larger range; i.e. 111 is the top of your 3 bit scale; this could be equal to 0111 on your 4-bit scale, and in this respect would use the addition of the most significant bit increase the scope of the entire scale. This, I think, is the crux of the matter - what are we using these additional bits for - greater scope or less error? |
| | |
| | #10 | |
| Gear Head Joined: Jan 2006 Location: Washington, DC
Posts: 70
| Quote:
On the other hand if by added scope you mean that the overall range stays the same, but that the region from -1V to +1V is broken down into smaller increments than the rest of the range, then you're talking about a non-linear converter, which I would agree is pretty much useless, so it's a good thing that ADCs aren't designed that way. kj | |
| | |
| | #11 |
| Gear addict Joined: Oct 2008 Location: Burbank, CA
Posts: 446
| I don't understand why there is any controversy about this. There should be only one way to do the math, otherwise none of the digital audio equipment we use would work! As far as I understand it, in 16 and 24 bit recording the minimum and maximum possible values represent the same original SPL, although the binary values will be different. In 24 bit recording we simply have more available values to represent the original signal with greater precision. In other words, in 16 bit recording no signal (zero SPL) is represented by 0000 0000 0000 0000. In 24 bit recording no signal is represented by 0000 0000 0000 0000 0000 0000. In 16 bit recording the maximum signal is represented by 1111 1111 1111 1111. In 24 bit the maximum signal is represented by 1111 1111 1111 1111 1111 1111. Both these numbers represent the exact same original maximum SPL, even though one is a larger number. A 24 bit measurement system gives us 16,711,680 more available values than a 16 bit one. Therefore, 24 bit recording is approximately 256 times more precise than 16 bit at all voltages, from no signal to maximum signal. So why this talk about greater dynamic range in 24 bit if the minimum and maximum values remain constant? Simple: in 16 bit the top two available values are 1111 1111 1111 1110 and 1111 1111 1111 1111. In 24 bit, there are approximately 256 more available values between those two steps! The same is true at the bottom end, at the quietest dynamic ranges. |
| | |
| | #12 | |
| Lives for gear Joined: Jun 2007 Location: West Hollywood, USA
Posts: 1,492
| Quote:
![]() Using the old 1-bit-equals-6-dB rule of thumb, 16 bits can represent 96 dB of dynamic range while 24 bits can represent 144 dB. Modern electronic circuits top out at about 120 dB of dynamic range, so beyond 20 - 21 bits the issue of dynamic range becomes moot and the issue of resolution takes center stage. Given a measuring stick which is 144 cm in length, and the maximum height of the thing to be measured is 120 cm, you'll be able to measure the thing more precisely if your measuring stick is calibrated in tenths of a millimeter than in centimeters. | |
| | |
| | #13 | |
| Gear addict Joined: Oct 2008 Location: Burbank, CA
Posts: 446
|
Yeah, no kidding!!! ![]() Quote:
Looking at the graph we can clearly see that a 3 bit converter would divide the range (0 to 10 volts) into 8 equal sections. A 16 bit converter would divide the same range into 65,536 equal parts, thereby creating a much more accurate representation of the original analog signal. | |
| | |
| | #14 |
| Gear Head |
I have to preface this by saying that I'm still a student and therefore may be far out of my depth. To me, the idea of dynamic range and resolution are intrinsically linked as far as they relate to bit depth in digital audio. A signal at -96dB FS in 16 bit would be represented as 16 zeroes, but so would any incoming signal at a lower level than this. Therefore, since there is no more resolution available below this point, this is the functional "bottom" of the dynamic range of a 16 bit signal. On the other hand, -96dB FS in 24 bit would be represented as 0000 0000 0000 0000 1000 0000 (I think), and therefore there are another 7 bits of resolution below this level, which pushes the dynamic range of this signal that much lower. The only difference between the two is that 16-bit has no way to represent signals that fall between 0dB SPL and -96dB FS, while 24-bit does. If I'm way off with this, please don't hesitate to put me in my place. Edit: just realized that this post is completely redundant, sorry. Last edited by oshearer; 11th April 2009 at 10:01 PM.. Reason: whoops |
| | |
| | #15 | |
| Gear addict Joined: Oct 2008 Location: Burbank, CA
Posts: 446
| Using that logic wouldn't a hypothetical 128 bit ADC be able to reach all the way down to 768 dB below FS? That's some mighty quiet music (if that number even means anything)! ![]() Quote:
| |
| | |
| | #16 |
| Gear Head |
I don't think that shows a flaw in the logic. Sure a 128 bit ADC could reach that far down, but there would be no point in terms of dynamic range, only resolution. Even very good A/D converters only have a dynamic range of about 120 dB, therefore the difference between 20 bit and any greater bit depth is only realized in the practical realm as increased resolution. Going to the theoretical side of things, the bottom limit of human hearing is .00002 pascals. Assuming we had equipment capable of recreating that signal without adding noise (impossible, I know), it would translate to approximately -150dB FS if no amplification were applied after the signal leaves the microphone (assuming a microphone with sensitivity=10mV/Pa and +18dBu=0dB FS). So even though the hypothetical 128 bit ADC would have resolution at that level, that number doesn't actually mean anything in relation to the physical limitations of human hearing. |
| | |
| | #17 | |
| Lives for gear Joined: Jun 2007 Location: West Hollywood, USA
Posts: 1,492
| Quote:
If you're going to express digital levels in terms of dB below 0 dBFS, which is the proper way to do it, 16 bits can represent -96 dBFS to 0 dBFS; 24 bits can represent -144 dBFS to 0 dBFS. With modern audio circuitry capable of -120 dBFS (noise floor) to 0 dBFS, given 24 bits you've got more dB of range than you actually need -- like using a 12" ruler to measure a 10" thing (we all have a 10" thing, haven't we?). In addition, your 24 bit ruler is calibrated in (metaphorically speaking) 1/64ths of an inch as opposed to 1/16ths of an inch for your 16-bit ruler, which is why we say 24 bits have more resolution. In digital audio you can't have more than 0 dBFS. Full scale is full scale and that's where signals get clipped. | |
| | |
| | #18 |
| Gear Head |
Oh, I know they're different units, I was just using 0dB SPL as a clumsy way of saying "nothing." I would have been better off saying -∞dB FS. By the way, I like that ruler analogy.
|
| | |
| | #19 |
| Lives for gear Joined: Jun 2007 Location: West Hollywood, USA
Posts: 1,492
|
I like the 10" thing analogy |
| | |
| | #20 | |
| Lives for gear Joined: Mar 2004
Posts: 561
| pictures and books Quote:
Kjetil: There are several popular books on digital audio. I've never looked at Ken Pohlman's book, but I have a copy of John Watkinson's The Art of Digital Audio, and it covers a lot of ground. People who want to learn digital audio fundamentals without having to look at equations should consider Digital Audio Explained by Nika Aldrich. There's a lot of DSP in modern audio equipment. Most working audio engineers have very limited DSP background, because the subject is inherently very mathematical. Dip more than your toe in the water, and you can easily drown in Z-transforms, analytic functions, and so forth. To do justice to the subject, one should probably study linear system theory first. But there's a recent textbook that turns this idea on its head, and uses DSP as a starting point. DSP First: A Multimedia Primer I can't say whether the authors (two of whom are very well known) actually succeed in this or not. Be warned that learning this subject is a "project", no matter how good the text. Also, you'll probably need to buy the student edition of MATLAB to work the problems and examples. David L. Rick | |
| | |
| | #21 | |||
| Lives for gear Joined: Oct 2008
Posts: 624
Thread Starter | Quote:
Quote:
Quote:
But note that in neither staircase are the steps evenly spaced. Sound as expressed in dB is logarithmic. As you go down the staircase from the top of the mountain, the height of each step gets smaller and smaller. (If it was a ruler, the graduations get closer and closer together). The two staircases line up perfectly until you get to the bottom of the 16 bit staircase, and then the 24 bit staircase continues towards sea level with a huge number of extra steps, this being necessary because the height of each step continues to reduce. So to get anywhere near the bottom of the mountain, you need an considerable number of extra diminishing steps. We can examine this at a purely practical level too. Say somebody comes to you with a 16 bit recording, and a 24 bit recording, of the same material. Let's say it's a symphonic recording. You are asked to identify which is which. If the 24 bit recording had some magical quality that made it audibly superior at any point, you wouldn't have to go hunting for a very, very quiet passage and listen to it at an unnaturally high level to hear which was the version with the best reproduction of that low level information. But that's the only way you could do it, based on the Journal of the Audio Engineering Society (the study by Meyer and Moran, vol 55 issue 9, Sept 2007). Or is there some other attribute that you could describe that is evidenced by 24 bit audio vs 16? Take another example - if you have a 24 bit recording and truncate it to 16 bits, if throughout the recording the audio is now represented by a 256th of its previous resolution, surely there would be a really dramatic degradation of the audio quality throughout? But that doesn't happen (again I refer back to Meyer and Moran's study). So how can we account for that? By simply understanding that only the component of the audio that is least audible has been changed - the audio at the very lowest levels. And another one - what do you get when you truncate a 24 bit recording to 16 bits and invert it against the original (thus revealing the difference) - almost nothing. Certainly nothing that would support any assertion that above 96dB below full scale the 24 bit recording has a content which differs from the 16 bit version. I could go on but hopefully I've clarified the incorrect assumptions in some preceding posts. | |||
| | |
| | #22 |
| Gear Head |
Thanks for the book suggestions. I'm always trying to find out more about the inner workings of my equipment, both analog and digital, so these should be very helpful.
|
| | |
| | #23 | |
| Lives for gear Joined: Nov 2005 Location: Australia
Posts: 1,323
| Quote:
Let's try this way. When we record a concert, we are recording voltage linearly. We are recording a voltage time waveform which has excursions plus and minus around zero volts. The voltage waveform is what you are looking at in your DAW when editing or mastering. It might help to not get sidetracked on SPL and logarithms and our hearing mechanism and such as this is all irrelevant. Now that voltage time waveform is sampled at Fs, and it is amplitude sliced into even steps of voltage which is some fraction of the full scale voltage which will overload our A/D. The size or fraction of the voltage amplitude steps is full scale volts P-P divided by the resolution afforded by the bit depth. Correct? | |
| | |
| | #24 | |
| Lives for gear Joined: Jun 2007 Location: West Hollywood, USA
Posts: 1,492
| Quote:
If it takes 2V P-P to light up all the bits of an A-to-D converter, 2V divided by 2^24 means each step represents 0.00000011921 volts. | |
| | |
| | #25 |
| Gear Head |
"Take another example - if you have a 24 bit recording and truncate it to 16 bits, if throughout the recording the audio is now represented by a 256th of its previous resolution, surely there would be a really dramatic degradation of the audio quality throughout? But that doesn't happen (again I refer back to Meyer and Moran's study). So how can we account for that? By simply understanding that only the component of the audio that is least audible has been changed - the audio at the very lowest levels." I may be reading your argument incorrectly, but it seems as though you're stating that there is no increase in resolution at levels above -96dB FS. But if were to start with the highest possible value in a 16 bit system, 0111 1111 1111 1111, to decrease it by the smallest possible amount, we would change the least significant bit, getting 0111 1111 1111 1110. This leaves only one bit of resolution between these two values. If we were to replicate the same two values in a 24 bit system, we would have 0111 1111 1111 1111 0000 0000 and 0111 1111 1111 1110 0000 0000, which gives us a full 8 bits of resolution between the two values. "A signal that's 6 dB down would be 0011 1111 1111 1111" The 16 or 24 bit word does not actually represent a dB value. It is instead an approximation of the instantaneous voltage value of the incoming signal at the time when that particular sample was taken. As we get a longer word, we come closer to approximating that instantaneous value. A -6dB FS sine wave is not represented by a string of the above binary word, but a string of constantly changing binary words that approximate the voltage value at a given point in time. Edit: Haha, I'm too slow a poster, my posts keep being made redundant as I'm writing them. |
| | |
| | #26 | |
| Gear Head | Quote:
Edit: Seems to make more sense that the MSB would toggle between +0.5V and -0.5V, then the next would go either 0.25V above or below the value set by the MSB, and so on down the line. | |
| | |
| | #27 | |||
| Lives for gear Joined: Mar 2004
Posts: 561
| Quote:
Quote:
Quote:
All modern audio converters have (ideal) transfer functions using fixed step size. Let's suppose that a 16-bit converter has a full scale input range of plus or minus one volt. All its steps are the size of the least significant bit: LSB = 2^(-15). For an ideal 24-bit converter with the same analog input range, LSB = 2^(-23). By the way, hearing is not really logarithmic. That's widely believed, but it is wrong. If you want to learn an approximate mathematical rule for loudness perception, it's actually a power law, not a logarithm. Here's how my psychoacoustics professor explained it to me nearly thirty years ago: "Equal ratios in sound intensity yield equal ratios in perceived loudness, but those ratios are not the same.We're all familiar with the rule of thumb that a 10 dB increase sounds about twice as loud. 10 dB is an intensity ratio of 10x, but the corresponding loudness ratio is 2x. David L. Rick | |||
| | |
| | #28 | |
| Lives for gear Joined: Jun 2007 Location: West Hollywood, USA
Posts: 1,492
| Quote:
1 2 4 8 16 32 64 128 256 512 1024 2048 4096 8192 16384 32768 65536 131072 262144 524288 1048576 2097152 4194304 8388608 This adds up to 16777215. From 0 to 16777215 is 16777216 values or 2^24. | |
| | |
| | #29 | ||
| Lives for gear Joined: Jun 2007 Location: West Hollywood, USA
Posts: 1,492
| Quote:
Quote:
| ||
| | |
| | #30 |
| Lives for gear Joined: Oct 2008
Posts: 624
Thread Starter | ...which in audio translates, in practice, purely to considerations of the dynamic range of the system. I ask again, how do you hear the increased resolution which you are stating exists in a 24 bit system, as compared to a 16 bit system, above approximately 96dB below full scale? In what way is it useful or desirable? How do you account for the 16 bit vs 24 bit inversion phenomenon that I have described?
|
| | |
New Reply
Facebook
Twitter
LinkedIn
| Thread Tools | Search this Thread |
| Similar Threads | ||||
| Thread | Thread starter | Forum | Replies | Last Post |
| audio driver bit depth vs. recording bit depth | musaee | So much gear, so little time! | 2 | 31st December 2009 03:15 AM |
| bit rate vs. bit depth | DNR | So much gear, so little time! | 6 | 21st July 2008 06:07 PM |
| Session Bit Depth | QjeffQ | Post Production forum! | 3 | 13th March 2008 09:59 PM |
| Hardware I/O bit-depth vs. DAW bit-depth | strat65 | So much gear, so little time! | 2 | 30th October 2005 04:27 PM |
| two questionsabout bit depth in PT | thenewyear | So much gear, so little time! | 12 | 15th January 2004 02:24 AM |
| |