![]() | All Advertisers |
| Member Services Directory | Classifieds | Reviews | Jobs | Deal Zone | Merchandise | Marketplace | Facebook App | Books, DVDs & Gadgets | Video Vault | Tips & Techniques |
| |||||||
New Reply | Thread Tools | Search this Thread |
| | #61 | ||
| 3 + infractions, forum membership suspended. Joined: Jan 2009
Posts: 780
| Quote:
Quote:
hi, i think it is a mistake to try to generalize about hearing. i am aware of nothing saying that the ear is incapable of perceiving minute variations, or that it simply "averages things". as david collins noted, different people are going to have different levels of ability to discriminate. one thing to take into account is the fact that, if we simply stop giving people an oppertunity to discriminate and function quickly, then people may even tend to lose those abilities [i.e "if you don't use it, you lose it"]. with all the compressed formats and so forth there does seem to be a willingness to sacrifice any semblence of quality for portability and convenience. i think that may be a disservice to the public sometimes. here is a link: Sensitivity of Human Ear it seems to me that david collins was saying that the dac tries to "keep up", and that was my understanding. i don't think dacs are routinely designed to deliberately average groups of sample for output, but perhaps there is somethiing i am unaware of in that regard. i would agree that there are plenty of places in the audio chain where things do not work as well as they theoretically could. but one cannot use deficiencies in one area to argue that it is useless to try and improve other areas. i am not sure what you [lupo] are talking about when you talk about "response time" being limited by some equipment's bandwidth limitations in the frequency domain. i don't see that as relevant. amplitude and frequency have to be considered separately in this context. we are talking about "averaging of amplitude quantization over a period of time by means of dithering" versus "pure analog resolution in the first impression, without quantization or averaging". i think dither is useful in alleviating quantization distortion. i do not think it is necessary, or even truthful, to say that it results in the exact same thing as unquantized analog audio. it is a process, and it does what it does. it is generally viewed as an improvement over the alternative. for some reason there seems to be an emotional push to convince people that dither results in perfection. it can be easier to make use of digital audio once one is o.k. with the fact that it's implementation is not so perfect. i think it is better to avoid word length reduction altogether, to whatever extent it can be avoided, and i think we would be better off if cds had a higher bit depth. i would rather see a push toward that goal. right. | ||
| | |
| | #62 | |
| Lives for gear Joined: Aug 2003 Location: Hollywood CA
Posts: 2,625
Verified Member | Quote:
DC | |
| | |
| | #63 | |
| 3 + infractions, forum membership suspended. Joined: Jan 2009
Posts: 780
| Quote:
i think you know that it is not "infinite" in an unqualified sense. dither requires a length of time and a number of samples to achieve a statistical effect. there is a reluctance to include that fact in the discussion. there is also ongoing controversy about when, how much, and how to to use it, as well as the obvious limitations and the fact that it is not necessarily going to be perfectly implemented. the nature of the signal to be dithered also has some impact on its efficacy. i'm not sure what you mean about "liking" it. i have no problem making use of it. i guess i would "like" it better if dither were not needed. it just seems that people [not just you] are not satisfied with it as it really is, so they want to paint is as something better. i don't really see any benefit it that. it doesn't make it work any better. nor does it improve the signal to noise ratio. currently, a higher noise floor is one thing "not to like" [and i obviously understand that quantization distortion generally sounds worse than dither]. i still say we're better off pushing for a higher bit depth delivery medium. [edit] and before someone gets all twisted for no reason, none of this should be taken the wrong way by anyone. this is just a discussion, and i'm just stating a viewpoint. i think we should at least be able to get a 24 bit delivery medium going at this point. what's the ***king holdup? [edit]right. | |
| | |
| | #64 | ||||||
| Lives for gear Joined: Aug 2005 Location: Norway
Posts: 1,741
Verified Member |
Hello! Possibly. Don't tell those who spend their lifes in the field of psychoacoustics! :D Quote:
An example of this is the difference between a single and repetetive pulse. The lone impulse will have the full frequency range of the system, its impulse response. If the impulse is repeated, it'll be periodic waveform with a fundamental and harmonics, a low duty cycle square wave. The discrete impulses is lumped together as a single piece of information. The earbrain likes to do that. Quote:
Output filtering is doing averaging across samples. This is a single sample: Upper row shows connect-the-dots sample view. Lower row is the same lone sample, low pass filtered. Notice the pre- and postringing of the sharp lowpass linear phase filtering. The ringing extends a couple of dozens of samples before and after the sample point. The final output waveform is the superposition of a bunch of such filtered impulses that all stretch beyond each sample point. A bit more illustrations and talk about this can be found here and a longer better explanation is here. Quote:
Quote:
So this may very well be a another case of us talking about the same thing from diffferent angles! Aint it typical? ![]() Quote:
Quote:
Andreas | ||||||
| | |
| | #65 |
| Lives for gear Joined: Aug 2005 Location: Norway
Posts: 1,741
Verified Member |
PS, here's a picture of 24 bit, dithered and truncated waveforms. The upper rows shows about 40 milliseconds, the lower rows shows ~3 milliseconds. The yellow line stays on the same sample point in both views. Notice that this waveform display (in Izotope RX) show intersample peak information. |
| | |
| | #66 | |||
| Lives for gear Joined: Aug 2003 Location: Hollywood CA
Posts: 2,625
Verified Member | Quote:
Have you read Vanderkooy and Lipshitz AES paper? Quote:
Quote:
DC | |||
| | |
| | #67 |
| Gear maniac | Yes, I guess to a deaf person, there is no perception, so there is no concept of loudness.
__________________ Kawika Heftel Utah Recording Studio "Life without music is a journey through the desert." |
| | |
| | #68 | |||
| 3 + infractions, forum membership suspended. Joined: Jan 2009
Posts: 780
| Quote:
yes, i have read the aes paper, a long time ago. i believe it is currently in storage. if there is something in it that you could quote and cite to, that would be helpful. you do not appear to have any question as to how to implement it, but you haven't really disclosed your "formula", so i'm not totally sure what it is. i think i know what you mean, and i'm aware of what i understand to be the "consensus" [flat tpdf at 2x lsb for all intermediate steps, then noise shaped dither, high pass tpdf, or flat tpdf at the very end]. what is your understanding? but there are definitely a whole bunch of alternate theories, and the software usually has a bunch of options for "dither lite", and all kinds of other stuff. there is also quite a bit of support for not using any dither at all for the intermediate 48 bit to 24 bit stuff. lipshitz and vanderkooy seem to be still doing further work and research themselves, no? so i do not think it is fair to say that implementation is entirely a settled matter. Quote:
and, in fact, some people say they are not bothered by quantization distortion. some people talk about it giving an "edge" to the music. i know, but that's what some people say. whatever. i also think it is proper to consider the fact that increased sample rate also reduces quantization noise. 4x sampling apparently yields one additional bit [approx. 6dB]. so dither is not the only way to address the issue. Quote:
following is a quote from someone over on the prosound site, as an example of some varying viewpoints. its not my post. you may find it interesting. i'll emphasize some portion of it that may be interesting to consider. "hi dave; <edit> never mind, bruno said it, i think we are straight on the purpose... maybe there is a case to be made for permitting some quantization errors in the interest of keeping noise levels in check on a 100 track project that is coming out of a daw for further mixing? most daws have a mix bus that is more than 24 bits; we would be re-quantizing 100 times; some errors could become amplified rudely, thus some dithering seems needed. however, i do hear dither noise building, or the effect of multiple dithers seems to affect the perceptions of spacial cues that occur at low levels. try duplicating a float file 100 times, then sum it with one dither to 24 bits. then dither the 100 files each to 24 bit, then sum the same way. i think you will hear a change in the -vibe- which may or may not be desirable. imo, the choice is sometimes subjective. depending on the source(s), and taste. there are "partial" dithers that dither less deeply. (sometimes referred to as "type 2") imo, these sometimes could be best choice. e.g. if distortion at bit 24 would be eliminated from the master when the product is dithered for 16 bit formats. imo, if it seems worth producing a master at 2496, perhaps just for posterity and future generations, then imo the judgement call is important. so that we will be hearing details that are supposed to be in the music (that may have been formerly assumed to be inaudible in lower formats); not "junk" that was formerly flying under the radar, or "unnecessary noise". to return to my former example of a 100 track mix: perhaps an over compressed snare track would not contain even 23 bits of signal data. dither on such track, converted 32->24 would not be effective, therefore not needed. also related imo: i observe that in some cases: some noise shaped dithers have caused more problems for me than they seem to solve. once a client asked me to "turn off the exciter". i've told the story before. i just turned off noise shaping and he was happy. my strong personal opinion is that noise shaping is a big reason why engineers question whether they should dither 32->24; a lot of dithers default to noise shaping, it is another variable; which for me, really screws with the sound. imo, noise shaping obscures our ability to observe effects of dither. so imo, only by listening and judgement could we answer the question: "should i dither -this- 32->24 conversion? and how?" jeff dinces cerberus audio services" in any event, no need to be up in arms about any of this, for me anyway. right. | |||
| | |
| | #69 | ||||||||
| 3 + infractions, forum membership suspended. Joined: Jan 2009
Posts: 780
| Quote:
i don't mind that study, but i sometimes find the push to figure out exactly how to trick people into thinking they are hearing something that they are not really hearing to be a little overdone. Quote:
to the extent it invokes a statistical averaging over time, the addition of dither is probably discerned as smear, at least to some extent [i'm not saying it does not have good effect]. and listening to each sample expanded out to hundreds of samples [and thus "linearized"], is going to be different than simply listening to the source without the "statistical averaging". under your [?] analysis, in one case the ear is averaging the actual amplitudes, in another case the ear is averaging a series of collections of averages, each spanning a longer time than the actual original signal's amplitudes. its "two different things". even if the dithering does have a helpful effect over time, it results in a measurably different thing. Quote:
Quote:
Quote:
\ Quote:
Quote:
that said, i have no big issue with it, i just think it is better to stay objective. there is always interesting information in your posts, by the way. right. Quote:
| ||||||||
| | |
| | #70 | ||||||||
| Lives for gear Joined: Aug 2005 Location: Norway
Posts: 1,741
Verified Member |
Hi! Quote:
Quote:
Here's a cool page with a bunch of audio illusions illustrated with wave file examples: Demonstrations of Auditory Illusions The "gap illusion" thing, amongst others, is highly revelant to the brains ability to connect discrete events as a single one. Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
Now the above was just to follow up. What made me post this was that I have indeed been taking your posts about dithering seriously. Have spent some time trying to fault dithering. My presumption is that if there's anything wrong with it, it should be possible to create some artifacts in the process. If that can be done - the artifacts can be found too. What I did was to use extremely low frequencies as test tones and a double highpassed TPDF dither. This dither gives about 100dB noise free range below the LSB, in the bass end. I figured if there was anything wrong to find, using that dither and such frequencies would be a good starting point. After banging my head into the usual FFT limits, the end result was: nothing. Using the most extreme analysis tool I have, the Izotope RX with silly amounts of frequency and time overlaps on the windowing, shows nothing at all. Except for the noise (and potentially FFT artifacts too - watch out!), there's nothing to be found. I figure that if there is anything wrong, it should turn out as a measureable artifact. It didn't. Please have a go yourself! Would be very interesting if you can find anything but signal+noise in a dithered sample train. All the best, Andreas Nordenstam | ||||||||
| | |
| | #71 | |
| Banned Joined: Jun 2008 Location: London
Posts: 1,088
| Quote:
Yep! The massey is a great limiter for PT. May I add, if you can use a LP EQ to hi pass at about 40hz (come on, its myspace and low pass around 12-13khz, it will sound better when encoded with the evil myspace compression + you will more than likely get a couple of extra dbs out of your track | |
| | |
| | #72 | |
| Gear maniac | Quote:
| |
| | |
| | #73 | |
| Lives for gear Joined: Aug 2003 Location: Hollywood CA
Posts: 2,625
Verified Member | Quote:
Are you for testing, or against testing? For the sampling theorm or against it? We never hear an individual sample, the integration is part of everything from the "sigma" part of the delta/sigma converter to the actual physiology of your hearing. It's everywhere and inescapable. That's a good thing as it allows you to hear below the noise floor by letting the noise average "out." I once heard it said the brain is a massively parallel computer, with a clock frequency of 1kHz. DC | |
| | |
| | #74 | ||||||||
| Lives for gear Joined: Aug 2003 Location: Hollywood CA
Posts: 2,625
Verified Member | Quote:
Quote:
Quote:
A sine wave doesn't carry any information anyway. Seen one cycle, you've seen them all. Quote:
Quote:
. It's a form of cosmic consciousness, you know. Quote:
Quote:
Quote:
Hope this helps! DC | ||||||||
| | |
| | #75 | |||||||||
| Lives for gear Joined: Aug 2003 Location: Hollywood CA
Posts: 2,625
Verified Member | Quote:
Quote:
Quote:
What implementation issue do you mean? Quote:
But I still like them. Quote:
Quote:
Quote:
Quote:
Quote:
DC | |||||||||
| | |
| | #76 | ||||||||
| Lives for gear Joined: Aug 2005 Location: Norway
Posts: 1,741
Verified Member |
Hello! Quote:
That's just one example. That's the way a lot of sounds works. They're nothing by themselves, they absolutely need a stretch of time to be what they're intended to be. The very idea of a frequency is dependent on time. Quote:
Quote:
This was your response to the picture of an impulse response of a single sample: Quote:
Quote:
![]() Quick primer of the sampling basic: a single impulse with a perfect brickwall lowpass filter gives the sin(x)/x function, also known as the sinc function. It looks like this: Notice that it crosses the zero line at every Pi, just like the sinewave/circle does. Every sample in a digital system looks like that, with the sample value being the peak of the sinc and the sample clock comming at every Pi. Each sample contributes a certain value at the sample point and zero value at all other surrounding sample points. The land inbetween though, is a far different matter. It's made up by the contribution of many such samples. By summing the value of the leading and trailing edges of many such sincs, the value of the land inbetween the sample dots is restored to what it should be. That was my point back in that other post. Considering the value of the samples themselves only tells the conceptual instantaneous value at the sample clock, it doesn't say much about the real values contained in the reproduced waveform. Therein also lies the answer to this: Quote:
Quote:
I don't think those words have to be typed on a special keyboard to be significant..Quote:
Andreas | ||||||||
| | |
New Reply
Facebook
Twitter
LinkedIn
| Thread Tools | Search this Thread |
| Similar Threads | ||||
| Thread | Thread starter | Forum | Replies | Last Post |
| easy protools question | Guruceta | So much gear, so little time! | 15 | 24th March 2009 06:24 PM |
| Very very easy question (hopefully) | gigamesh | So much gear, so little time! | 2 | 16th July 2007 08:51 AM |
| easy tape transfer question(sound question) | numrologst | So much gear, so little time! | 2 | 13th March 2007 08:22 PM |
| easy volume question | sneezebomb | So much gear, so little time! | 5 | 24th January 2007 08:52 PM |
| Easy question | jeb_milne | Music computers | 11 | 26th October 2004 04:34 AM |
| |