Gearslutz.com
All Advertisers

Go Back   Gearslutz.com > The Forums > Post Production forum!

Tags: , ,

Similar Threads
Thread Thread Starter Forum Replies Last Post
Post your best DRUM SOUND Audio_Vault High end 2 Yesterday 01:02 PM
Live sound gig(warning LONG post) Hope209 The moan zone 3 11th August 2006 08:53 AM
Where should I post for trying to get a sound.. Stoneroses6300 High end 7 30th June 2006 02:44 PM
First post - How does this sound to you? crabtwins Work in progress / advice requested / Show & Tell / Artist showcase 2 3rd May 2006 04:49 AM
Can you please post a sound of your mteks and larvys others welcome spektor High end 6 19th April 2006 12:34 PM

Reply
 
Thread Tools Search this Thread Rating: Thread Rating: 2 votes, 5.00 average. Display Modes
Old 28th January 2007, 10:03 PM   #1
georgia
Gear addict
 
georgia's Avatar
 
Join Date: Dec 2006
Location: NY NY
Posts: 430
iTrader: (0)
Smile Geo's sound post corner

Hi, Just thought I'd add this one for anyone with specific questions I can help with.

I've posted some misc stuff I've written and/or gathered about tech and post... As I trip over stuff I'll post it.

cheers
geo
__________________
ms georgia hilton mpse cas
Creative Director
Leviathan Productions
World Wide Audio Inc
NY NY
www.leviathan.us.com


When I am laid in earth, may my wrongs create no trouble in thy breast. Remember me, but ah, forget my fate. - Dido and Aeneas, ARIA
georgia is offline   Reply With Quote
Old 28th January 2007, 10:25 PM   #2
Geert van den Berg
Mac Moderator
 
Geert van den Berg's Avatar
 
Join Date: May 2003
Location: Amsterdam
Posts: 2,408
iTrader: (0)
Then let me help you and stick it to the top... I have read some of your posts over at the DUC and I think this spot is well deserved!
Geert van den Berg is offline   Reply With Quote
Old 30th January 2007, 02:37 AM   #3
georgia
Gear addict
 
georgia's Avatar
 
Join Date: Dec 2006
Location: NY NY
Posts: 430
iTrader: (0)
Thanks!! I appercaite the praise from a pro! Hope to help all....

Since this is GEAR SLUTZ and I'm one. I'll list some of my toys:

Dub Stage:
Euphonix System 5 Hybrid - 2 seat console with 300 Channels for mix.
Neve and Euphonix Converters for Analog and Digital IOs.
FC727 For Protools TDM IO to the System5
SSL converter for Protoold HD IO to System5
Neve Digital IOs for connects to two TC Electronic System 6000's
The 2 6000's have 16 digital IO each and lot's-o-software.
DK600M metering with remote screen
DSM7.1 Down mixer
DA98HR for printmastering
Dolby Digital DMU
DTS T2 Tower
DTS Threatre playback system
GENELEC 5.1 system
misc old analog devices like Lexicon verbs, Keypex Gates, Gainbrains Limiters, LA2A comps, etc.
Protools MIX+ with 4 farm cards and bazillions of plugins
Protools HD with 64 IO using SSL converters
Protools LE Digi002 (3)
Protools LE digi003 (1)
VVTR pro on quad intel using AJA Kona Card for HD output video to HD projector and VUtec microperf screen
SONY HD playback deck, Beta-SP, DVCpro50, and others

Editing rooms:
Mostly Protools Mbox, Digi002s etc w/dvtoolkit running 7.x Protools on MAC quad intels with Genelec monitors.

Foley Pit/ADR
PRotools DIGI003 w/DVtoolkit, Focusrite Mic Pres, Misc gear
Genelec 1031A 5.1 system
DC30+ and multiple screens for monitoring video

Editing Picutre on G5s and Quad intels with FCP 5 HD, Avid Mojo and using PC based SONY XPRI and HDcam for uncompressed Full 1920x1080p HD editing


cheers
geo
__________________
ms georgia hilton mpse cas
Creative Director
Leviathan Productions
World Wide Audio Inc
NY NY
www.leviathan.us.com


When I am laid in earth, may my wrongs create no trouble in thy breast. Remember me, but ah, forget my fate. - Dido and Aeneas, ARIA
georgia is offline   Reply With Quote
Old 30th January 2007, 03:19 AM   #4
georgia
Gear addict
 
georgia's Avatar
 
Join Date: Dec 2006
Location: NY NY
Posts: 430
iTrader: (0)
History of audio in film

here's a powerpoint presentation on the history of audio in film I did a year ago or so for a quick presentation...



ok, never mind... for some reason I can't get the .ppt or the zip file to up load. if anyone wants a copy feel free to email me at ghilton@wwaudioinc.com and i'll mail the power point presentation to them.

cheers
geo
Attached Files
File Type: zip A history of film 011506.zip (4.94 MB, 1040 views)
__________________
ms georgia hilton mpse cas
Creative Director
Leviathan Productions
World Wide Audio Inc
NY NY
www.leviathan.us.com


When I am laid in earth, may my wrongs create no trouble in thy breast. Remember me, but ah, forget my fate. - Dido and Aeneas, ARIA

Last edited by Geert van den Berg; 31st January 2007 at 08:52 PM. Reason: added the zip file
georgia is offline   Reply With Quote
Old 3rd February 2007, 03:12 PM   #5
georgia
Gear addict
 
georgia's Avatar
 
Join Date: Dec 2006
Location: NY NY
Posts: 430
iTrader: (0)
Film crew links

some film crew links for people looking for post gigs...

WWW.FILMSTAFF.COM http://www.filmstaff.com/index.asp?camp=101

WWW.MANDY.COM

WWW.CRAIGSLIST.COM

WWW.BLS.GOV/OCO/CG/CGS038.HTM

http://www.panix.com/~jviii/filmlctr/jv3film1.html
http://www.afilm.dk/
www.film-connection.com
http://www.productionhub.com/
http://www.hollywoodnetwork.com/
http://www.usfilmproduction.com/
The Independent Filmmakers Internet Resource Guide
http://www.media-match.com/index.php?affid=9999

http://www.assistantdirectors.com/

http://www.filmmaker.co.za/

http://www.filmcrewonline.com/

http://www.crewnet.com/

http://www.caryn.com/biz/ibuzz/crew-prod.html

http://www.ukscreen.com/crew

http://www.nmfilm.com/locals/faqs/

http://www.eej.com/film_crew.htm

http://www.crewmarket.com/

cheers
geo
__________________
ms georgia hilton mpse cas
Creative Director
Leviathan Productions
World Wide Audio Inc
NY NY
www.leviathan.us.com


When I am laid in earth, may my wrongs create no trouble in thy breast. Remember me, but ah, forget my fate. - Dido and Aeneas, ARIA
georgia is offline   Reply With Quote
Old 3rd February 2007, 03:15 PM   #6
georgia
Gear addict
 
georgia's Avatar
 
Join Date: Dec 2006
Location: NY NY
Posts: 430
iTrader: (0)
some thoughts on RAID

A drive array is a collection of hard disk drives that are grouped together. When you talk about Raid, there is often a distinction between physical drives and arrays and logical drives and arrays. Physical arrays can be divided or grouped together to form one or more logical arrays. These logical arrays can be divided into logical drives that the operating system sees. The logical drives are treated like single hard drives and can be partitioned and formatted accordingly. The Raid controller is what manages how the data is stored and accessed across the both the physical and logical arrays. It ensures that the operating system only sees the logical drives and does not need to worry about managing the underlying schema. As far as the system is concerned, it's dealing with regular hard drives. A Raid controller's functions can be implemented in hardware or software. Hardware implementations are better for Raid levels that require large amounts of calculations. The single individual Raid levels don't address every application requirement that exist. So, to get more functionality, someone thought of the idea of combining Raid levels. The main benefit of using multiple Raid levels is the increased performance. Usually combining Raid levels means using a hardware Raid controller. The increased level of complexity of these levels means that software solutions are no practical. Raid 0 has the best performance out of the single levels and it is the one most commonly being
combined. Not all combinations of Raid levels exist. The most common combinations are Raid 0+1 and 1+0. The difference between 0+1 and 1+0 might seem subtle… the difference lies in the amount of fault tolerance. Both these levels require at least 4 hard drives to implement so this can get a bit expensive.. ok lets hit the details of Raid levels…

Raid 0
This is the simplest level of Raid… and it just involves striping. Data redundancy is not even present in this level, so it is not recommended for applications where data is critical. This level offers the highest level of performance out of any single Raid level. At least 2 hard drives are required, preferably identical, and the maximum depends on the Raid controller. None of the space is wasted as long as the hard drives used are identical. it's relatively low cost and high performance gain. This level is good for most people that don't need any data redundancy. It works with SCSI and IDE/ATA implementations. Finally, it's important to note that if any of the hard drives in the array fails, you lose everything.

Raid 1
This level is usually implemented as mirroring. Two identical copies of data are stored on two drives. When one drive fails, the other drive still has the data to keep the system going. Rebuilding a lost drive is very simple since you still have the second copy. This adds data redundancy to the system and provides some safety from failures. Some implementations add an extra Raid controller to increase the fault tolerance even more. It’s ideal for applications that use critical data. Even though the performance benefits are not great, it really helps with preserving data. It is also relative simple and has a low cost of implemention. Most Raid
controllers nowadays implement some form of Raid 1.

Raid 2
This level uses bit level striping with Hamming code ECC. The technique used here is somewhat similar to striping with parity but not really. The data is split at the bit level and spread over a number of data and ECC disks. When data is written to the array, the Hamming codes are calculated and written to the ECC disks. When the data is read from the array, Hamming codes are used to check whether errors have occurred since the data was written to the array. Single bit errors can be detected and corrected immediately. This is the only level that really deviates from traditional Raid ideas. Remember, this level is very complicated and expensive Raid controller hardware is needed.

Raid 3
Raid 3 uses byte level striping with dedicated parity. In other words, data is striped across the array at the byte level with one dedicated parity drive holding the redundancy information. The idea behind this level is that striping the data increasing performance and using dedicated parity takes care of redundancy. 3 hard drives are required. 2 for striping, and 1 as the dedicated parity drive. Although the performance is good, the added parity does slow down writes. The parity information has to be written to the parity drive whenever a write occurs. This increased computation calls for a hardware controller, so software
implementations are not practical. Raid 3 is good for applications that deal with large files since the stripe size is small.

Raid 4
This level is very similar to Raid 3. The only difference is that it uses block level striping instead of byte level striping. The advantage in that is that you can change the stripe size to suit application needs. This level is often seen as a mix between Raid 3 and Raid 5, having the dedicated parity of Raid 3 and the block level striping of Raid 5. Again, you'll probably need a hardware Raid controller for this level. Also, the dedicated parity drive continues to slow down performance in this level as well.

Raid 5
Raid 5 uses block level striping and distributed parity. This level tries to remove the bottleneck of the dedicated parity drive. With the use of a distributed parity algorithm, this level writes the data and parity data across all the drives. Basically, the blocks of data are used to create the parity blocks which are then stored across the array. This removes the bottleneck of writing to just one parity drive. However, the parity information still has to be calculated and written whenever a write occurs, so the slowdown involved with that still applies. The fault tolerance is maintained by separating the parity information for a block from the actual data block. This way when one drive goes, all the data on that drive can be rebuilt from the data on the other drives. Recovery is more complicated than usual because of the distributed nature of the parity. Just as in Raid 4, the stripe size can be changed to suit the needs of the application. Also, using a hardware controller is probably the more practical solution. Raid 5 is one of the most popular Raid levels being used today. It appears to be the best combination of performance, redundancy, and storage efficiency.

Raid 0+1
This combination uses Raid 0 for it's high performance and Raid 1 for it's high fault tolerance. Let's say you have 8 hard drives. You can split them into 2 arrays of 4 drives each, and apply Raid 0 to each array. Now you have 2 striped arrays. Then you would apply Raid 1 to the 2 striped arrays and have one array mirrored on the other. If a hard drive in one striped array fails, the entire array is lost. The other striped array is left, but contains no fault tolerance if any of the drives in it fail.

Raid 1+ 0
Raid 1+0 applies Raid 1 first then Raid 0 to the drives. To apply Raid 1, you split the 8 drives into 4 sets of 2 drives each. Now each set is mirrored and has duplicate information. To apply Raid 0, you then stripe across the 4 sets. In essence, you have a striped array across a number of mirrored sets. This combination has better fault tolerance than Raid 0+1. As long as one drive in a mirrored set is active, the array can still function. So theoretically you can have up to half the drives fail before you lose everything, as opposed too nly two drives in Raid 0+1.

In conclusion
Ok now that you know the different Raid levels and configurations, why would you even bother? Well it really all depends on your application and the Raid level you use. However, in general using Raid provides data redundancy, fault tolerance, increased capacity, and increased performance. Data redundancy protects the data from hard drive failures. This benefit is good for companies or individuals that have critical or important data to protect, or just anyone that's paranoid about losing their gigabytes of data. Fault tolerance goes hand in hand with redundancy in providing a better over-all storage system. The only Raid level that does not have any form of redundancy or fault tolerance is Raid 0. Raid also provides
increased capacity by combining multiple drives. The efficiency of how the total drive storage is used depends on the Raid level. Usually, levels involving mirroring need twice as much storage to mirror the data. And lastly, the reason most people go to Raid is for the increase in performance. Depending on the Raid level used, the performance increase is different. For applications that need raw speed, Raid is definitely the way to go.

Here is a simple view of Raid:
Mirroring gives you Redundancy …therefore Data security goes up. Write performance goes down due to duplicated writes ( the amount varies by implementation). and read performance goes up, since there are two spindles with duplicated data that can be accessed by the system. In fact, in some implementations, the data that is closest to the read head of a given spindle is chosen for read making the seek and latency time drop dramatically ( note: again this depends on how your system is implemented and how you configure caching algorithms. The main thing to remember here is that the Raid controller writes the same data blocks to each mirrored drive. Each drive or array has the same information in it To set up mirroring the number of drives will have to be in the power of 2 for obvious reasons. The drawback here is that both drives are tied up during the writing process which limits parallelism and can hurt performance. A good Raid controller will only read from one of the drives since the data on both are the same. While the other is used to read, the free drive can be used for other requests. This increases parallelism, which is pretty much the concept behind the performance increase of Raid.

Stripping
Spreading that single file across a bunch-o-drives. Security of data drops ( more spindles & drive mechanics to break) but this gives you almost unlimited size of a “single” logical disk. Add two 60 gig disks get 1 120 gig disk. . Striping improves the performance of the array by distributing the data across all the drives. The main principle behind striping is parallelism. Imagine you have a large file on a single hard drive. If you want to read the file, you have to wait for the hard drive to read the file from beginning to end. Now, if you break the file up into multiple pieces and distribute it across multiple hard drives, you have all these drives reading a part of the file at the same time. You only have to wait as long as it takes to read each piece since the drives are working in parallel. The same is true if you were writing a large file to a disk. Transfer performance is greatly increased. The more hard drives you have, the greater the increase in performance. The stripe size is a largely debated topic. There is no ideal stripe size but certain sizes work best with certain applications. The performance effects of increasing or decreasing stripe size are apparent. Using a small stripe size will enable files to be broken up more and distributed across the drives. The transfe performance will increase due to the increased parallelism. However, this also increases the randomness of the position of each piece of the file. As you probably guessed already, using a large stripe size will do the opposite of decreasing the size. The data will be less distributed and transfer performance is decreased. The randomness is decreased as well. The best way to find out the right stripe size for your particular application is to experiment. Start out with a medium stripe size and try decreasing or increasing the siz and recording the difference in over-all performance. Remember, if you want to move or transfer a file somewhere, the controller accesses both drives simultaneously, which is where the performance gain kick in. It only takes half the time to transfer the file. If you increase the number of hard drives, the file will be transferred in 1/Nth the time it takes to transfer from 1 hard drive .

Mirroring and stripping
Add them both together data redundancy is up, security of data is better, read performance goes up, much faster ( depending on configuration again), write performance suffers depending on implementation

Sorry for being so long winded…. It just seemed that there is some confusion with regard to Raid capabilities and benefits.
__________________
ms georgia hilton mpse cas
Creative Director
Leviathan Productions
World Wide Audio Inc
NY NY
www.leviathan.us.com


When I am laid in earth, may my wrongs create no trouble in thy breast. Remember me, but ah, forget my fate. - Dido and Aeneas, ARIA
georgia is offline   Reply With Quote
Old 5th February 2007, 01:54 PM   #7
georgia
Gear addict
 
georgia's Avatar
 
Join Date: Dec 2006
Location: NY NY
Posts: 430
iTrader: (0)
Some Dolby info regarding metadata

Since metadata like Dialnorm is becoming more and more important in post, here's some data from Dolby to spark some thinking...


Dolby ®
Metadata Guide

Dolby, Pro Logic, and the double-D symbol are registered trademarks of Dolby Laboratories.
Surround EX is a trademark of Dolby Laboratories. Issue 3
© 2005 Dolby Laboratories, Inc. All rights reserved. S05/14660/16797
Dolby Laboratories, Inc.

A Guide to Dolby Metadata

Metadata provides unprecedented capability for content producers to deliver the highest quality audio to consumers in a range of listening environments. It also provides choices that allow consumers to adjust their settings to best suit their listening environments. In this document, we first discuss the concept of metadata:
• Metadata overview
We then discuss the three factors controlled by metadata that most directly affect the consumer’s experience:
• Dialogue level
• Dynamic range control (DRC)
• Downmixing
Finally, we define each of the adjustable parameters, and provide sample combinations:
• Individual parameters
• Metadata combinations
1 Metadata Overview
Dolby® Digital and Dolby E are both data-rate reduction technologies that use metadata. Metadata is carried in the Dolby Digital or Dolby E bitstream, describing the encoded audio and conveying information that precisely controls downstream encoders and decoders. In normal operation, the encoded audio and metadata are carried together as a data stream on two regular digital audio channels (AES3, AES/EBU, or S/PDIF). Metadata can also be carried as a serial data stream between Dolby E and/or Dolby Digital equipment. Metadata allows content providers unprecedented control over how original program material is reproduced in the home. Dolby Digital is a transmission bitstream (sometimes called an emission bitstream) intendedfor delivery to the consumer at home through a medium such as DTV or
by one metadata stream. The consumer’s Dolby Digital decoder reproduces the program audio according to the metadata parameters set by the program creator, and according to settings for speaker configuration, bass management, and dynamic range that are chosen by the consumer to match his specific home theater equipment and environmental conditions.

Dolby E is a distribution bitstream capable of carrying up to eight channels of encoded audio and metadata. The number of programs ranges from one single program (Program Config: 5.1) to eight individual programs on a single Dolby E stream (Program Config: 8 × 1). Each program is discrete, with its own metadata in the Dolby E stream. Some metadata parameters in a Dolby E stream automatically configure a Dolby Digital encoder at the point of transmission, while others affect only the consumer’s Dolby Digital decoder operation. Dolby E is a professional technology used for broadcast applications, such as program origination and distribution; the Dolby E bitstream carries the entire metadata parameter set. Dolby Digital, used for consumer applications, such as transmission to the home or for DVD authoring, employs a subset of the full metadata parameter set called Dolby Digital metadata; the Dolby Digital bitstream carries only
those parameters necessary for proper decoding by the consumer. Metadata is first inserted during program creation or mastering, and is carried through transmission in a broadcast application or directly onto a DVD. The metadata provides control over how the encoded bitstream is treated at each step on the way to the consumer’s decoder.

Here’s an example of how it works:
In a broadcast truck parked outside a football stadium, the program mixer chooses the appropriate metadata for the audio program being created. The resulting audio program, together with metadata, is encoded as Dolby E and sent to the television station via fiber, microwave, or other transmission link. At the receiving end of this transmission, the Dolby E stream is decoded back to baseband audio and metadata.
The audio program and the metadata are monitored, altered, or re-created as other elements of thprogram are added in preparation for broadcast. This new audio program/metadata pair, reencoded as Dolby E, leaves the postproduction studio and passes through the television station to Master Control, where many incoming Dolby E streams are once again decoded back to their individual baseband digital audio/metadata programs. The audio program/metadata pair that is selected to air is sent to the transmission Dolby Digital encoder, which encodes the incoming audio program according to the metadata stream associated with it, thereby simplifying the transmission process. Finally, the Dolby Digital signal is decoded in the consumer’s home, with metadata providing the information for that decoding process. Through the use of metadata, the mixer in the truck has been able to control the home decoder for the sporting event, while segments such as news breaks, commercials, and station IDs are similarly decoded, each using metadata carried within each individual segment.

This control, however, requires the producer to set the metadata parameters correctly, since they affect important aspects of the audio—and can seriously compromise the final product if set improperly. Although most metadata parameters are transparent to consumers, certain parameters affect the output of a home decoder, such as downmixing for a specific speaker configuration, or when the consumer chooses
Dynamic Range Control to avoid disturbing family and neighbors.

TheDolbyEbitstream containsboththe5.1-and two-channel programs’ encodedaudio,andeach program'smetadata. TheDolbyDigital bitstream containsasingleprogram’s encodedaudioand
correspondingmetadata.

Metadata Flow from Production to Consumer
In the simplest terms, there are two functional classifications of metadata: Professional: These parameters are carried only in the Dolby E bitstream. They are used to automatically configure a downstream Dolby Digital encoder, allowing maximum control by the content producer over how the encoded bitstream istreated at each step on the way to the consumer’s decoder. Consumer: These parameters are carried in both the Dolby E and the Dolby Digital bitstream. The consumer’s Dolby Digital decoder uses these parameters to create the best possible audio program possible on each consumer’s playback system. Consumer parameters include the DRC values, which are ultimately enabled by the end user’s selection, as discussed in Section 3, Dynamic Range Control.

Both types of metadata can be examined, modified, or passed through during encoding. A/D Converter Type

Special Parameters
There are other professional parameters included in the Dolby E bitstream that are not under direct user control, such as Timecode and Pitch Shift.

Timecode
Dolby E bitstreams carry timecode information in hours:minutes:seconds:frames format.

Pitch Shift
The Pitch Shift parameter can be generated automatically by a Dolby E decoder to control the Dolby Model 585 Time Scaling Processor. If the input to the Dolby E decoder is not at normal play speed (as with varispeed or program play), then the Pitch Shift Code parameter indicates the amount of audio pitch shifting required to restore the original program pitch.

Dialogue Level
Dialogue Level (also known as dialogue normalization or dialnorm) is perhaps the single most important metadata parameter. The Dialogue Level setting represents the long-term A-weighted average level of dialogue within a presentation, Leq(A). This level can be quantified with the Dolby Model LM100 Broadcast Loudness Meter. When received at the consumer’s Dolby Digital decoder, this parameter setting determines a level shift in the decoder that sets, or normalizes, the average audio output of the decoder to a preset level. This aids in matching audio volume between program sources. In broadcast transmission, the proper setting of Dialogue Level ensures that the consumer receives a standard listening level, so switching channels or watching a television program through the commercial breaks doesn’t require adjusting the volume. Using the same standard for all content, whether conveyed by broadcast television, DVD, or other media, enables the consumer to switch between sources and programs while maintaining a comfortable and consistent listening level. The proper setting of the Dialogue Level parameter also enables the Dynamic Range Control profiles chosen by the content producer to work as intended in less-than- optimal listening environments, and is essential in any content production, whether it is for transmission in a broadcast stream or for direct distribution to consumers, as with DVDs.
Note: Programs without dialogue, such as an all-music program, still require a careful setting of the Dialogue Level parameter. When setting the parameter for such content, it is useful to compare the program to the level of other programs. The goal is to allow the consumer to switch to your program without having to adjust the volume control.

The Scale
The scale used in the Dialogue Level setting ranges in 1 dB steps from –1 to –31 dB. Contrary to what you might assume at first, a setting of –31 represents no level shift in the consumer’s decoder, and –1 represents the maximum level shift. Here’s why: Dolby Digital consumer decoders normalize the average output level—that is, the output level averaged over time using the equivalent loudness method, Leq(A)—
to –31 dBFS (31 dB below 0 dB full-scale digital output) by applying a shift in level based on the Dialogue Level parameter setting. Note: The –31 dBFS Leq(A) should not be confused with the station reference level (often –18 or –20 dBFS). It is common to have different Leq(A) values for program material that has the same reference level. An average loudness level of –31 dBFS Leq(A) is quite compatible with facilities running at a
variety of reference levels. When a decoder receives an input signal with a Dialogue Level setting of –31, it applies no level shift to the signal because this indicates to the decoder that the signal already matches the target level and therefore requires no shift. In contrast, a louder program requires a shift to match the –31 dB standard. When the Dialogue Level parameter setting is –21, the decoder applies a 10 dB level shift to the signal. When the setting is –11, it applies a 20 dB level shift, and so on.
A Simple Rule:
31 + (dialogue level value) = Shift applied
Example:
31 + (–21) = 10 dB

The most important point to remember is that in setting the Dialogue Level parameter, you are providing your listener with an essential service. For your listeners, setting this level properly means:
• The volume level is consistent with other programs.
• The DRC profiles you make available to them work as you intend.
Once dialogue level is set, you can set up DRC profiles to further benefit the consumer.

Dynamic Range Control
Different home listening environments present a wide range of requirements for dynamic range. Rather than simply compressing the audio program at the transmission source to work well in the poorest listening environments, Dolby Digital encoders calculate and send Dynamic Range Control (DRC) metadata with the signal. This metadata can then be applied to the signal by the decoder to reduce the signal’s
dynamic range. Through the proper setting of DRC profiles during the mastering process, the content producer can provide the best possible presentation of program content in virtually any listening environment, regardless of the quality of the equipment, number of channels, or ambient noise level in the consumer’s home.Many Dolby Digital decoders offer the consumer the option of defeating the Dynamic
Range Control metadata, but some do not. Decoders with six discrete channel outputs (full 5.1-channel capability) typically offer this option. Decoders with stereo, mono, or RF-remodulated outputs, such as those found on DVD players and set-top boxes, often do not. In these cases, the decoder automatically applies the most appropriateDRC metadata for the decoder’s operating mode. The Dolby Digital stream carries metadata for the two possible operating modes in the decoder. The operating modes are known as Line mode and RF mode due to the type of output they are typically associated with. Line mode is typically used on decoders with six- or two-channel line-level outputs and RF mode is used on decoders that have an RF-remodulated output. Full-featured decoders allow the consumer to select whether to use DRC and if so, which operating mode to use. The consumer sees options such as Off, Light Compression, and Heavy Compression instead of None, Line mode, and RF mode. Advanced decoders may also allow custom scaling of the DRC metadata. All that needs to be done during metadata authoring, or encoding, is selection of the dynamic range control profiles for Line mode and RF mode. The profiles are described in the following sections.
Note: While the use of DRC modes during decoding is a consumer-selectable feature, the Dialogue Levelparameter setting is not. Therefore, setting the Dialogue Level parameter properly is essential before previewing a DRC profile.

Line Mode
Line mode offers these features:
• Low-level signal boost compression scaling is allowed.
• High-level signal cut compression scaling is allowed when not downmixing.
• The normalized dialogue level is reproduced from the decoder at a constant
loudness level of –31 dBFS Leq(A), assuming the Dialogue Level parameter
is set correctly.
Line-level or power-amplified outputs from two-channel set-top decoders, two- channel digital televisions, 5.1-channel digital televisions, Dolby Digital A/V surround decoders, and outboard Dolby Digital adapters use Line mode.
Consumer control of the dynamic range is limited when downmixing. Products with stereo or mono outputs do not usually allow consumer scaling of Line mode. This is because these devices are usually downmixing (for example, when receiving a 5.1-channel signal). However, in these products, the consumer may have a choice between Line mode and RF mode.

RF Mode
In RF mode, high- and low-level compression scaling is not allowed. When RF mode is active, that compression profile is always fully applied. RF mode is designed for products (such as set-top boxes) that generate a downmixed signal for connection to the RF/antenna input of a television set; however, it is also useful in situations where heavy DRC is required—for example, when small PC speakers are used for DVD playback. In RF mode, the overall program level is raised 11 dB, this results in dialogue being reproduced at a level of –20 dBFS Leq(A), while the peaks are limited to prevent signal overload in the D/A converter. By limiting headroom, severe overmodulation of television receivers is prevented. The 11 dB gain provides an average loudness level that compares well with existing analog television broadcasts. In some situations it may be necessary to further constrain signal peaks above the average dialogue level so that there is less than 20 dB headroom. The selection of a suitable RF mode profile achieves this.

Dynamic Range Control Profiles Six preset DRC profiles are available to content producers: Film Light, Film
Standard, Music Light, Music Standard, Speech, and None.

• Film Light
Max Boost: 6 dB (below –53 dB)
Boost Range: –53 to –41 dB (2:1 ratio)
Null Band Width: 20 dB (–41 to –21 dB)
Early Cut Range: –26 to –11 dB (2:1 ratio)
Cut Range: –11 to +4 dB (20:1 ratio)

• Film Standard
Max Boost: 6 dB (below –43 dB)
Boost Range: –43 to –31 dB (2:1 ratio)
Null Band Width: 5 dB (–31 to –26 dB)
Early Cut Range: –26 to –16 dB (2:1 ratio)
Cut Range: –16 to +4 dB (20:1 ratio)

• Music Light (No early cut range)
Max Boost: 12 dB (below –65 dB)
Boost Range: –65 to –41 dB (2:1 ratio)
Null Band Width: 20 dB (–41 to –21 dB)
Cut Range: –21 to +9 dB (2:1 ratio)
Dolby Laboratories, Inc. Metadata Guide

• Music Standard
Max Boost: 12 dB (below –55 dB)
Boost Range: –55 to –31 dB (2:1 ratio)
Null Band Width: 5 dB (–31 to –26 dB)
Early Cut Range: –26 to –16 dB (2:1 ratio)
Cut Range: –16 to +4 dB (20:1 ratio)

• Speech
Max Boost: 15 dB (below –50 dB)
Boost Range: –50 to –31 dB (5:1 ratio)
Null Band Width: 5 dB (–31 to –26 dB)
Early Cut Range: –26 to –16 dB (2:1 ratio)
Cut Range: –16 to +4 dB (20:1 ratio)

None
No DRC profile selected. The dialogue level parameter (dialnorm) is still applied. These choices are available to the content producer for both Line mode and RF mode. The content producer chooses which of these profiles to assign to each mode; when the consumer or decoder selects a DRC mode, the profile chosen by the producer is applied. In addition to the DRC profile, metadata can limit signal peaks to prevent clipping during downmixing. This metadata, known as overload protection, is inserted by the encoder only if necessary. For example, consider a 5.1-channel program with signals at digital full-scale on all channels being played through a stereo, downmixed line- level output. Without some form of attenuation or limiting, the output signal would obviously clip. Correct setting of the Dialogue Level and DRC profiles normally prevents clipping and unnecessary application of automatic overload protection. Note: DRC profile settingsare dependent on an accurate dialogue level setting. Improper setting of the dialogue level parameter may result in excessive and audible application of overload-protection limiting.

Downmixing
Downmixing is a function of Dolby Digital that allows a multichannel program to be reproduced over fewer speaker channels than the number for which the program is optimally intended. Simply put, downmixing allows consumers to enjoy a DVD or digital television broadcast without requiring a full-blown home theater setup.
As with stereo mixing where the mix is monitored in mono on occasion to maintain compatibility, multichannel audio mixing requires the engineer to reference the mix to fewer speaker channels to ensure compatibility in downmixing situations. In this way, Dolby Digital, using the metadata parameters thatcontrol downmixing, is an “equal opportunity technology,” in that every consumer who receives the
Dolby Digital data stream can enjoy the best audio reproduction possible, regardless of the playback system.
It is important to consider the output signals from each piece of equipment that can receive a Dolby Digital program in the home. Table 2 shows the output types from different equipment.
__________________
ms georgia hilton mpse cas
Creative Director
Leviathan Productions
World Wide Audio Inc
NY NY
www.leviathan.us.com


When I am laid in earth, may my wrongs create no trouble in thy breast. Remember me, but ah, forget my fate. - Dido and Aeneas, ARIA
georgia is offline   Reply With Quote
Old 5th February 2007, 07:04 PM   #8
georgia
Gear addict
 
georgia's Avatar
 
Join Date: Dec 2006
Location: NY NY
Posts: 430
iTrader: (0)
more meta data - sorry I was tied up for a few

Set-top boxes, used to receive terrestrial, cable, or satellite digital television, typically offer an analog mono signal modulated on the RF/Antenna output, a line-level analog stereo signal, and an optical or coaxial digital output. DVD players offer an analog stereo and a digital output, and some offer a six-channel analog output (for a 5.1- channel presentation). Portable DVD players offer analog stereo, headphone, and
digital outputs. DVD players in computers and game consoles offer a digital output as well as analog stereo, headphone, and possibly six-channel analog outputs. 5.1- channel amplifiers, decoders, and receivers have six-channel analog outputs and possibly six speaker-level outputs.

In all of these cases, a Dolby Digital decoder creates the analog audio output signal. In the case of the set-top box or DVD player, the analog stereo output is a downmixed version of the Dolby Digital data stream. The digital output delivers the Dolby Digital data stream to either a downstream decoder or an integrated amplifier with Dolby Digital decoding.

In each of these devices, the analog stereo output is one of two different stereo downmixes. One type is a stereo-compatible Dolby Surround downmix, of the multichannel source program that is suitable for Dolby Surround Pro Logic® decoding. This kind of downmix is also called Pro Logic or Left total/Right total
(Lt/Rt). The other type is a simple stereo representation (called Left only/Right only, or Lo/Ro) suitable for playback on a stereo hi-fi or on headphones, and from which a mono signal is derived for use on an RF/Antenna output. The difference between the downmixes is how the Surround channels are handled. The Lt/Rt downmix sums the Surround channels and adds them, in-phase to the Left channel and out-of-phase to
the Right channel. This allows a Dolby Surround Pro Logic decoder to reconstruct the L/C/R/S channels for a Pro Logic home theater. The Lo/Ro downmix adds the Left and Right Surround channels discretely to the Left and Right speaker channels, respectively. This preserves the stereo separation for stereo-only monitoring and produces a mono-compatible signal. In all downmixes, the LFE channel is not
included.
On most home equipment, the consumer can use the product’s user interface to choose the appropriate stereo output for his playback system. The mono signal feeding the RF/Antenna output is usually derived from the Lo/Ro downmix. There are separate metadata parameters that govern the Lo/Ro and Lt/Rt downmixes. Certain metadata parameters allow the engineer to select how the stereo downmix is constructed and which stereo analog signal is preferred, but Lt/Rt is the default selection in all consumer decoders. See Section 5, Parameter Definitions, for more information on individual parameters.
During downmixing, as we have seen, the adjustment of Dynamic Range Control parameters is limited. Broadly speaking, the stereo outputs use the Line mode compression profile while the mono signal uses RF mode compression. As with dynamic range control, downmixing is ultimately dependent upon each consumer’s unique listening environment.
While the engineer must optimize the multichannel mix for reproduction in an ideal listening environment, it is also important to preview the mix in downmixing conditions to ensure compatibility with different playback systems when selecting the downmixing metadata parameters. These previews can be achieved in real time using the DP570 Multichannel Audio Tool.

5 Parameter Definitions
This section explains both professional and consumer metadata parameters in greater detail.
Metadata parameters include:
• Universal parameters
• Extended Bitstream Information (Extended BSI) parameters
Extended BSI parameters are active only when both the producer chooses to use them and the consumer’s decoder is capable of reading them. All decoders can successfully decode a metadata stream without Extended BSI parameters, and Extended BSI parameters translate seamlessly to decoders that read only universal parameters. Note: Universal parameters include both professional and consumer metadata.

5.1 Universal Parameters
All universal parameters are supported by Dolby E encoders and decoders; all except Program Configuration and Program Description Text are supported by all Dolby Digital encoders and decoders.

Program Configuration
This parameter determines how the audio channels are grouped within a Dolby E bitstream. Up to eight channels can be grouped together in individual programs, where each program contains its own metadata. The default setting is 5.1 + 2. Program Description Text This parameter is a 32-character ASCII text field that allows the metadata author to enter a description of the audio program. For example, this field may contain the name of the program (Movie Channel Promo), a description of the program source (Football Main Feed), or the program language (Danish).

Dialogue Level
The Dialogue Level parameter is discussed in Section 2, Dialogue Level.

Channel Mode
This parameter (also known as Audio Coding mode) indicates the active channels within the encoded bitstream and affects both the encoder and consumer decoder. This parameter instructs the encoder which inputs to use for this particular program; it tells the decoder what channels are present in this program so the decoder can deliver the audio to the correct speakers. The setting is described as X/Y, where X is the number of front channels (Left, Center, Right) and Y the number of rear (Surround) channels. The availability of certain channel modes depends on the Dolby Digital encoder data rate and whether the LFE channel is present. For example, you can’t have a mono stream with an LFE channel (1.1!) or a 3/2 stream at 96 kbps. Appropriate data rates are shown in the definition of each setting.

Note: The presence of the LFE channel is indicated through a different metadata parameter (see LFE Channel).

Channel
Mode
Setting Definition and Data Rate 1+1 Dual mono (not valid for DTV broadcast or DVD production)
1/0 Mono From 56 kbps, usually 96 kbps
2/0 Stereo From 96 kbps, usually 192 kbps
3/0 From 256 kbps
2/1 From 256 kbps
3/1 From 320 kbps
2/2 From 320 kbps
3/2 From 384 kbps, often 448 kbps

LFE Channel
The status of the LFE Channel parameter indicates to a Dolby Digital encoder whether an LFE Channel is present within the bitstream. Channel mode determines whether the LFE Channel parameter can be set. You must have at least three channels to be able to add an LFE channel.
LFE Channel Setting Enabled Disabled

Bitstream Mode
This parameter describes the audio service contained within the Dolby Digital bitstream. A complete audio program may consist of a main audio service (a complete mix of all the program audio), an associated audio service comprising a complete mix, or one main service combined with an associated service. To form a complete audio program, it may be (but rarely is) necessary to decode both a main service and an associated service using a maximum total bit rate of 512 kbps. Refer to the Guide to the Use of the ATSC Digital Television Standard, Document A/54 (see www.atsc.org) for further information. Although a detailed description of each option follows, in practice, most programming uses the default setting, Complete Main. An example of an exception to this rule is a special karaoke DVD, or an emergency service within digital television.


Bitstream Mode Setting Definition
Complete Main
(CM)
CM flags the bitstream as the main audio service for the program and indicates that all elements are present to form a complete audio program. Currently, this is the most common setting. The CM service may contain
from one (mono) to six (5.1) channels. Main M&E (ME) The bitstream is the main audio service for the program, minus a dialogue channel. The dialogue channel, if any, is intended to be carried by an associated dialogue service. Different dialogue services can be associated with a single ME service to support multiple languages.

Assc. Visual Imp.
(VI)
This is typically a single-channel program intended to provide a narrative description of the picture content service. The VI service may also be a complete mix of all program channels, comprising up to six channels. Assc. Hear Imp. (HI) This is typically a single-channel program intended to convey audio that has been processed for increased intelligibility and decoded along with the main audio service. The HI service may also be a complete mix of all program channels, comprising up to six channels.

Assc. Dialogue (D) This is typically a single-channel program intended to provide a dialogue
channel for an ME service. If the ME service contains more than two channels, the D service is limited to only one channel; if the ME service is two channels, the D service can be a stereo pair. The appropriate channels of each service are mixed together (requires special decoders).

Assc. Commentary (C)
This is typically a single-channel program intended to convey additional commentary that can be optionally decoded along with the main audio service. This service differs from a dialogue service because it contains an optional, rather than a required, dialogue channel. The C service may also be a complete mix of all program channels, comprising up to six channels.

Assc. Emergency (E)
This is a single-channel service that is given priority in reproduction. When the E service appears in the bitstream, it is given priority in the decoder and the main service is muted.

Assc. Voice Over
(VO) This is a single-channel service intended to be decoded and mixed to the Center channel (requiresspecial decoders).

Main Sv Karaoke (K)
The bitstream is a special service for karaoke playback. In this case, the Left and Right channels contain music, the Center channel has a guide melody, and the Left and Right Surround channels carry optional backing vocals.
Line Mode Compression Profile
Line mode is discussed in Section 3, Dynamic Range Control.
RF Mode Compression Profile
RF mode is discussed in Section 3, Dynamic Range Control.
Dolby Laboratories, Inc. Metadata Guide

RF Overmodulation Protection
This parameter is designed to protect against overmodulation when a decoded Dolby Digital bitstream is RFmodulated. When enabled, the Dolby Digital encoder includes
pre-emphasis in its calculations for RF Mode compression. The parameter has no effect when decoding using Line mode compression. Except in rare cases, this parameter should be disabled.

RF Overmodulation Protection Setting Enabled Disabled

Center Downmix Level
When the encoded audio has three front channels (L, C, R), but the consumer has only two front speakers (left and right), this parameter indicates the nominal downmix level for the Center channel with respect to the Left and Right channels. Dolby Digital decoders use this parameter during downmixing in Lo/Ro mode when Extended BSI parameters are not active.

Center Downmix Level
Setting Definition
0.707 (–3 dB) default The Center channel is attenuated 3 dB and
sent to the Left and Right channels.
0.596 (–4.5 dB) The Center channel is attenuated 4.5 dB and
sent to the Left and Right channels.
0.500 (–6 dB) The Center channel is attenuated 6 dB and
sent to the Left and Right channels.
__________________
ms georgia hilton mpse cas
Creative Director
Leviathan Productions
World Wide Audio Inc
NY NY
www.leviathan.us.com


When I am laid in earth, may my wrongs create no trouble in thy breast. Remember me, but ah, forget my fate. - Dido and Aeneas, ARIA
georgia is offline   Reply With Quote
Old 5th February 2007, 07:09 PM   #9
georgia
Gear addict
 
georgia's Avatar
 
Join Date: Dec 2006
Location: NY NY
Posts: 430
iTrader: (0)
the last of the metadata data... ;)

RF Overmodulation Protection Setting Enabled Disabled

Center Downmix Level
When the encoded audio has three front channels (L, C, R), but the consumer has only two front speakers (left and right), this parameter indicates the nominal downmix level for the Center channel with respect to the Left and Right channels. Dolby Digital decoders use this parameter during downmixing in Lo/Ro mode when Extended BSI parameters are not active.

Center Downmix Level
Setting Definition
0.707 (–3 dB) default The Center channel is attenuated 3 dB and
sent to the Left and Right channels.
0.596 (–4.5 dB) The Center channel is attenuated 4.5 dB and
sent to the Left and Right channels.
0.500 (–6 dB) The Center channel is attenuated 6 dB and
sent to the Left and Right channels.

Surround Downmix Level
When the encoded audio has one or more Surround channels, but the consumer does not have surround speakers, this parameter indicates the nominal downmix level for the Surround channel(s) with respect to the Left and Right front channels. Dolby Digital decoders use this parameter during downmixing in Lo/Ro mode when Extended BSI parameters are not active.

Surround Downmix Level Setting Definition
0.707 (–3 dB) default The Left and Right Surround channels are each attenuated 3 dB and sent to the Left and Right front channels, respectively. 0.5 (–6 dB) Same as above, but the signal is attenuated 6 dB.
0 (–999 dB) The Surround channel(s) are discarded.

Dolby Surround Mode
This parameter indicates to a Dolby Digital decoding product that also contains a Dolby Pro Logic decoder (for example a 5.1-channel amplifier), whether or not the two-channel encoded bitstream contains a Dolby Surround (Lt/Rt) program that requires Pro Logic decoding. Decoders can use this flag to automatically switch on Pro Logic decoding as required.

Dolby Surround Mode
Setting Definition
Not Dolby Surround The bitstream contains information that was not
encoded in Dolby Surround. Dolby Surround The bitstream contains information that was encoded in Dolby Surround. After Dolby Digital decoding, the bitstream is decoded using Pro Logic.
Not Indicated There is no indication either way.

Audio Production Information
This parameter indicates whether the mixing level and room type values are valid. If Yes, then a receiver or amplifier could use these values as described below. If No, then the values in these fields are invalid. In practice, only high-end consumer equipment implements these features.

Audio Production Information Setting Definition Yes Mixing Level and Room Type parameters are valid.
No Mixing Level and Room Type parameters are invalid and should be ignored.

Mixing Level
The Mixing Level parameter describes the peak sound pressure level (SPL) used during the final mixing session at the studio or on the dubbing stage. The parameter allows an amplifier to set its volume control such that the SPL in the replay environment matches that of the mixing room. This control operates in dialogue level control, and is best thought of as the final volume setting on the consumer’s equipment. This value can be determined by measuring the SPL of pink noise at studio reference level and then adding the amount of digital headroom above that level. For example, if 85 dB equates to a reference level of –20 dBFS; the mixing level is 85 + 20, or 105 dB.

Mixing Level Setting
80 to 111 dB in 1 dB increments

Room Type
The Room Type parameter describes the equalization used during the final mixing session at the studio or on the dubbing stage. A Large room is a dubbing stage with the industry standard X-curve equalization; a Small room has flat equalization. This parameter allows an amplifier to be set to the same equalization as that heard in the final mixing environment.

Room Type Setting
Not Indicated
Large
Small

Copyright Bit
This parameter indicates whether the encoded Dolby Digital bitstream is copyright protected. It has no effect on Dolby Digital decoders and its purpose is purely to provide information.
Copyright Bit Setting Yes No

Original Bitstream
This parameter indicates whether the encoded Dolby Digital bitstream is the master version or a copy. It has no effect on Dolby Digital decoders and its purpose is purely to provide information.
Original Bitstream Setting Yes No

Note: The parameters DC Filter, Lowpass Filter, LFE Lowpass Filter, Surround 3 dB Attenuation, and Surround Phase Shift appear after the Extended BSI parameters on Dolby E and Dolby Digital equipment menus.

DC Filter
This parameter determines whether a DC-blocking 3 Hz highpass filter is applied to the main input channels of a Dolby Digital encoder prior to encoding. This parameter is not carried to the consumer decoder. It is used to remove DC offsets in the program audio and would only be switched off in exceptional circumstances.

DC Filter Setting Enabled Disabled

Lowpass Filter
This parameter determines whether a lowpass filter is applied to the main input channels of a Dolby Digital encoder prior to encoding. This filter removes high- frequency signals that are not encoded. At the suitable data rates, this filter operates above 20 kHz. In all cases it prevents aliasing on decoding and is normally switched on. This parameter is not passed to the consumer decoder.

Lowpass Filter Setting Enabled Disabled

LFE Lowpass Filter
This parameter determines whether a 120 Hz eighth-order lowpass filter is applied to the LFE channel input of a Dolby Digital encoder prior to encoding. It is ignored if the LFE channel is disabled. This parameter is not sent to the consumer decoder. The filter removes frequencies above 120 Hz that would cause aliasing when decoded. This filter should only be switched off if the audio to be encoded is known to have no signal above 120 Hz.
LFE Lowpass Filter Setting Enabled Disabled

Surround 3 dB Attenuation
The Surround 3 dB Attenuation parameter determines whether the Surround channel(s) are attenuated 3 dB before encoding. The attenuation actually takes place inside the Dolby Digital encoder. It balances the signal levels between theatrical Dolby Laboratories, Inc. Metadata Guide mixing rooms (dubbing stages) and consumer mixing rooms (DVD or TV studios). Consumer mixing rooms are calibrated so that all five main channels are at the same sound pressure level (SPL). To maintain compatibility with older film formats,
theatrical mixing rooms calibrate the SPL of the Surround channels 3 dB lower than the front channels. The consequence is that signal levels on tape are 3 dB louder. Therefore, to convert from a theatrical calibration to a consumer mix, it is necessary to reduce the Surround levels by 3 dB by enabling this parameter.
Surround 3 dB Attenuation Setting Enabled Disabled

Surround Phase Shift
This parameter causes the Dolby Digital encoder to apply a 90-degree phase shift tothe Surround channels. This allows a Dolby Digital decoder to create an Lt/Rt downmix simply. For most material, the phase shift has a minimal impact when the Dolby Digital program is decoded to 5.1 channels, but it provides an Lt/Rt output that can be decoded with Pro Logic to L, C, R, S, if desired. However, for some phase- critical material (such as music) this phase shift is audible when listening in a 5.1- channel format. Likewise, some material downmixes to a satisfactory Lt/Rt signal without needing this phase shift. It is therefore important to balance the needs of the 5.1 mix and the Lt/Rt downmix for each program. The default setting is Enabled.
Surround Phase Shift Setting Enabled Disabled

5.2 Extended Bitstream Information Parameters
In response to requests from content producers, Dolby Laboratories modified the definitions of several metadata parameters from their original definition as described in ATSC document A/52. The revised definitions allow more information to be carried about the audio program and also allow more choices for stereo downmixing. When the metadata parameters carried in Dolby Digital were first described, they were generically called Bitstream Information, or BSI. We refer to the additional parameter definitions as Extended BSI. Because the revised definitions affect metadata parameters that were not used by the consumer decoders, all decoders will be compatible with the revised bitstream. Newer decoders that are programmed to detect and decode the new parameters will be able to implement the new features Extended BSI provides.

Products that allow emulation of the effects of metadata, such as the DP570, normally have a feature that allows emulation of a new (or compliant) decoder or a legacy decoder.

Preferred Stereo Downmix Mode
This parameter allows the producer to select either the Lt/Rt or the Lo/Ro downmix in a consumer decoder that has stereo outputs. Consumer receivers are able to override this selection, but this parameter provides the opportunity for a 5.1-channel soundtrack to play in Lo/Ro mode without user intervention. This is especially useful on music material.

Preferred Stereo Downmix Mode
Setting
Not Indicated
Lt/Rt Preferred
Lo/Ro Preferred

Lt/Rt Center Downmix Level
This parameter indicates the level shift applied to the Center channel when adding to the left and right outputs as a result of downmixing to an Lt/Rt output. Its operation is similar to the center downmix level in the universal metadata.

Lt/Rt Center Downmix Level Setting
1.414 (+3.0 dB)
1.189 (+1.5 dB)
1.000 (0.0 dB)
0.841 (–1.5 dB)
0.707 (–3.0 dB)
0.595 (–4.5 dB)
0.500 (–6.0 dB)
0.000 (–999 dB)

Dolby Laboratories, Inc. Metadata Guide
23
Lt/Rt Surround Downmix Level
This parameter indicates the level shift applied to the Surround channels when downmixing to an Lt/Rt output. Its operation is similar to the surround downmix level in the universal metadata.

Lt/Rt Surround Downmix Level Setting
0.841 (–1.5 dB)
0.707 (–3.0 dB)
0.595 (–4.5 dB)
0.500 (–6.0 dB)
0.000 (–999 dB)

Lo/Ro Center Downmix Level
This parameter indicates the level shift applied to the Center channel when adding to the left and right outputs as a result of downmixing to an Lo/Ro output. When Extended BSI parameters are active, this parameter replaces the Center Downmix Level parameter in the universal parameters.

Lo/Ro Center Downmix Level Setting
1.414 (+3.0 dB)
1.189 (+1.5 dB)
1.000 (0.0 dB)
0.841 (–1.5 dB)
0.707 (–3.0 dB)
0.595 (–4.5 dB)
0.500 (–6.0 dB)
0.000 (–999 dB)

Lo/Ro Surround Downmix Level
This parameter indicates the level shift applied to the Surround channels when downmixing to an Lo/Ro output. When Extended BSI parameters are active, this parameter replaces the Surround Downmix Level parameter in the universal parameters.

Lo/Ro Surround Downmix Level Setting
0.841 (–1.5 dB)
0.707 (–3.0 dB)
0.595 (–4.5 dB)
0.500 (–6.0 dB)
0.000 (–999 dB)

Surround EX Mode
This parameter is used to identify the encoded audio as material encoded in Surround
EX TM
. This parameter is only used if the encoded audio has two Surround channels. An amplifier or receiver with Dolby Digital Surround EX decoding can use this parameter as a flag to switch the decoding on or off automatically. The behavior is similar to that of the Dolby Surround Mode parameter.
Surround EX Mode
Not Indicated
Not Surround EX
Dolby Surround EX

A/D Converter Type
This parameter allows audio that has passed through a particular A/D conversion stage to be marked as such, so that a decoder may apply the complementary D/A process.
A/D Converter Type Setting Standard HDCD
__________________
ms georgia hilton mpse cas
Creative Director
Leviathan Productions
World Wide Audio Inc
NY NY
www.leviathan.us.com


When I am laid in earth, may my wrongs create no trouble in thy breast. Remember me, but ah, forget my fate. - Dido and Aeneas, ARIA
georgia is offline   Reply With Quote
Old 6th February 2007, 04:36 PM   #10
georgia
Gear addict
 
georgia's Avatar
 
Join Date: Dec 2006
Location: NY NY
Posts: 430
iTrader: (0)
some thoughts on Pulldown/Pullups

Hi, some random data on pullups and pulldowns... I don't remember where I found this, but, I found it interesting...

When some engineer had the bright idea of operating the NTSC television system at 59.94 fields per second rather than a nice simple 60, life became more complicated for the sound guys. Telecine is the culprit!! 24 pictures in one second on film do not fit elegantly into 30 available video frames in one second. To complicate things further the video system does not use real seconds. Consequently the film on a Telecine machine is being played at 23.9 frames per second NOT 24, as it was shot. As we discovered when we introduced Time Code to the 1/4" analog tracks, the production mixer had to use 30 FPS time code during recording so that the post house could slow the tape down 0.1% by synchronizing it with a 29.94 time code. That made picture and sound synchronous, if not on speed. Good enough for TV! The term PULL DOWN was born.

Now we have those cool little DAT machines. They operate with rotary scanning heads and internal clock frequencies and other things reminiscent of Video Recorders. Scary `eh what?

Because of the way DAT machines work, they are so stable that no resolving is necessary to achieve synch. That is, the DAT machine will provide a constant speed playback exceeding that of an analog 1/4" tape with crystal sync. Sync requires that the picture also have the same accuracy in speed or that both picture and sound have the same inaccuracy in speed.

In traditional film production the DAT tape needs only to be played back and transferred to mag film without any resolving, providing that the mag film recorder is running from a crystal controlled motor and the camera was, likewise, "Crystal Sync".

Professional DAT machines provide analog and AES/EBU digital outputs. They also usually operate at one of two sampling frequencies 48KHz and 44.1 KHz. Most dialog recording for motion Pictures and Television is done at 48 KHz. The music industry uses 44.1 KHz. to match their CDs.

Which sampling rate to use should be discussed with the production (particularly the post-production) people before beginning the shoot.

In MOST situations the Time Code DAT machine will behave in much the same way as the Time Code 1/4" analog machine did as long as only the analog output is considered. That is, record with 30FPS TC on the set and the Telecine guys will "PULL DOWN" by synchronizing the playback with 29.94XX Time Code. Picture and Sound match...everyone is happy!!

Some engineer will say "We have all this neat digital sound on the tape, why convert it to analog so soon"? He has a point...

Digital audio signals are at a specific sample rate or frequency. If they vary from their specified frequency nothing comes out. Changing the sampling rate from the nominal 48 or 44.1 should only be attempted by professional drivers under controlled conditions. We equipment manufacturers provide adequate horsepower to get you into trouble....you have to know how to stay safe!

The D2 DIGITAL video recorder has a 48KHz. audio track. It must have 48 KHz. or it accepts nothing. If the production mixer records his track with a 48KHz. sampling rate and the Telecine PULLS it DOWN to 47.952 KHz. there is no audio to record onto the videotape. Conventional practice is to convert the digital to analog, transfer the analog to the D2 and then reconvert the analog back to digital. It works.

A more daring solution would be for the production mixer to "PULL UP" on location, and record at 48.048 KHz. (with a 30FPS time code). Then the "PULL DOWN" in telecine would produce a digital audio signal of 48 KHz. making the D2 machine very happy. Remember that this makes the DAT tape totally useless for normal playback.

Many productions are being edited on digital workstations (AVID, LIGHTWORKS etc.) They accept the picture from videotape that is already "PULLED DOWN" by the Telecine process. The sound needs to be "PULLED DOWN" as well to match the picture. These machines are all digital so they are happiest with a digital audio source. A tape recorded at 48 KHz. and then "PULLED DOWN" to 47.952 is useless to them. They have to go through the digital to analog and analog to digital conversion to get the 48 KHz. they need. Had the production mixer "PULLED UP" and recorded at 48.048 the editor could directly down load the digital audio "PULLED DOWN" to 48 KHz. Remember, the DAT cassette will not provide a normal digital signal at nominal speed when recorded "PULLED UP". For a production where the final product is a videotape, this should work. If it is a film production which will be released in film the technique needs more thought...

The picture is "PULLED DOWN" by 0.1% to edit as video. The Key Code cut list for the negative cutters effectively provides the "PULL UP" to bring the film back to 24FPS. If the sound has been recorded with a "PULL UP" to accommodate the editors, it will be impossible to provide a digital signal at nominal speed to lay back to the film. At this point an analog signal will be all that is required and all is well. If, however, the post guys want to stay in digital (I refuse to use the term "DIGITAL DOMAIN") it would have been prudent to have made a nominal speed (48 KHz.) digital dub during the "PULL DOWN" for the editing process.

Music video is another "can of worms", probably on the set as well as on the sound cart. The music production company gives you a DAT cassette for playback with Time Code at 29.97. You know that the film, shot at 24FPS will be "PULLED DOWN" to 23.9 in telecine, but not the sound (it's already made with 29.97 time code) What to do? "PULL UP" the DAT on playback so the sound is 0.1% faster than original. The actors will move 0.1% faster. In telecine the "PULL DOWN" on the film will make the action correct and properly match the sound. The "PULL UP" on playback also makes the 29.97 Time Code into 30 for the electronic slate. Be sure to use a cable or radio link to the electronic slate as "jamming" the generator will not be accurate.

Perhaps it would be worth putting up with the flicker and the eight field color phase of PAL to be able to work at 25FPS for everything.

Maybe simply changing NTSC to a true 60FPS and shooting film at 30FPS with 3 perf pull down would do it ... anyone ever thought of that.....?

Simply put .... film runs at 24 frames per second through a film camera. (That is 90 feet per minute for 35mm) Film runs at 23.976 frames per second through an NTSC Telecine transfer machine. That is 0.1% slow. To make the sound fit the slowed down picture the sound must also be slowed down by 0.1%. Pretty simple eh? It works with no problem in the analog world. The sound is just a bit slow and slightly lower pitched. No big deal. (unless you are a musician with "Golden Ears") However, in the digital world we should keep our sampling rates constant if we intend to use the AES/EBU digital signal for anything practical. That means that the DAT played back at 0.1% slow will have a digital sampling rate also 0.1% slow. Not Good! The simple solution is .... RECORD the material at 0.1% FAST (48.048KHz. instead of 48.00KHz). When the Telecine transfer process slows it down by 0.1% to match the picture the DAT plays back with a 48.00KHz sampling rate... the AES/EBU standard. In fact the cassette recorded at 48.048 will play back at 48.00 on a standard DAT machine ... great for downloading into a Digital Editing system. No bit rate conversion ... higher quality ... simpler.

What about Time Code? If you use 30 Non Drop code at 48.048 it becomes 29.97 Non Drop code when played back at 48.00. If you use 30 Drop Frame code at 48.048 it becomes 29.97 Drop Frame when played at 48.00.

Sync playbacks for Music Videos are basically the opposite, but, usually using 44.1KHz sampling rates. Some people suggest that the music suppliers should produce a special cassette at 44.056 with 29.97 time code. This, played at 44.1 will produce a playback 0.1% fast with 30 fps time code. It will work but a second DAT at 44.1 will be needed for Telecine or other transfers. It is much simpler to have the music suppliers simply supply a standard 44.1KHz 29.97fps (Drop or Non Drop) time code cassette. They need do nothing special or different; then play it back on the set at 44.144KHz. This speeds it up the necessary 0.1% and provides the 30fps time code for the slate. No one, other than the playback mixer has to do anything special and he is, after all, the "specialist" anyway.


cheers
geo
__________________
ms georgia hilton mpse cas
Creative Director
Leviathan Productions
World Wide Audio Inc
NY NY
www.leviathan.us.com


When I am laid in earth, may my wrongs create no trouble in thy breast. Remember me, but ah, forget my fate. - Dido and Aeneas, ARIA
georgia is offline   Reply With Quote
Old 14th February 2007, 12:57 PM   #11
jahtao
Gear maniac
 
jahtao's Avatar
 
Join Date: Dec 2005
Location: London, UK
Posts: 288
iTrader: (0)
Tons of essential info on here. Big up!
jahtao is offline   Reply With Quote
Old 21st February 2007, 01:37 PM   #12
georgia
Gear addict
 
georgia's Avatar
 
Join Date: Dec 2006
Location: NY NY
Posts: 430
iTrader: (0)
some thoughts from a class I teach for beginning indie film makers

Introduction to film sound OUTLINE AND NOTES FOR LECTURE


Film is by nature a visual experience
Even so you cannot afford to underestimate the importance of film sound.
A Meaningful sound track is often as complicated as the image on the screen.

The Sound Mix
Sound Mix is comprised of three areas
• the human voice
• sound effects
• music

These three area (STEMS) must be mixed and balanced so as to produce the necessary emphases which in turn create desired effects and emotions.

Film sound is comprised of conventions and innovations.

We expect an acceleration of music during car chases and creaky doors in horror films. Effects of sound are often subtle and noted only by subconscious. foster an awareness of film sound as well as picture in creating the entire story

The Acoustic World
 sound film reveals our acoustic environment
 the speech of things
 whisperings of nature
 muttering of the sea to the din of a city
 roar of machinery to gentle patter of rain on a windowpane
 The meaning of a floorboard creaking in a deserted room
 a bullet whistling past our ear
 the explosion or the car crash that overwhelms our ears
 the deathwatch beetle ticking in old furniture
 the forest spring tinkling over the stones

IMHO
Sound in film allows the story to have a voice!

Sound & Silence
Preparing for sound design in a film can teach us to analyze day to day noise with our ear and read the sound of life's symphony.
Our ears hear the different voices, and sounds in the general din of day to day and distinguish teach person or thing as a unique character
The director can lead up with both eye and ear to where he/she want us to be both physically and emotionally in a