Sunday, December 23, 2012

DV vs. Betacam SP: 4:1:1 vs. 4:2:2, Artifacts and Other Controversies

Background: In the late 1990s, I became interested in digital videography and purchased for about $2,500 a Sony DCR-VX1000 Mini-DV camcorder, which was the first video device to offer a FireWire (later IEEE-1394) digital output:


imageI was a consultant to Adaptec Corp. for its AHA-8940, the first commercial consumer-oriented FireWire adapter for Wintel PCs and was a contributor to Microsoft’s PC98 System Design Guide for DV disk recording and editing with Windows 98. I also was a beta tester for early versions of Adobe Premiere for Windows video editing software.

I intended to write a Special Edition Using Windows Desktop Video title for QUE Publishing about professional/consumer (“prosumer”) videography using the Mini-DV (DigitalVideo) tape format and the IEEE-1394 bus, but updating my Special Edition Using Microsoft Access and Microsoft Access In Depth series (now in its 14th edition) consumed my available spare time.

This archive is the second of my three lengthy documents about Sony’s implementation of the IEEE-1394 standard and Mini-DV recording format. IEEE-1394 has been superseded by the lower-cost Universal Serial Bus 2.0 and 3.0. The DV data and Mini-DV recording formats were made obsolete by HDTV broadcasting and consumer camcorders recording to high-capacity flash memory in the MPEG-4 AVC format. Other related articles are:
Copyright Ó 1997 Roger Jennings, all rights reserved. Written May 20, 1997. This document may be copied and/or distributed for non-commercial purposes only.

Sony Corp.'s Betacam SP format is the standard of comparison of video tape recording formats. According to Sony Europe, more than 350,000 Betacam SP devices have been sold world-wide. The majority of broadcast electronic news gathering (ENG) operations currently use Betacam SP camcorders and VTRs. Virtually all broadcast stations require (or at least strongly prefer) Betacam SP source footage. Most clients of professional video production firms specify Betacam SP for industrial shoots and are likely to require videographers to use Sony or Ikegami camcorders. Although the M-II format from Panasonic Broadcast and Digital (formerly Television) Systems Company (PB&DSC) offers about the same performance as Betacam SP, Sony and Betacam SP are untouchable when it comes to brand recognition and status. As a result, all other video recording formats are ranked as "not up to Beta SP," "equal to Betacam SP," or "better than Beta SP." These comparisons, based on the beholders' perception of image quality, are reminiscent of the meaningless "broadcast quality" and "studio quality" bullet points on advertisements for consumer and low-end prosumer video gear.

The advent of the Digital Video (DV) format has ignited a controversy among current and prospective users of DV gear. Initially, arguments appeared regarding the "legality" of broadcasting NTSC DV's 480 instead of 483 active lines of video. Obviously, if broadcasting less than 483 active lines was illegal, all U.S. stations transmitting letterboxed movies would have by now lost their licenses. The subsequent controversy, DV's 4:1:1 vs. ITU-R BT.601-4 (formerly CCIR-601) 4:2:2 sampling, has generated thousands of messages in on-line forums, newsgroups, and listservers. This paper represents an attempt to dispel the rumor and innuendo surrounding the 4:1:1 versus 4:2:2 issue, especially as it relates to the "Is DV better (or worse) than Betacam SP?" controversy, and DV compression artifacts.

Horizontal Resolution and Bandwidth of Analog Recording Formats

Traditionally, resolution of analog video cameras, recording formats, and monitors has been expressed as "TV lines" or "horizontal lines." Originally, TV lines were determined by the ability to clearly distinguish sets of vertical lines in a standard TV test pattern. Subsequently, horizontal resolution became related to the bandwidth of the recording format. The generally-accepted "rule of thumb" is 80 lines per MHz of luminance bandwidth for recording formats. The following table lists the lines of horizontal resolution of various recording formats, based on Sony and other manufacturers' data sheets, with the NTSC broadcast signal added for comparison. Bandwidth is implied from the "rule of thumb."

Horizontal Lines Implied Bandwidth, MHz
VHS and 8-mm VCRs
U-matic (3/4-inch) VCRs
U-matic SP VCRs
NTSC Broadcast Signal
S-VHS and Hi8 VCRs
DVD Video
DV Formats

  • Note: DV formats include consumer DV, DVCPRO (25 Mbps), and DVCAM, all of which use the same video data format for NTSC. Horizontal resolution of professional-grade three-CCD video cameras (often 750 or more) are related to the number of pixels per CCD, the type of CCD (field interline transfer, HyperHAD, etc.), the spatial relationship between the three CCDs, and internal signal processing techniques. Increasing the resolution of the camera section results in improved perceived image quality, regardless of the resolution of the recording format used. Horizontal resolution of monitors is determined by a combination of the aperture mask dot (or grill) pitch and design, as well as the video signal bandwidth. Sony claims its popular PVM-1354Q (0.25-mm pitch) and PVM-1954Q (0.40-mm pitch) monitors deliver 600-line resolution, corresponding to a theoretical luma bandwidth of 7.5 MHz. Another rule of thumb is that your monitor's resolution (or bandwidth) should exceed that of the recording format, preferably by 25 percent or more.

Commencing with introduction of the Betacam format, Sony and its competitors ceased publishing horizontal resolution values for professional and broadcast recording formats. The reason for doing so is clear: Using 80 lines/MHz would have positioned oxide Betacam, with it's 4.0 MHz luminance bandwidth, at 320 lines (inferior to U-matic SP.) Even worse, metal-particle Betacam SP would fall below consumer-grade S-VHS and Hi8 TV-line ratings. Both oxide Betacam and Betacam SP produce image quality superior to S-VHS and Hi8 formats, primarily because of Betacam's separate recording tracks for luminance (Y) and chroma (Cr and Cb), rather than the color-under modulation method of S-VHS and Hi8. Thus the two licensees of Thomson's original Betacam technology, Sony and Philips BTS, and their erstwhile competitors (Panasonic and JVC with M-II) began using bandwidth and signal-to-noise ratio (SNR) specifications for the luma and chroma components, instead of specifying TV-lines. The following table lists the published values for luma and chroma bandwidth, plus SNR (in dB) for several Betacam and related VTRs.

Product Line Luma Bandwidth Chroma Bandwidth Luma SNR Chroma SNR
Sony UVW 30 Hz – 4.0 MHz  +1.0/-4.0 dB 30 Hz – 1.5 MHz  +1.0/-4.0 dB > 49 dB > 52 dB
Sony PVW/PVV 30 Hz – 4.5 MHz  +0.5/-4.0 dB 30 Hz – 1.5 MHz  +0.5/-3.0 dB > 51 dB > 53 dB
BTS SP 2000 30 Hz – 4.5 MHz  +0.5/-4.0 dB 30 Hz – 1.5 MHz  +0.5/-4.0 dB > 51 dB > 53 dB
Panasonic M-II W-Series 30 Hz – 4.5 MHz  +1.0/-4.0 dB 30 Hz – 1.5 MHz  +0.5/-3.0 dB > 49 dB > 52 dB

Video recording bandwidth is determined by the modulation method, tape particle size, record/reproduce head design, internal electronic circuitry, and a variety of other factors. Maximum luma bandwidth is reported by the manufacturers at the point at which the signal falls to 63 percent of its original value (-4.0 dB). The high frequency content of the image, such as very small objects with sharply-defined outlines (leaves of trees in backgrounds, blades of grass, and the like) is attenuated in the recording process. The visible effect of attenuation is a combination of lower brightness levels and blurring of tiny objects. Maximum chroma bandwidth is reported at the 63 percent or 71 percent (-3.0 dB) loss point. The human eye is less sensitive to variations in color than in brightness, so the reduced chroma bandwidth gives acceptable perceived image quality. Betacam SP and Panasonic M-II represent the upper limit of quality for commercial analog video recording technology. These formats sustain several reproduce-record generations without obvious image degradation.

Note: The formula for determining the dB voltage values used in bandwidth and SNR measurements is dB = 20 log (Vout/Vin) and dB = 20 log (Vnoise/Vmax), respectively. This formula differs from the dB values used for audio power ratio specifications, dB = 10 log (Pmeas/Pref).

Signal-to-noise ratio is a small-signal effect; thus SNR is important for low-light scenes shot without camera gain boost. Small differences in SNR dB ratings are deceptive—in linear terms, 51 dB (355:1) is about 1.3 times better than 49 dB (281:1). The luma SNR of professional Hi8 VTRs is > 45 dB (Sony EVO-9720 and EVO-9850) and most pro S-VHS decks claim > 46 dB. Noise levels are cumulative in dubbing, so the better SNR of Betacam SP contributes to its multi-generation capability.

Digital Sampling, Analog Bandwidth, and SNR

In the uncompressed digital video realm, sampling rate is conceptually analogous to bandwidth. The standard ITU-R BT.601-4 (D-1, 4:2:2) luma sampling rate for NTSC and PAL video is 13.5 MHz and chroma sampling occurs at half the luma rate or 6.75 MHz. There are two chroma signals (Cr and Cb), so the resulting digital bitstream is composed of equal quantities of luma and chroma data. The Nyquist theorem states that the sampling frequency must be at least twice the highest frequency component of the sampled analog signal. Thus the theoretical maximum luma bandwidth of D-1 is 6.75 MHz and the chroma bandwidth of the two components is 3.375 MHz each. Low-pass filtering is required to prevent aliasing in the digitized signals, so the resulting bandwidths are less than the Nyquist limit. Digital filters using high-performance DSPs (digital signal processors) can provide an analog-equivalent maximum luma bandwidth of about 6.0 MHz, +0.5/– 3.0 dB and a maximum chroma bandwidth in the range of 3.0 MHz, +0.5/-3.0 dB.

DV samples luma at D-1's 13.5 MHz and chroma at one-half the D-1 rate or 3.375 MHz. DV-NTSC uses 4:1:1 sampling in which the chroma is sampled once for every four horizontal luminance samples. DV-PAL uses 4:2:0 sampling, which uses the average value of the chroma signal between successive horizontal samples and scan lines. According to the Japanese Digital Video Consortium, 4:2:0 sampling gives better perceived image quality with PAL's increased number of vertical scan lines. (Panasonic's DVCPRO components use 4:1:1 sampling for both NTSC and PAL.) The ATSC MPEG-2 formats for DTV, including HDTV, also use 4:2:0 sampling. Figure 1 compares 4:2:2, 4:1:1, and 4:2:0 chroma sampling.


Figure 1. A comparison of chroma sampling in 4:2:2, 4:1:1, and 4:2:0 formats.

DV's reduced chroma sampling rate results in a theoretical chroma bandwidth of 1.688 MHz. With digital filtering, the analog-equivalent chroma bandwidth is about 1.5 MHz, +0.5/-3.0 dB. Currently, all digital formats must be converted to analog NTSC or PAL formats for transmission, display on conventional TV monitors, and/or recording to analog tape. The following table lists published bandwidth specifications for the component (YCrCb) output of high-end DVCPRO and DVCAM DVTRs, both of which use the consumer DV data format.

DVTR Model Luma Bandwidth Chroma Bandwidth Signal-to-Noise Ratio
Sony DSR-85 (DVCAM) 5.0 MHz 
+1.0/-1.0 dB
1.5 MHz
+0/-5.0 dB
> 55 dB YCrCb I/O
Panasonic AJ-D750 (DVCPRO) 30 Hz – 5.5 MHz 
+0.5/-0.5 dB 

30 Hz – 5.75 MHz 
+0.5/-2.0 dB
30 Hz – 1.3 MHz  +0.5/-0.5 dB 

30 Hz – 1.5 MHz  +0.5/-5.0 dB
> 60 dB DV -> YCrCb 
> 55 dB YCrCb I/O

The bandwidth values listed in the above table compare more than favorably with mid-range Betacam SP and M-II VTRs, and luma bandwidth exceeds that of the popular UVW-1800 Betacam SP editing recorder by a substantial margin. Reversing the bandwidth analogy, Betacam SP might be considered the equivalent of a 3:1:1 digital format. (4.5/5.8 MHz * 4 = 3.10).

Note: Conversion of bandwidth in MHz to data rate in Mbps isn't valid. Various radio-frequency modulation techniques, such as those used in DTV and cable distribution, pack a substantial number of Mbps per MHz of bandwidth. As an example, 8-VSB modulation (vestigal sideband, the standard for U.S. terrestrial DTV) is capable of delivering HDTV's maximum 19.3-Mbps data rate within a 6-MHz TV channel. Cable TV operators using 64-QAM (quadrature amplitude modulation) can pack 27 Mbps into 6 MHz of coax bandwidth; 256-QAM delivers 38.8 Mbps.

Theoretically, the SNR of an ideal n-bit analog-to-digital or digital-to-analog conversion process is given by the formula: SNR = 6.02n + 1.76 dB (Watkinson, The Art of Digital Video, 2nd Edition, p. 138.) For DV, which records 8-bit samples, the theoretical maximum SNR is 49.92 dB, about the same as for Betacam SP's luma SNR. Simple analog-to-digital conversion, however, results in unacceptable contouring. Contouring causes distinct bands of brightness in images when recording a uniformly-reflective surface with slightly-varying lighting intensity or insufficient lens coverage. Adding Gaussian dither to eliminate contouring results in the formula: SNR = 6.02(n – 1) + 1.76 dB (ibid., p. 147), which gives an 8-bit SNR of 43.9 dB. The assumption is that oversampling (or other magic) on both the analog input (ADC) and output (DAC) is responsible for the 55 dB SNR quoted by Sony and Panasonic for their YCrCb -> DV -> to YCrCb tests. The more important SNR value, quoted only by Panasonic, is for DV -> YCrCb, because recording to an analog component format (usually Betacam SP) is necessary until DV, DVCPRO, or DVCAM gear becomes commonplace in broadcast stations and postproduction facilities.

SNR is not applicable in digital dubs between any of the three formats, assuming the digital data format doesn't change during the process. Using DV-over-IEEE-1394 (FireWire) to make dubs creates a clone of the original DV footage. (The audio data, however, may be bumped from 12-bit to 16-bit or from unlocked to locked in the process.) Similarly, transferring DV footage to a fixed disk using an IEEE-1394 adapter card and then recording back to DV results in a clone of the original content. No decompression or recompression of the DV data is involved in the tape -> disk -> tape process. Neither is SNR a factor in adding transitions, titles, or graphics. Conversion of 24-bit RGB graphics to the DV format, and vice versa, is a mathematical manipulation with a predefined and predictable outcome.

4:1:1 and Chroma Key Uncertainty

Assuming one accepts the preceding argument regarding the equivalent chroma bandwidth of 4:1:1 DV and Betacam SP, the issue of spatial uncertainty of chroma-keyed graphics remains. Casual comparison of the 4:1:1 and 4:2:2 diagrams of figure 1 would lead one to conclude that the horizontal uncertainty of chroma-keyed RGB graphics underlaid on 4:1:1 YCrCb video (+/- 2 pixels) would be twice that of D-1's 4:2:2 format (+/- 1 pixel). This is not necessarily the case because of the definition of the luma component by ITU-R BT.601 as follows: Y = 0.299 R + 0.587 G + 0.114B. The RGB components of the luma signal are weighted to correspond to the sensitivity of human vision to each color. If green is used as the key color the higher luma sampling rate reduces the uncertainty (roughly) as follows:

For 4:2:2: 0.299 * 2 + 0.587 * 4 + 0.114 * 2 = 0.598 + 1.174 + 0.228 = 2.000

For 4:1:1: 0.299 * 1 + 0.587 * 4 + 0.114 * 1 = 0.299 + 1.174 + 0.587 = 1.587

A more accurate comparison requires a multi-dimensional matrix transformation, which is beyond the scope of this paper. However, it's reasonably safe to say that the spatial uncertainty when using green as the key color is only about 30 percent greater (2.000/1.587 = 1.3) than that of 4:2:2 D-1 and, based on comparative chroma bandwidth, is likely to be about the same as that for Betacam SP. Chroma green paint, paper, and cloth is as readily available from video accessory vendors as is chroma blue.

Note: The Panasonic hardware DV codec chipset interpolates chroma samples to deliver a simulated 4:2:2 SDI (SMPTE 259M) serial output, which also is likely to minimize chroma-key spatial uncertainty.

    Compression and Other Artifacts

    Up to this point, the comparison of DV and component analog formats has disregarded the issue of DV's 5:1 DCT-based video compression and the issue of compression-based artifacts. The first issue to lay to rest is the often-reported but seldom-demonstrated "stair-step" artifact shown in figure 2.



    Figure 2. 200 by 300 pixel parts of still captures showing luma undershoot (top) and stairstep artifacts (bottom) created by an early Sony DCR-VX1000 during a moderate-speed zoom-out with autofocus off. Note: Compressed to JPEG, Maximum quality.

    The dramatic artifacts shown in Fgure 2 occur only occasionally and usually happen with well-lighted, high-contrast image elements having an angle of 15 degrees or less to the horizontal. Relatively uniform motion, such as zooming in or out, increases the probability of occurrence. (The lower image was captured while zooming out from the upper image.) Repeated attempts to reproduce this atrocious artifact have been unsuccessful. Stairstep and other gross artifacts are defects in codec implementation or other camcorder circuitry problems, not in the design of the DV codec or in the DCT compression process. Luma undershoot (false black horizontal lines above the upper edge of the white paper) is an analog problem, not a digital artifact.

    More subtle compression artifacts appear in high-frequency image content, such as the previously-mentioned leaves on background trees and bushes, as well as blades of grass, especially when placed in motion by wind. DCT quantization discards very high-frequency information, which the untrained eye sees only with difficulty, resulting in "blockiness" (visually evident 8 x 8-pixel DCT blocks.) The perceived blockiness is due to DV's higher luma bandwidth; higher frequency components suffer less attenuation in DV than by the high-frequency rolloff of the luma component of Betacam SP, which also softens high-frequency image components. DV artifacting is more evident to a trained observer and appears to be ameliorated (but not totally eliminated) when recorded Betacam SP with a Sony UVW-1800.

    To achieve 5:1 compression and a constant 25-Mbps data rate, DV uses adaptive quantization during the recording process. Adaptive quantization chooses the appropriate DCT quantization table for each frame of the sequence to maintain the constant data rate. Figure 3 demonstrates that "Ishmael" on the face-out book cover (about 6 meters from the camera lens) and "Abiding ..." on the poster (despite the reflection) are clearly readable. The almost-readable text (approximately 48-point script at a distance of roughly 5.5 meters) at the bottom of the poster is "Gretchen Lemke-Santangelo," the name of the author of Abiding Courage. The clarity of the image is remarkable, considering that the Sony DCR-VX1000 uses the not-so-remarkable plastic lens and CCDs from the venerable Sony CCD-VX3.

    Figure 3. 320 x 260-pixel portion of a captured DCR-VX1000 image showing
    high-frequency detail in poster text and book covers.
    Note: Compressed to JPEG, Maximum quality.

    Note: A series of preliminary indoor tests were made with the DCR-VX1000 camera section by simultaneously capturing to disk with an M-JPEG codec (Interactive Images Plum card, C-Cube codec chip) at DVs fixed data rate of 3.6 MBps and recording to DV tape. A large four-color half-tone image from a newspaper front page (65-line screen) generates the high-frequency content. A subjective comparison of motion and still images indicates that the DV codec has somewhat better high-frequency response than the M-JPEG codec with similar data rates. Further tests with the Plum M-JPEG codec at higher data rates (up to 5 or 6 MBps) are required to verify this conclusion.

      How Much Compression is Too Much?

      Many detractors of the DV format arbitrarily categorize 5:1 compression as "excessive" for broadcast and corporate video production. Yet Sony is proposing a format with a higher compression rate for high-end video production. Betacam SX is a proprietary MPEG-2 derivative format (4:2:2 Profile@Main Level) proposed by Sony to ultimately replace Betacam SP. Betacam SX competes directly against Panasonic's DVCPRO product line, which has been adopted by a number of networks (most recently CBS) and local broadcast stations for ENG applications. Betacam SX uses adaptive quantization, and MPEG-2 I (inter) and B (bidirectional) frames (I-B-I-B-I..., GOP = 2) to achieve a constant 18 Mbps data rate with 4:2:2 chroma sampling. Sony appears to have picked 18 Mbps to allow uplinking of two simultaneous data streams (or a single stream at twice the data rate) to a single satellite transponder. A simple bit-rate ratio (5 * 25 / 18) results in a 7:1 compression figure. Use of 4:2:2 sampling, however, increases the precompression data rate by a factor of 33% (16 bits/sample versus 12 bits/sample), giving a 9.25:1 compression ratio. Betacam SX gains some compression credit for the use of B-frames and compressing the audio stream, but it's highly likely that the effective spatial compression of Betacam SX is greater than 5:1. Sony Europe claims on its Betacam SX Web site that DV is "designed for the professional corporate market where broadcast quality is not a prerequisite." If the DV format doesn't meet "broadcast quality" standards, it's highly unlikely that CBS would adopt DVCPRO for network news production.

      JVC's Digital-S and Panasonic's forthcoming DVCPRO-50 use 3.3:1 DCT compression and 4:2:2 chroma sampling, requiring a data rate of 50 Mbps. These formats are intended to compete with Sony's Digital Betacam product line, which uses 2:1 DCT compression, and Sony's BVW and PVW Betacam SP offerings. Digital-S gear is priced in the Betacam SP range. Panasonic hasn't announced firm prices for DVCPRO-50 equipment, but it's a reasonable assumption that pricing for DVCPRO-50 and Digital-S products will be similar.

      Overcoming Silk Purse Syndrome

      Many comparisons of DV versus Betacam SP as an acquisition format are flawed as a result of comparing camera sections rather than recording formats. Comparing source footage shot by a $3,500 Sony DCR-VX1000 and a $21,000 DXC-637L/PVV-3 with a $15,000 Fujinon lens is a futile exercise. There is no known substitute for good glass and 2/3-inch CCDs. Fortunately, dockable DV-format recorders are available for Sony (DVCAM) and JVC (mini-DV) cameras. Side-by-side shooting with the same camera or, better yet, simultaneous recording to a DV-format dockable and a portable Betacam SP deck is the only technique that permits objective comparison of these recording formats.

      Video acquisition and editing inevitably involves cost-performance compromises. Camcorders with high-quality lenses and 2/3-inch CCDs range in price from about $20,000 to more than $70,000. The most popular DV-format camcorders are an order of magnitude less expensive. D-1 VTRs offer pristine quality, but cost $150,000 or so; Digital Betacam decks, with 2:1 DCT compression, have list prices in the $50,000 range, as do the BVW series of Betacam SP recorders. The popular UVW-1800 Betacam SP editing recorder has a MSRP of $10,350. Sony's DHR-1000 consumer DV and DSR-30 DVCAM decks have list prices in the $4,000 range and a street prices close to $3,500. Evaporated-metal DV and metal-particle DVCPRP tape stock is considerably less expensive than Betacam SP's metal particle tape.

      The DV format offers a virtually unbeatable performance/cost ratio. Mass-production economics of scale for consumer DV camcorder and DVTR record heads and drive mechanisms result in lower-cost professional products that use common components. An example is the almost-identical pricing of Sony's consumer DHR-1000 and DVCAM DSR-30 DVTRs. Today's administered pricing and artificial scarcity of professional DV equipment ultimately will succumb to competitive pressures as more Japanese and, especially, Korean manufacturers climb on the DV bandwagon.

      Note: Sony can take advantage of common components for DV and DVCAM products because the difference between DV and DVCAM only is track pitch (10 microns version 15 microns.) Sony claims that 15-micron track pitch is required for editing applications, but it's more likely that the motive for wider pitch is to minimize interchange problems. Panasonic doesn't gain the same commonality benefits between its high-volume DV consumer products and low-volume DVCPRO gear because DVCPRO uses metal-particle tape and records audio cue and LTC on the DV format's two optional linear tracks. DVCPRO decks, on the other hand, offer the advantage of being able to play back DV, DVCAM, and DVCPRO tapes.

      IEEE-1394 (FireWire) digital I/O gives a dramatic boost to DV's performance/cost ratio for non-linear editing. FireWire I/O eliminates the analog-to-digital conversion (ADC) process of M-JPEG video capture cards and doesn't require ADC for audio inputs. The $999 DPS Spark ($699 without the full version of Premiere 4.2) is priced competitively with low-end M-JPEG capture cards. The Spark approach creates a hard-drive clone of the DV source data with headers added to accommodate Video for Windows' AVI file format. (Codecs aren't involved in DV tape-to-disk or disk-to-tape transfers.) Adaptec proved at NAB '97 that its software DV codec running on a high-end dual Pentium Pro PC outperforms by a substantial margin the Sony DVBK-1 hardware codec (used by FAST's DVMaster IEEE-1394 adapter) for adding transitions, special effects, and titles. When Adaptec optimizes their DV codec for ActiveMovie 2.0 and MMX, it's likely that the performance advantage of the Adaptec DVSoft codec over the DVBK-1 will increase. The constant 3.6 MBps data rate of DV data is well within the capabilities of the most common Ultra-Wide SCSI disk drives, such as Seagate Barracudas. Recent tests demonstrate that FireWire capture to low-cost 4G and 6G Ultra-DMA (a.k.a. Ultra-ATA) EIDE drives is practical; chances are that Ultra-DMA playback will become feasible shortly but may require a specific system board/drive combination.

      Today, virtually all broadcast stations, advertising agencies, and tape duplicators houses accept Betacam SP tapes. Few broadcasters and even fewer agencies and duplicators currently have DV, DVCPRO, or DVCAM gear in house. Duplication firms are likely to be the first to make the investment to handle DV, DVCAM, and DVCPRO masters, eliminating dubs to Betacam SP for high-quality VHS dupes. As the DV format gains ground, local TV broadcasters will be the next to take DVCPRO and DVCAM source footage from news stringers and independent producers. Agencies, if history is a guide, will resist DV submissions to the bitter end, just as they resisted the move from 1-inch Type C to Betacam SP. Cash-starved cable MSOs and independents, striving to eke the remaining hours from their aging U-matic players, aren't likely to be able to handle DV footage until the turn of the century (or later), so don't throw away your 3/4-inch decks.

      Sony saw the handwriting on the wall: Compressed digital video is a clear threat to the Betacam SP cash cow. Sony's response to the pending demise of new orders for Betacam SP gear is a proprietary, highly-compressed MPEG-2 format and SDDI (a SMPTE 259-M variant) I/O. But Sony's also hedging its Betacam SX bet with DV-compliant DVCAM and industry-standard IEEE-1394 I/O. The most likely scenario is that Betacam SX quickly will join the ranks of niche formats, such as M-II and D-5, and that the majority of new purchases will go to the three DV-compatible formats. Time will tell.

      Acknowledgments: Thanks to Tony Sutorius of The UnReal Film Company, Wellington, NZ, for correcting the 4:2:0 diagram of figure 1. FireWire is a trademark of Apple Computer, Inc. Betacam, Betacam SP, and Betacam SX are trademarks of Sony Corp. 24-bit color screen captures were made with a DPS Spark DV2000 (Adaptec AHA-8940) FireWire adapter card.