SlideShare ist ein Scribd-Unternehmen logo
1 von 50
Downloaden Sie, um offline zu lesen
AN ANALYSIS OF THE
ADVANTAGES, DISADVANTAGES, AND STANDARDIZATION
OF
DIGITAL AUDIO
for
Professor Eugene O. Young
Technical writing Instructor
Texas A&M University
College Station, Texas
by
Paul Robert Teich
Student
Computing Science
April 26, 1982
i
LETTER OF TRANSMITTAL
406 C Manuel Dr.
College Station, Texas 77840
April 26, 1982
Dr. Eugene O. Young
217-D Academic and Agency Building
Texas A&M University
College Station, Texas 77843
Dear Dr. Young:
I have completed the major report for English 301 and am submitting it for
grading. The title of my report is: “An Analysis of the Advantages,
Disadvantages, and Standardization of Digital Audio.” I am a computing science
major, and my hobby is high fidelity sound reproduction. The research I did for
this paper laid the groundwork for my research project this summer, which will be
on replicating human speech inflections and accent with a digital speech
synthesizer.
The field of digital audio is very young, even when compared with the age
of digital computer technology. I believe, though, that in the next decade,
digital audio will take over a major portion of the consumer audio market. This
report is written to audiophiles who have heard about digital audio products, but
have had no exposure to literature on them. There are no books devoted to digital
audio and most articles are very technical, requiring quite a bit of knowledge
about digital computers. To get a fairly extensive knowledge of digital audio
requires days of looking for sources and reading. Some of the information was not
directly available at all, but was the result of written inquiries to various
manufacturers. I have not yet found a more comprehensive report on digital audio
than mine.
Thank you for your advice on using note cards and on revision. This is the
largest paper I have ever completed, and I am going to include it, along with my
summer research on my resume. If you have any questions about my report, please
call me at 693-5560.
Sincerely,
Paul R. Teich
ii
Table of Contents
LETTER OF TRANSMITTAL...................................................... i
LIST OF ILLUSTRATIONS..................................................... iv
LIST OF TABLES............................................................ iv
ABSTRACT .................................................................. v
INTRODUCTION............................................................... 1
THE DIGITAL AUDIO PROCESS.................................................. 2
Digital Audio Recording .................................................. 2
Filtering of analog input signal ....................................... 2
Analog-to-digital conversion of signal ................................. 3
Digital Audio Playback ................................................... 6
Pickup of digital information from storage media........................ 6
Digital-to-analog conversion of binary code ............................ 6
Filtering of output signal ............................................. 7
Summary.................................................................. 7
DISADVANTAGES OF THE DIGITAL AUDIO PROCESS ................................. 7
Aliasing Distortion ...................................................... 8
Quantization Noise ...................................................... 10
ADVANTAGES OF THE DIGITAL AUDIO PROCESS ................................... 13
Specifications .......................................................... 13
Frequency response .................................................... 13
Channel separation .................................................... 13
Total harmonic distortion ............................................. 14
Wow and flutter ....................................................... 14
Signal-to-noise ratio ................................................. 14
Dynamic range ......................................................... 14
Error Detection ......................................................... 16
Parity ................................................................ 16
Cyclic redundancy check code (CREE) ................................... 17
Error Concealment ....................................................... 17
Error Correction ........................................................ 18
Other Advantages ........................................................ 20
iii
PROBLEMS WITH STANDARDIZING THE CONSUMER DIGITAL AUDIO FIELD .............. 21
Recording Formats ....................................................... 21
Sampling rate ......................................................... 22
Quantizing bits ....................................................... 24
Non-audio information ................................................. 26
Total format .......................................................... 28
Storage/Playback Systems ................................................ 28
Tape .................................................................. 29
Disc .................................................................. 29
SELECTED DIGITAL AUDIO SYSTEMS............................................ 32
Technics by Matsushita .................................................. 32
JVC..................................................................... 33
Sony.................................................................... 34
PCM-F1 ................................................................ 34
DAD-1X ................................................................ 34
FUTURE OF DIGITAL AUDIO................................................... 35
Digital Audio Discs ..................................................... 35
Advantages ............................................................ 35
Disadvantages ......................................................... 37
Support ............................................................... 37
Digital Audio Today ..................................................... 38
CONCLUSION................................................................ 39
APPENDIX ................................................................. 41
GLOSSARY OF TERMINOLOGY ................................................. 41
BIBLIOGRAPHY ............................................................ 43
iv
LIST OF ILLUSTRATIONS
Figure 1 Typical analog music input ........................................ 3
Figure 2 Music signal being sampled ........................................ 4
Figure 3 Square waveform of sampled voltage levels ......................... 5
Figure 4 Aliasing Distortion............................................... 8
Figure 5 High Frequency Sampling........................................... 9
Figure 6 Dynamic range of magnetic recording tape ......................... 15
Figure 7 Sony Crossword code error correction method ...................... 19
Figure 8 Interfacing between 16-bit and 14-bit machines ................... 25
Figure 9 The 3M format.................................................... 28
Figure 10 Laser scanning.................................................. 31
Figure 11 Block diagram of a digital tape machine ......................... 36
LIST OF TABLES
Table 1 A Sampling of the Current Digital Audio Market.................. 11
Table 2 Groups of Manufacturers with Compatible Formats................. 21
Table 3 Preferred Sampling Rates ....................................... 23
Table 4 Number of Quantizing Bits Used in Existing and Proposed Systems . 26
Table 5 Error Correction Methods ....................................... 27
Table 6 Storage/Playback Systems: Comparison of Characteristics......... 32
v
ABSTRACT
This report presents the audiophile with a comprehensive view of the
digital audio field, from a description of how digital audio works to current
controversies in the field. The process of digital audio is described at a level
that requires no prior understanding of digital computers. The advantages and
disadvantages of the process are discussed, along with possible solutions and
alternatives. The problem of standardization within the digital audio field is
examined, and possible solutions are suggested. Finally, the future of digital
audio and its possible repercussions in the audio field are presented.
1
INTRODUCTION
The “Computer Revolution,” started in the mid-1960s, has been gradually
edging into the audio field for years. It started with digital FM tuning
circuits and electronically controlled turntable motors, which used quartz phase-
locked loop (PLL) circuits. In the last three or four years, microprocessor
control has been an integral part of tape decks, pre-amplifiers, and such; and it
recently has been added to equalizers and remote control devices. However, the
nature of recorded sound had not changed. Microprocessors could help get the best
possible analog recordings, but could not improve on the actual method of
recording.
Analog recording, since its invention by Thomas Edison, translates music to
an electronic equivalent. The amplitude of a voltage or current is made
proportional to the loudness of the music, and the frequency matches that of the
music on a one-to-one ratio (4:225). Digital recording takes an analog music
signal and converts it into a computer readable code. The advantage of the
digital process is that the music can be measured for computer coding with much
greater precision than the magnetic field strength of a tape can be measured
(1:132). This means that the computer code can be recorded on tape or disk; and,
when it is played back, the music is not dependent on the physical
characteristics of the tape for its reproduction.
This report will present a detailed description of the digital audio
process, as well as its advantages and disadvantages. The problem of
standardization in the field of consumer digital audio equipment will be
discussed, and some sample systems will be examined. This report assumes that
the reader is well acquainted with most aspects of high fidelity sound systems,
including specifications, measurements and technical terminology. Some working
definitions will be included in the Appendix at the end of the report.
2
Most of the Sources for this paper are from the Journal of the Audio
Engineering Society and Stereo Review Magazine, due to the fact that digital
audio is a very young field with no books published as yet. The rest of the
references are from other trade journals, informational brochures, and a passing
reference to digital audio as a future development in a recent book. Few sources
are over five pages long.
THE DIGITAL AUDIO PROCESS
Digital audio can be divided into two separate processes: digital recording
and playback. The process of digital recording consists of: (1) Filtering of the
analog input signal, (2) analog-to-digital conversion of the signal, and (3)
transfer of the digital information to the recording media. The steps involved
in digital playback are: (1) Pickup of the digital information from the recording
media, (2) digital-to-analog conversion of the binary code, and (3) filtering of
the output signal.
Digital Audio Recording
Filtering of analog input signal. First, an analog input signal, usually
from a pre-amplifier (8:560), is routed through a very sharp-cutoff, high-
frequency-cut filter. This filter, called an anti-aliasing filter, removes all
frequencies above the highest frequency to be reproduced (20:64) (see Figure 1).
In most systems, this frequency is 20 kHz.
3
Figure 1 (a) Typical analog music input from a pre-amplifier, containing very
high frequency noise (20:63).
(b) Music signal after anti-alias filtering (20:64).
Analog-to-digital conversion of signal. The voltage of the filtered analog
signal is next measured at very small, constant intervals of time. The frequency
of this sampling is determined from a mathematical model, which states that the
signal must be sampled at least at twice the highest frequency to be reproduced.
Common sampling rates are between 44 kHz and 5l kHz (23:17). To measure the
voltage of the signal at each time interval, a "sample-and-hold” circuit is used.
This circuit samples the voltage of the signal at the beginning of a time
interval, and then holds its own voltage equal to the sampled value for the
4
duration of the interval (see Figure 2). It repeats this process for each
interval to be measured (20:64).
Figure 2 Music signal being sampled (20:64).
The output of the sample-and-hold circuit is a series of voltage levels in
the form of a square wave (see Figure 3a). The voltage levels are now
individually compared to a set of reference voltage levels and they are then
translated, using an analog-to-digital (A/D) converter, to binary numbers
(34:65). Binary is a number system that uses combinations of ones and zeroes,
called "bits,” to represent numbers. The output of the converter is a string of
bits, called “binary code”, which is an accurate representation of the sample-
and-hold circuit output (see Figure 3c). The process of changing the samples
from analog to binary form is known as "quantization" (2324).
5
Figure 3 (a) Square waveform of sampled voltage levels (20:64).
(b) Waveform after being rounded off to the reference levels (20:65).
(c) Binary code generated by analog-to-digital converter.
The last step in the analog-to-digital conversion sequence is the encoding
of the binary code. This procedure inserts timing and error correction
information, in binary form, into the code. It also can merge two separate
channels for stereo. The output of the encoding step is all of the digital
information necessary for the reconstruction of the analog input signal (1:132).
The final process is the actual recording of the digital information onto
tape. This is done by translating the bits of the binary code into
6
electromagnetic pulses, which are recorded on tape in the same way an analog
signal would be recorded. The tape may now be copied, transferred onto master
audio discs, or transferred to other storage media.
Digital Audio Playback
Pickup of digital information from storage media. First, the digital
information is translated from storage media back into the string of digital
information needed to reconstruct the original analog signal (20:65).
Digital-to-analog conversion of binary code. The first process that the
digital information encounters is the removal of the timing and error correction
information. The error correction information is used to restore the parts of
the binary code that were not picked up properly, due to flaws in the storage
media or pickup mechanism. The timing information is used to control how fast
the digital information is being picked up from the storage media. As the error
correcting is finished, the binary code is put into a waiting area, called a
buffer, for temporary storage. The amount of material the buffer can hold is
limited, however, so the timing information is used to control the pickup
mechanism. For example, if digital information is being picked up and filling
the buffer too fast or too slowly, the timing information corrects the speed
deviation (20:65).
As the binary code is put into the back of the buffer, the codes for the
individual voltage levels are taken from the front. Then, they are removed at a
rate equal to the original sampling frequency which is used in the recording
process. As they are removed, they are translated, using a digital-to-analog
(D/A) converter, back into their corresponding voltage levels. The output of the
digital-to-analog conversion is a series of voltages in the form of a square
waveform. This replicates the output of the sample-and-hold circuit, with the
7
small differences due to the rounding off process during quantization of the
original waveform (20:65).
Filtering of output signal. The final stage of this process is the
changing of the square waveform into a constantly varying analog signal. This is
accomplished by an "output-smoothing" filter, which, as the anti-aliasing filter
did, removes all frequencies above the highest frequency to be reproduced. The
effect is the smoothing of the square waveform into an almost exact replica of
the input to the digital audio recording process (20:65).
Summary
A music signal to be recorded first encounters the anti-aliasing filter.
Next, the signal is sampled, and the samples are rounded off. Then, the samples
are quantized, encoded and, finally, transferred to the recording media. To
restore the music to analog form, the digital information is picked up from the
recording. It is then decoded and sent to the digital-to-analog converter. The
signal from the D/A converter is sent through an output-smoothing filter, which
results in a replica of the original music signal.
DISADVANTAGES OF THE DIGITAL AUDIO PROCESS
There are three disadvantages associated with the process of digital audio.
Aliasing distortion and quantizing noise are the two main problems. Both result
in distortion in a digital audio system, and neither of these forms of distortion
are found in analog systems. While aliasing distortion is a correctable problem,
quantizing noise is rooted in the theoretical capabilities of digital audio
(20:65-6). The third disadvantage is the cost of a digital audio system. This is
a temporary condition, however, as technological advances are already lowering
the cost of digital audio equipment (14:57).
8
Aliasing Distortion
As described earlier, there is a mathematical relationship which states
that the sampling frequency of the music must be at least twice the highest
frequency to be sampled (see Figure 4a). This assures that both the positive and
the negative halves of even the highest frequency are sampled (see Figure 4b)
(16:57).
Figure 4 (a) Comparison of properly filtered system with improperly filtered
system (3:511).
(b) 20 kHz since wave (solid line) being sampled (dotted line) at 44
kHz (16:59).
9
If a frequency above half the sampling rate is sampled, it results in an
incorrect voltage for that sample. During playback, the incorrect voltage will
show up as a lower frequency distortion (see Figure 5). This distortion is
called "aliasing" distortion, because the low frequency signal generated is a
false representation, or "alias,” of the high frequency input (20:64). Aliasing
distortion products are not harmonically related to the music, and so they are
very noticeable (3:510). The effects of aliasing distortion also include the
downgrading of the signal-to-noise ratio of the recording (23:5).
Figure 5 (a) High frequency is sampled at low point.
(b) High frequency is sampled at high point.
(c) Filtered curve is sampled.
To minimize aliasing distortion, the sampling rate must be at least twice
the highest frequency sought. Equally as important is the filtering of the input
to remove any unsought higher frequencies. This is the purpose of the anti-
aliasing filter used in the digital recording process.
10
Quantization Noise
During the quantizing step of digital recording, the sampled voltage levels
are rounded off to the nearest reference level. The reference levels are a part
of the analog-to-digital converter, and are used in the same way that the
markings on a ruler are used to approximate a value that could have very many
decimal places (3 inches compared to 3.002 inches) (20:65). "There is always a
slight error between the original signal level and the quantized value.” (23:5).
These errors are heard as noise, and are called “quantizing noise.”
To minimize quantizing noise, more closely spaced reference levels must be
used. However, this introduces two other problems. First, the finer the spacing
between reference levels, the larger the binary number needed to represent the
levels; therefore, more space is used to record the number. The more space each
number takes up, the less music fits onto a tape or disc. The second problem is
that, as the spacing gets closer, it becomes harder for the analog-to-digital
converter to tell the difference between adjacent levels (20:65). This problem
is rooted in the present limits of integrated electronic circuitry technology.
To combat these problems, it is necessary to compromise accuracy for space and
electronic considerations. How much compromise depends on the whims of the
digital audio system designers. However, when better analog-to-digital
converters and faster, cheaper, computer memory become available, there will be
no need for a tradeoff.
There are two very audible subdivisions of quantizing noise. The first is
called “granulation noise.” As the signal gets weaker (near the bottom of the
system's dynamic range), the rounding off process alters a larger percent of the
input signal. This can add a harsh, gritty distortion to the music signal. "The
subjective effect of this noise is much more unpleasant than ordinary tape hiss.”
11
To correct granulation noise, many digital recorders add a low level hiss to the
music. This changes the granulation noise to a more benign form of hiss (16:58).
The other problem is that when a signal is recorded that is larger than the
highest reference level, it is rounded to the highest reference level. The
analog recording equivalent of this is clipping, but the effect is much more
noticeable in digital audio. The solution to digital clipping is, while
recording the music, to set the recording level a few decibels lower, although
this also decreases the dynamic range of the recording (16:58).
Cost
From the viewpoint of the audio consumer, the most serious disadvantage of
digital audio is the cost. Digital system components are very expensive, though,
as pointed out in The Complete Guide to Stereo Equipment, some digital components
are “less expensive than some of the esoteric models some folks favor” (4:184).
Table l gives a random sampling of the current digital audio market.
Table 1 (30, 33, 43) A Sampling of the Current Digital Audio Market
Model Type Cost
Sony PCM-F1 Portable VCR audio adaptor $2,500
Hitachi PCMV100 Audio/video recorder $1,900
Sanyo VCR audio adaptor $2,900
Sharp VCR audio adaptor $2,950
Technics SVP100 Audio recorder $3,000
Sony PCM10 VCR audio adaptor $3,500
Toshiba VCR audio adaptor $3,750
JVC VCR audio adaptor $7,500
12
There are several reasons why digital audio reproduction is so expensive,
all are by-products of digital computer technology. First, though not an
equipment cost, is the price of video cassette recorder (VCR) tape. VCR tape is
needed, because the frequency of the electronic pulse representing the binary
code is typically measured in megahertz (MHz). In the case of the Sony PCM-10,
the sampling frequency (44.056 kHz) is multiplied by 14 bits per sample and by
two channels (for stereo reproduction) to obtain 1.234 MHz with error correction
and synchronization, the resulting frequency may be as high as 2.643 MHz.
Standard audio cassette tapes cannot record such high frequencies, which is why
VCR tape is used (23:l0). Even when ordered from a warehouse supplier, VCR tape
prices range from $12.00 to above $17.00 per tape (12:2).
The reasons why digital audio equipment, itself, is so expensive deal with
current computer and integrated circuitry technology. To reduce distortion
caused by the elements in this circuitry (as happens in all audio systems), high-
speed, high-precision circuit elements are needed (23:5). In digital audio
systems, the sample-and-hold, analog-to-digital, synchronizing, and error
correction circuitry are all implemented using integrated circuitry (see
Appendix).
Integrated circuits of the complexity needed for audio applications are
very expensive to produce. Only the larger audio electronics corporations, such
as Sony, have the resources needed to design and produce their own integrated
circuitry (14:57). Other electronics companies, outside of the audio field, have
produced chips to sell to audio manufacturers. An example of an integrated
circuit produced for audio applications is the Burr-Brown Corporation's PCM75
analog-to-digital converter. There are several models, ranging from $198 to $249,
depending on the speed and distortion of each model (19:1).
13
ADVANTAGES OF THE DIGITAL AUDIO PROCESS
There are many more advantages to the digital audio process than
disadvantages. The specifications of a digital system's ability to reproduce
sound are much better than analog system specifications. Digital systems make
use of signal error detection, concealment and correction, while analog systems
do not. In addition, copying digital recordings does not degrade the sound.
There is no print-through on digital tapes; and on audio discs (analogous to
records), any wear that may occur will not affect the music.
Specifications
The specifications used to describe how well a digital audio system
reproduces music are the same as used in analog audio. The major categories are:
(1) frequency response, (2) channel separation, (3) total harmonic distortion,
(4) wow and flutter, (5) signal-to-noise ratio, and (6) dynamic range.
Frequency response. The frequency response of digital systems is limited
only in the high-frequency response. This is because of the anti-aliasing filter.
However, if the proper filter and sampling rate are used, the high-frequency
response can be extended beyond the range of human hearing. The frequency
response of a digital recording is also exceptionally flat, displaying none of
the dips, peaks and roll offs associated with analog readings. It is not
uncommon for digital recordings to display a frequency response of 0 - 20 kHz ± 1
dB (17:623).
Channel separation. When recording music digitally, the two (stereo)
channels are sampled separately. After they are converted to binary, they are
merged, giving two values (for left and right channels) for each time interval,
and then are encoded. Upon playback, the binary code is decoded, and the two
14
channels are separated and sent to separate buffers (10:65). This process
insures that the two channels do not intermix, and it gives digital audio a
channel separation specification equal to its dynamic range. For most digital
audio systems, this is equal to or greater than 90 dB (17:623). Crosstalk is
also eliminated.
Total harmonic distortion. In a digital system, harmonic distortions are
produced "only by the electronic circuits for coding and decoding” (28:520).
Given the state of modern digital technology, in most digital systems, the total
harmonic distortion can be kept equal to or below 0.05% (17:623).
Wow and flutter. During the digital-to-analog translation of digital
playback, the individual voltage levels are released from the buffer at
essentially unvarying time intervals. A crystal oscillator timing mechanism,
similar to those used in electronic watches and phase-locked turntables,
implements this (8:66). Wow and flutter is virtually eliminated from digital
systems by this process. If it does occur in greater than “unmeasurably small”
quantities, something is drastically wrong (20:65).
Signal-to-noise ratio. The signal-to-noise ratio (S/N ratio) of a system is
basically attributable to the mechanical limitations of the recording media. The
noise level in a digital recording, likened to tape hiss and surface noise in
analog recordings, is at much lower level and consists mainly of granulation
noise. There are no mechanical sources of noise in a digital system; so, for a
typical digital system, the S/N ratio should be equal to or greater than 90 dB
(again, approximating the value of the dynamic range).
Dynamic range. The most noticeable difference in the sound quality of
digital systems is in the dynamic range captured by digital recordings. In
analog recordings, very soft sounds are often buried below the noise level of the
storage media (tape hiss or record surface noise), while very loud sounds cannot
15
be recorded faithfully. Loud sounds may saturate the tape or cause the wiggles
in record grooves to overrun adjacent grooves (10:63). As a result, music to be
stored on records or tape must be "compressed”. Compression makes soft sounds
louder, and loud sounds softer. While this enables the entire musical selection
to fit onto the tape or record, "It detracts from the lifelike quality of musical
reproduction during playback” (4:225). The music on records and tapes usually
has a dynamic range of under 65 dB, compared to orchestra music, which has a
dynamic range of 85 to 95 dB (10:63) (see Figure 6).
Figure 6 Dynamic range of magnetic recording tape is approximately 60 dB, while
orchestra music has a range of 85 to 95 dB (10:63).
Because the electromagnetic pulses representing the binary code on the
magnetic tape (or the dots on the audio discs) are just a representation of the
sound, the level that they are recorded at is not important. On tape, a level
well above the residual tape hiss and well below the saturation level can be
chosen. The level of the pulses remains constant and, thus, eliminates the
disadvantages of recording analog music on tape (24:10).
16
The accuracy of digital audio techniques has been summed up by E. Brad
Meyer, of Point One Audio: "Digital sound will contain none of the frequency-
response errors, distortion, noise, flutter, and so on that are inherent in even
the best analog recording and reproducing systems" (16:56).
Error Detection
Even in the best digital systems, there will be mechanical causes for
dropouts or errors in reading the recorded music during playback (2315). These
situations, such as a scratch in a digital disc or a tape dropout, can be
disastrous in a digital system. With some types of encoding, a dropout causes a
burst of noise at the maximum possible output level on the machine. Clearly,
some form of error detection, common in computer software packages, is needed.
In fact, computer data error detection is entirely applicable to digital audio
(16:58). There are numerous error detection systems; however, only two shall be
mentioned here.
Parity. As stated earlier, binary code is a string of ones and zeroes
(bits). The binary code can be divided into groups of bits, which are called
"words,” of arbitrary length. Each word represents the level of one sampling of
the sound. All ones in a word then are counted. If the total is odd, a one is
inserted into the binary code immediately after the last bit in the word; if the
total is even, a zero is inserted in the same position. This last number is
called the parity bit. When the playback device reads the binary code from tape
or disc, it re-counts the ones in each word, comparing the results (odd or even)
to the parity bit shown. If these do not match, then an error has been detected.
Unfortunately, the parity procedure does not detect an even number of
errors within the same word, because such an error leaves the count equal to the
parity bit (even + even = even; odd + even = odd). This gives parity a 50%
17
chance of detecting an error. The single-parity-bit method is, at best, a
rudimentary means of error detection (2316).
Cyclic redundancy check code (CREE). CRCC is a method used by Sony
Corporation to provide error detection in their digital systems. It can use any
number of parity bits for each word. If the number of parity bits is "n”, then
the probability of an error being detected is 1 minus 2-n
. If "n" is 16 (as in
the Sony PCM-1600), the probability of error detection is 1 minus 2-16
= .999985
(99.9985%). This is an almost perfect error detection capability. (This
equation also describes one bit parity 1 minus 2-1
=1 minus .5 = .5). (23:6).
Error detection is only the first step in managing errors in digital
systems. Once an error has been detected, something must be done about it.
Error Concealment
To prevent the errors from affecting the quality of the sound, various
methods of concealing the error exist. There are three typical methods of error
concealment:
1. Muting: The word with the error is set to zero (muted) (23:7).
18
2. Previous word holding: The value of the word before the error is
substituted for the error (23:7).
3. Linear interpolation: The average of the two words immediately before
and after the error is substituted for the error. This is the most
accurate method of error concealment, but it assumes that the words on
either side of the error are not themselves erroneous (23:7).
Though error concealment is a great improvement over playing music with
errors in it, it is possible to go even further in retaining the quality of the
music.
Error Correction
“An error correction code not only detects the code errors but also
perfectly corrects them.” To do this, it is necessary to find out where and how
the errors have occurred. The code required to do this is very complex. To
explain how it works, a simplified description of Sony's Crossword Code is
examined in Figure 7 (23:7).
19
Figure 7 This is a simplified description of the Sony Crossword code error
correction method (23:7). To correct code errors, it is essential to
find out where and how they have occurred. This is done as follows:
(a) Data values are given for four samples of music.
(b) Numbers are inserted which, when added, total 20 (an arbitrarily
chosen number) both horizontally and vertically in each
row.
(c) The tape, itself, on which is recorded the original data values,
plus the additional numbers from each horizontal and vertical
column, respectively.
(d) Playback data values. (Note that10 is shown, rather than the 12
originally assigned in the first position).
(e) Both the top horizontal row and the left vertical row add up to
only 18, rather than the 20 originally specified. Therefore, l0
must be an incorrect value.
(f) The correct word is reproduced for playback.
NOTE: All words and correction codes are expressed in ordinary decimal
figures instead of binary codes to facilitate their
understanding.
20
Another Sony method, the cross-interleave rectification code (CIRC) can
fully correct code errors of up to 4000 bits (for a length of 2.5 mm on a digital
audio disc (27:66). This is remarkable, for a scratch that large on a
conventional record would not only destroy the musical passage, it would make the
record unplayable.
Other Advantages
There are several more advantages to digital audio; however, all of them
are by-products of the digital audio technology behind the specifications and
error-resolving techniques. Some of these are:
1. The elimination of print-through on the recordings. This is related to
the code being tape-recorded at any chosen level above the tape hiss
and below the saturation level of the tape (28:520).
2. The possibility of adding any number of alternate channels. Each
channel would still be totally separate from the others. This
suggests the possibility of bringing back quadrophonic sound or of
recording different instruments or vocals on different channels. A
user could turn the vocals off and sing along with no disturbance.
Similarly, a musician could blank out his particular instrument on the
recording and play along (29:512). This also raises the possibility
of recording comments, engineering notes (as to equalization,
microphone and speaker placement, etc.), and other non-music
information on a separate channel (29:512).
3. The possibility of mass producing tapes and audio discs with
definitely superior musical reproduction (as compared to records and
tapes currently on the market) (4:184) for a much lower cost than
today's recordings (16:56).
21
4. ". . . copies can be made from one digital machine to another without
the loss in quality that occurs with conventional recorders . . . an
exact duplicate of the signal that emerged from the recording console
at the original session" (16:58). This is because the binary code is
being recorded -- not the music.
PROBLEMS WITH STANDARDIZING THE CONSUMER DIGITAL AUDIO FIELD
Recording Formats
The standardization of recording formats is the most important issue
deciding the future of digital audio. Without standardization, the equipment
produced by one manufacturer will not directly be able to play the recordings
produced by another. The consumer would have to buy a format converter to do so;
and these, currently comprised mostly of integrated circuitry, would cost
hundreds of dollars. Digital audio is already a costly addition to a sound
system. Having to buy a converter for each format to be played would convince
most consumers that digital audio is not worth the cost. Table 2 shows which
manufacturers have equipment compatible with other manufacturers.
Table 2 (27) Groups of Manufacturers with Compatible Formats
1 2 3 4
U.S. Pioneer Philips Compact JVC Mitsubishi
Magnavox Built MCA Sony Compact
Philips
Sony Long Play
22
The recording format is a combination of sampling rate, quantization, and
non-audio information (error correction and synchronization). The only standards
set by industry for digital audio are from the electronics industry association
of Japan (EIAJ). The EIAJ standard STC-007, labeled "Consumer Use PCM Encoder-
Decoder”, sets three standards. First, it sets the number of bits used to
represent one sampling (the number of “quantizing bits”) to 14 bits. Second, it
sets the sampling rate to 44,056 Hz. Third, though it does not define what type
of error detection is required or what the decoding stage of the playback device
should do if it encounters an uncorrectable error, the standard requires that
some form of error correction be used (21:20). Though these are the first
industry standards in digital audio, they apply only to consumer equipment (not
professional equipment), and they have little influence over U.S. or other
overseas manufacturers. Without standardization of recording formats, it is
difficult (at best) to transfer a recording from one format to another (28:520).
Each part of the digital format will be examined here to provide the reader with
an idea of the scope of the standardization problem.
Sampling rate. The sampling rate has been the most widely differing factor
of the digital industry. Because most high-fidelity applications require at
least a frequency of 20 kHz, most of the sampling frequencies are above 40 kHz.
However, because the European Broadcast Union (EBU) and satellite communications
networks operate at 32 kHz, some manufacturers are considering this or some other
compatible frequency (17:621). Table 3 shows a list of audio industry leaders
and their preferred sampling rates.
23
Table 3 (3, 9, 17) Preferred Sampling Rates
Organization Sample Rate Remarks
Ampex 50 kHz
Absolute synchronization with
standard video film frame rates
Audio Engineering
Society (AES)
50.4 kHz High quality studio work
44.1 kHz Other studio applications
32 kHz Broadcasting applications
European Broadcasting
Union (EBU)
48 kHz Easy conversion to 32 kHz
SMPTE Video Study
Group
60 kHz
Integer samples of audio data for
frame
JVC 47.25 kHz Audio High Density (AHD) Systems
Sony/Philips 44.1 kHz Compact Audio Disc (CAD) Systems
Telefunken 48.0 kHz Mini Disk (MD) Systems
The hardest part of converting music from one digital format to another is
taking "x" samples per second and converting them to "y" samples per second. “If
system A and system B are to be directly interfaced digitally, they must both use
the same sampling rates" (3:509). If they don't have the same sampling rates,
then there must be some simple ratio between the two rates, so a digital rate
changer can be used. However, if there is no simple ratio between the rates,
then the recording must be converted back to analog, and then sampled at the new
rate (3:510). In addition, "if the conversion is from a high sampling frequency
to a lower one, a digital anti-aliasing filter must be used to take care of this
new, lower, sampling rate.” That is, the higher frequencies that can be recorded
with a higher sampling rate may be above the anti-aliasing filter's cutoff for
the lower sampling rate, so the filtering must be done digitally (17:620). Every
24
time a switch is made from one sampling frequency to another, it is accompanied
by a 4 to 6 dB loss in the S/N ratio (17:620).
While differences in sampling rates are having little impact on the
consumer digital recording market, they will have a profound effect on the pre-
recorded music market, creating a need for expensive translating devices (since
all of their circuitry is integrated). While the STC-007 sampling standard is
needed, it is not compatible with other rates. A sampling rate of 50 kHz would
satisfy all high fidelity needs, as well as mesh with standard video film frame
rates.
Quantizing bits. Digressing for a moment, a bit can be either a zero or a
one, as explained previously. Inside a computer, bits form numbers of base two
(binary). In the human number system of base l0 (decimal), each digit has the
possibility of being one of ten numbers (0 through 9). A decimal number of "n"
digits can represent any one of l0n numbers. (If "n" equals 3, then 1000 (103
)
can be represented by three digits -- 0 through 999). In binary, each digit has
two possibilities, so a binary number of "n" bits could represent any one of Zn
numbers. For a binary number of 14 bits, there would be 16,384 possible values;
and for 16 bits, there would be 65,536 possible values. If a group of 14 to 16
bits is taken as one sample of a music signal, called a word, then the possible
values of the binary number represented by the word could be equated to a set of
reference voltage levels. This is the analog-to-digital converter's task in a
digital recording system and is known as "linear quantization”.
The number of bits in a word determines the dynamic range of a digital
recording. The dynamic range equals roughly six times the number of bits, giving
a theoretical upper limit of about 84 dB dynamic range for 14-bit words and 96 dB
dynamic range for 16-bit words (10:64). Other kinds of quantization are
available, but none are as easy to implement. A type of quantization called
25
“floating point quantization" can be used to make words smaller, but it decreases
a system's S/N ratio (32508). All types of quantization use binary.
To convert from a 14-bit word to a 16-bit word, two zeroes are added to the
end of the 14-bit number. They are in the least significant places, because that
is where the difference in accuracy is. To convert from 16 bits to 14, the last
two bits are ignored, because they affect the music the least (see Figure 8)
(3:509). No dynamic range is gained from converting 14 bits to 16; however, some
dynamic range is lost converting from 16 to 14 bits.
Figure 8 Interfacing between 16-bit and 14-bit machines (7:62).
Top: Missing bits seen as zeroes.
Bottom: Two extra bits ignored.
26
The STC-007 standard calls for 14 bit linear encoding to be used in all
digital audio consumer use applications (21:20). However, most of the systems on
the market now and planned for the immediate future, use the more accurate 16-bit
words. The word size recommended by Sony and Studer is 16 bits (9:624); and the
Audio Engineering Society recently recommended 20 to 24 bits (17:621), although
the technology available for 20 or more bits is not currently feasible. For the
immediate future, 16 bits, with its 90 dB of dynamic range, is the best choice
and the one more companies are going to in their products (see Table 4).
Table 4 (8, 11, 17, 19, 25, 26) Number of Quantizing Bits Used in Existing
and Proposed Systems
Manufacturer Model Quantizing Bits Dynamic Range
*Technics SV-P100 14 84.3
*Sony PCM-F1 14/16
(switchable)
86/90
Sony DAD-1X 16 >95
Sony/Philips CAD 16 90
JVC AHD 16 90
Hitachi PCMVl00 14 85
Burr-Brown PCM75 (A/D chip) 16 90
*On the market now.
Non-audio information. The information encoded into the string of binary
code after the analog-to-digital conversion is the non-audio information. It
includes the error correction and the synchronization coder. The EIAJ STC-007
standards "provide for various levels of error correction and detection, with
27
each manufacturer able to decide and select the degree and sophistication of the
error collection used" (10:87). The standard, however, really doesn't
standardize anything; and it does not define what is to happen when an
uncorrectable error is encountered (81:20). Table 5 gives a survey of the
different error correction methods used by manufacturers in their various models.
Table 5 (23) Error Correction Methods
Manufacturer Error Correction Methods
Ampex Interleave (IL), parity word (PN), erasure
(E)
Columbia Parity (P), duplicated recording (DP)
Hitachi P, Interpolation (IP)
Matsushita P, DP
Mitsubishi Adjacent code (AC), cyclic redundancy check
code (CRCC)
Sony DP, IL, crossword code (CC), AC, modified CC,
E, P, IP
Soundstream CRCC
3M, BBC IL, CRCC, PW, E
Teac DP, IL, P
Toshiba P, previous word holding
Each manufacturer has its own synchronization, but synchronization
information, as with error correction information, is relatively easy to; remove
from the binary code and is really not a large concern in standardization. The
binary code can be run through a short computer program that strips both of these
from the words containing the audio samples.
28
Total format. The last item in format standardization is that each
manufacturer takes all of the audio and non-audio information and puts it
together in the binary code string as he sees fit. As an example, Figure 9 shows
the 3M digital format. Conversion from one format to another requires stripping
the non-audio information -- an easy job -- and then matching the number of
quantizing bits and the sampling rates.
Figure 9 The 3M format. 3M uses T6 bit linear quantizing, 50 kHz samp1ing, and
20 kHz anti-aliasing and smoother filters. Data is organized into
frames of 400 bits. There are, arranged as shown above:
16 16-bit data words (D)
8 16-bit parity words (P)
1 12-bit frame good/bad check word (CRC)
1 4-bit synchronization pulse (SYNC)
Storage/Playback Systems
In digital audio, as with analog audio, there are two general methods of
storing the music. The first is magnetic tape. As discussed earlier, the binary
bits are translated into electromagnetic pulses and recorded in the material of
the tape. The second is storing the music on a rotating disc. There are many
methods of implementing disc storage in a digital audio system, unlike analog
audio, in which records all operate on essentially the same format.
29
Tape. Standardization of digital tape recording formats is not very
important, due to the fact that tape recording systems are more commonly used for
home recording than for playing back pre-recorded material (4:226). This is
supported by the recent addition of digital tape recorders to several companies‘
lines of consumer products. (The recordings from each format are not compatible
with the other formats).
Disc. Unlike tape, a disc-- whether analog or digital -- cannot be
recorded on at home. Once a disc is mastered, nothing can be done to it to add
more music. Different disc formats mean that several discs and playback units
will not be compatible. Therefore, it is extremely important that storage and
playback equipment be standardized.
There are three classes of disc playback devices, each with its own type of
disc. They are classed as to how the information is transferred from disc. The
first type, and closest in principle of operation to the analog record uses
piezoelectric detection. The digital information is recorded on a polyvinyl
chloride (PVC) disc -- the same material used currently in analog records. The
pickup device looks like an analog cartridge. However, the stylus is blunter,
for better wear, and the cartridge contains a piezoelectric transducer instead of
a magnetic transducer. The digital information is recorded as very small ridges
in a groove, much like a conventional record. The ridges cause the stylus to
vibrate; and the piezoelectric transducer in the cartridge translates the
vibration to an electric signal, which can then be processed digitally. This
method, like in analog records, is susceptible to wear, both to the disc and to
the stylus (29:5l2).
The second class of digital playback devices uses electrostatic detection.
This type of device is a spin-off of computer data storage technology. The
digitized music is stored by inducing small amounts of static electricity into
30
the disc. The discs are made of a mixture of PVC and carbon. The carbon holds
the electrostatic charge. Two types of detection schemes are currently
implemented. In the first, the electric charges are aligned within a groove. A
blunt device rides in the groove, detecting the charges. 0f course, with the
mechanical contact, there is wear to the disc and to the stylus. In the second
type of detection, a detector rides over the surface of an ungrooved disc. The
information is still recorded in a line that spirals toward the center of the
disc but, when the disc is rotating, the detector is kept over the line by an
electromechanical tracking device. The detector never touches the surface of the
disc, so no wear is induced. More data can be stored on the second type, because
the static charges can be confined to a width smaller than any groove needed to
guide a pickup device. Computers use this method to access information from
their disc drives (29:512).
The third class of playback devices uses photoelectric detection.
Photoelectric detection essentially consists of shining the light of a small,
low-intensity laser onto the disc. The light from the laser is reflected from
pits in the disc. These reflections can be decoded into the digital information
necessary to rebuild the music. There are two methods of storing the information
on a "laser disc.” The first method, now is use for video laser disc
applications, uses small pits of varying depth etched into a PVC disc with a
high-intensity laser. The playback laser is reflected, with varying degrees of
intensity, from the pits. The digital information is decoded from the intensity
of the reflections. The second method uses a PVC disc coated with a thin layer
of metal. A high-intensity laser etches holes in the layer of metal. The holes
vary in length, not depth. When the disc is being read, a low-intensity laser
scans the disc, looking for the non-reflecting holes in it (24:12) (see Figure
10). The lengths of the non-reflecting portions can be translated directly into
31
bits. Both of the laser discs are coated in a clear plastic in order to prevent
damage to the stored information (29:512).
Figure 10 Laser scanning. The surface of the audio disc:
(a) Is scanned by a beam of laser light;
(b) Focused by a lens;
(c) And the focused beam of light
(d) Is reflected from the PVC surface and measured by sensors
(7:62).
There are no current industry standards, or even preferences, for anyone
storage/playback system. For a comparison of characteristics favored by industry
leaders, see Table 6.
32
Table 6 (5, 7, 11, 29) Storage/Playback Systems: Comparison of
Characteristics Favored by Industry Leaders
Item
AES Digital
Audio
Technical
Committee
Telefunken/
Teledec
Sony/Philips JVC
Channels 2/4 4 2 4
Revolutions /
Minute (R/M)
Not given 278-695
Not given in
R/M
900
Diameter 300 mm max. 135 mm 120 mm 260 mm
Playing time
80 minutes
(two sides)
120 minutes
(two sides)
60 minutes
(one side)
120 min
Type Not given Piezoelectric Photoelectric Electrostatic
SELECTED DIGITAL AUDIO SYSTEMS
The following systems represent equipment manufactured by, or to be
manufactured by, leading corporations in the digital audio field. Availability
of information was the deciding factor in choosing the systems to be reviewed.
No information is available on the organization of their digital formats.
Technics by Matsushita
The Technics SV-P100 Digital Cassette Recorder is the first piece of
digital equipment to appear on the consumer market. Introduced in early 1982, it
was the first to be reviewed by a major trade magazine (Stereo Review, March
1982). The cost of the SV-P100 is $3,000 (26:46). Craig Start, testing the
cassette deck for Stereo Review, commented, ". . . It was the cleanest sound I
have ever recorded, and I've recorded a lot. To me, that says a lot about the
SV-P100‘s capabilities” (26:46).
33
Technics records on standard video tapes, and its specifications are
(26:42):
Sample rate: 44,045 Hz
Quantization: 14 bit linear
Dynamic range: 84.3 dB
Distortion: <.01%
Wow and flutter: Unmeasurable
Signal—to-noise: >80.3 dB
Frequency response: 2 Hz — 20 kHz + 0/-2.5 dB
JVC
JVC's audio high density (AHD) digital audio disc system is not yet on the
market, but it is expected within two years. It uses a grooveless electro-
tracking capacitance pickup system (11:70). The specifications of the disc and
playback system are shown below. The JVC AHD system is unique among the
forthcoming audio disc systems in that it has the capacity to playback visual
information in the form of still pictures or slow motion. The disc can also be
played on JVC s video high density (VHD) video playback system, which is now on
the market. Some of the non-audio information will be used as a guide for a
music search capability, locating the beginning of each music track. The AHD
system provides four channels of information -- three for audio purposes, and one
for video. Two of the audio channels will be used for stereo, leaving one free
for special effects, album notes, engineering notes, or for any other as-yet-
unthought-of purposes (11:68)
JVC's ADH disc specifications are (11:70):
Disc size: 260 mm (10 inches)
Revolutions: 900 (R/M)
Playing time: 2 Hours (one per side)
Channels: 3 audio (2 stereo; 1 extra); 1 video
Quantization: 16 bit linear
Sample rate: 47.25 kHz
34
Sony
Sony Corporation, being one of the front-runners in the digital audio
"revolution" (to turn a phrase), has an entrant both in the tape recorder and the
audio disc categories. The PCM-F1 Portable Digital-Audio Adaptor was introduced
to the market in early 1982, while the Digital Audio Disc (DAD)-1X is awaiting
further standardizations in the digital audio field.
PCM-F1. The Sony PCM-F1 Portable Digital-Audio Adaptor turns any NTSC-
standard video recorder into a two channel digital audio recorder costs $1,900,
assuming it has a video recorder to plug into. If not, a video recorder will
cost $800 to $1,200 more (12:2). The PCM-F1 has a switchable quantizer for either
14 or 16 bit linear quantizing, so that the consumer can decide the accuracy he
needs, compared with how much music he wants to fit on a tape (25:13).
The Sony PCM-F1 Portable Digital-Audio Adaptor specifications are (25:13):
Quantization: 14 bit and 16 bit (both linear)
Dynamic range: 86 dB/14 bit and 90 dB/16 bit
Harmonic distortion: 0.01%/14 bit and 0.007%/16 bit
Sampling rate: 44.056 kHz
Frequency response: 10-20 kHz ± 0.5 dB
Wow and flutter: Unmeasurable
Weight: 8.75 lbs.
DAD-1X. The DAD-1X Long-Play Digital Audio Disc System uses a
photoelectric detection system on PVC discs coated with a reflective layer of
metal. The specifications for this system are as follows (8:977):
Disc size: 303 mm
Playing time: 2.5 hours (single side)
Revolutions: 450 R/M
Channels: 2
Sampling rate: 44.056 kHz
Quantization: 16 bit linear
Dynamic range: >95 dB
Harmonic distortion: 0.03%
Frequency response: 2-20 kHz ± .25 dB
Wow and flutter: Undetectable
Error detection: Cross interleave
35
Note that both of the tape decks reviewed (the only two currently on the
market for consumer use) abide by the EIAJ standards, having been manufactured in
Japan. Sony's digital disc system chose the more accurate 16-bit quantization,
being strictly a decoding system (only able to play the discs). For an
illustration of the working of digital tape decks, see Figure 11.
FUTURE OF DIGITAL AUDIO
Digital audio is still a young science. The first digital cassette
recorders have been on the market for only a few months (26:40), but their effect
has been enormous. The digital recorder is now the most accurate component of
consumer stereo systems -- so accurate that it has exposed handicaps in the
response of most amplifiers and speakers that were not previously a matter of
concern. The coming of the digital audio disc, delayed because of technological
and standardization problems, is heralded by many as the end of the analog
record. But, while the digital disc is superior to conventional records, some
argue that the manufacturers have too much money invested in the production of
the latter to make them obsolete overnight. Along with this argument, there is
the possibility that a selection of non-compatible digital disc systems might
turn the consumer away from digital audio.
Digital Audio Discs
Advantages. The digital audio disc is a vastly superior method of re-
producing sound than on conventional records. All of the shortcomings of
conventional turntables will be circumvented. There will be no inner-groove
distortion, tracking error, tone-arm resonances and record and stylus wear.
Digital discs will not suffer nearly as much as analog records from the effects
36
Figure11Blockdiagramofadigitaltapemachinehavinganaloginputsandoutputs(15:560).
37
of poor vinyl, bad pressings, dust or any of the other problems associated with
the manufacturing and use of the discs (16:56). The digital discs should last
indefinitely, as the only wear that they encounter is being taken out of their
cover to be put in the player. Another advantage is that digital discs may
become cheaper to produce than conventional records (4:184).
Disadvantages. When home digital disc playback systems appear on the
market, possibly in early 1983 (16:56), they will be handicapped in two areas.
One is that the record companies "have invested hundreds of millions of dollars
in conventional record-making equipment.” They will not render their equipment
obsolete without hard evidence that the digital discs will sell. Therefore, they
will not issue a large selection of the digital discs until they are sure that "a
large listening audience has the equipment to play them.” The second problem is
that "the consumer is not likely to buy a digital disc player until there is a
large selection of digital discs to play on it (4:184).” Therein lies the
possibility of a stalemate, as when quadraphonic records were introduced. The
quadraphonic records could be played on existing equipment, but few records were
marketed, so the public became disenchanted with "quad" sound (4:226). There is
little chance of this happening, however. Audio companies, such as Sony, have
spent millions of dollars on digital 14:54). The situation is more like that of
the audio manufacturers abandoning the 78 RPM record for the longer playing 45
and 33 RPM records. The better method -- in this case, the digital disc -- will
win out, but there will be a long transition period (4:184). Analog recordings
could survive up to twenty years in the presence of digital discs.
Support. At the l98l audio fair in Tokyo, about twenty companies dis-
played prototype compact digital-audio-disc (CDAD) players, although only a half-
dozen or so actually had working models (14:57). Toshiba, Denon and Yamaha have
38
adopted the joint Sony/Philips "compact disc" (CD) format (12:53), perhaps
signaling the beginning of informal standardization of the audio disc field.
Digital Audio Today
With digital recorders already on the market, audiophiles are re-examining
previously acceptable performance standards for other components in the chain of
audio reproduction. Because of digital audio‘s wide dynamic range, sharp peaks
(transients from percussion and other sources) to which analog systems cannot
react fast enough for them to record fully, can be recorded. These peaks,
requiring up to 100 times the average power of the music to reproduce, will
result in clipping by most amplifiers-and, sometimes, may even activate speaker
protection circuitry (22:63). Some of the newest amplifiers on the market can
handle these very sharp transients, but they are very expensive. However, safe
headroom is vital to digital recording. Instantaneous true peak reading meters
are necessary, therefore, on digital equipment in order to avoid amplifier
overload (2:244). But, though existing equipment may not be ready for digital
audio, Robert Berkovitz, director of research for Teledyne Acoustic Research
(manufacturers of the AR speaker series) said, "Most of the analog recordings
made during the past decade will be perfectly suited to reproduction on home
digital playback equipment (once they are converted to a digital format)"
(22.61).
Though digital accuracy is a fact, there has been concern by some hard core
audiophiles that digitally recorded music is unwholesome - that somewhere in the
process, detrimental factors creep into the music. At the 1980 Spring Audio
Engineering Society Convention in Los Angeles, Dr. John Diamond, a psychiatrist,
attempted to prove that digitally recorded music produces stress and fatigue in
listeners. Though his experiments were loosely controlled, if controlled at all,
39
they started some serious debate as to what kinds of affects digitizing adds to
the music, even if any changes imparted to the music have no audible effects on
the human ear (16:59). However, “A comparison test between all the major digital
recorders and a professional half-inch-tape, two-track analog machine running at
30 inches per second that was conducted at the Sound Emporium in Nashville last
summer confirmed that there do not seem to be any major degradations ascribable
to the digital process" (16:59).
CONCLUSION
Digital audio reproduction is the most significant development in the audio
field to occur in at least the last two decades. Toshit Doi, manager of Sony's
Digital Audio Project, says, "Before digital audio was introduced, the quality
bottleneck of the whole audio chain was in analog tape recorders and analog
records.” The weakest points are now the transducers (microphones and
loudspeakers) (22:63).
Remember, however, that “the accuracy of the digital process depends on the
sampling and quantization processes" (1:132). State-of-the-art integrated
circuitry can be improved upon, and advances in technology can make digital audio
even more accurate than it is now. If the field of digital audio is rigidly
standardized, the benefits of these technological advantages could be severely
delayed, and there would be no way to get more information out of earlier digital
recordings (16:59).
Of the possible technological advances, there is one that could make
digital tape recorders and audio discs obsolete before they have a chance to gain
a large acceptance. Static Systems Corporation, New York, has developed a system
called the Sonadisc. It is a microprocessor based audio player that uses a coin-
40
sized disc, containing magnetic bubble memory (a state-of-the-art computer memory
technology), to store music. The device has no moving parts, because all of the
music is accessed electronically from the bubble memory disc. The memory disc is
inserted into a receptacle in the player and currently stores only about a
second's worth of music (one megabit's worth). A radical advance in magnetic
bubble memory devices would definitely increase the amount of storage available
(6:6). If, and when, this happens, it will certainly be a more efficient way to
store music.
I believe that the EIAJ STC-007 standard is already obsolete. Sixteen- bit
systems, sampling at a higher rate than 41 kHz, have dramatically better
specifications. Manufacturers realize this, so when the digital audio disc
enters the market, it will not be in accordance with the EIAJ standards.
I favor using an industrial coalition -- the type Sony and Philips have
formed -- to informally standardize the field. This approach assures that
technological advances would reach the marketplace faster than under stringent
governmental industrial standards. I do believe that a standardization of
digital tape deck formats is necessary to accommodate music exchanges between
systems without the need for a converter. This will eventually happen; but, for
now, just having digital tape decks available for consumer use is a big enough
plus.
Digital audio recording and playback are the future of audio reproduction,
and the sooner they reach maturity, the sooner we can stop losing our heritage of
music to the degradations of analog storage media.
41
APPENDIX
GLOSSARY OF TERMINOLOGY
(Taken from The Complete Guide to Stereo Equipment)
Channel separation: Electrical or acoustical difference between left and right
channels in stereo.
Crosstalk: Signal leakage between two channels.
Decibel (dB): Numerical expression of acoustic or electrical ratios, such as
relative intensity of sound or relative strength of signal.
Distortion: Unwanted noise, or sounds that did not exist in studio when original
recording was made. All distortion is undesirable.
Dropout: Momentary loss of signal during tape recording or playback caused by
tape defects.
Dynamic range: Span in volume, expressed in decibels, between loudest and
softest sounds.
Filter: A circuit which attenuates signals above, below or at a particular
frequency.
Frequency: Rule of repetition, in cycles per second (cps), or musical pitch and
of electrical signals, expressed in Hertz (Hz). Low frequencies refer to
bass tones; high frequencies to treble tones.
Frequency range: Range from highest to lowest pitch sounds at usable volume
level.
Frequency response: Ability to reproduce given range of frequencies.
Harmonic distortion: Disturbs original relationship between naturally related
tones and is expressed in percentages.
Head room: Margin between maximum levels indicated on recorder's volume
indicator and actual level of severe tape overload.
Hertz: Name given to number of vibrations or cycles per second in electrical
signal of alternating current. Abbreviated Hz, with capital "H"; but when
spelled out, only lower-case letters are employed. The name derives from
Heinrich Hertz, early electrical scientist.
Integrated circuit (IC): Combination of transistors, diodes and resistors
assembled in pre-packaged circuit capable of providing high gain, low
distortion, and easily controllable performance in extremely miniaturized
form.
Kilohertz (kHz): 1,000 hertz.
42
Megahertz (MHZ): 1,000,000 hertz
Noise: Extraneous sound or signal that intrudes into original as result of
environmental noise, distortion, hum, or defective parts and tubes in
equipment.
Saturation: Condition whereby tape has reached its maximum degree of
magnetization.
Separation: Degree to which stereo signals are kept apart in passing through
reproducing systems.
Signal: Electrical replica of actual sound; in broadcasting, carrier wave
itself.
Signal-to-noise ratio: Often abbreviated “S/N ratio"; relative amount of signal
to undesired and extraneous noises in any device or its output.
Transducer: Device for changing one form of energy into another. Microphone and
phone pickup are sound-to-electrical energy transducers.
Transient: Heavy burst of music, causing the amplitude of the music signal to
increase sharply and then subside.
Wow and flutter: Fluctuations in speed of turntable or tape transport that cause
pitch distortions; wow refers to slow, repeated fluctuations; flutter to
short, rapid fluctuations.
43
BIBLIOGRAPHY
1. Bayliff, R. W. "Digital techniques and the gramophone record.” Gramophone.
vo1. 57, no. 673, June 1979, pp. 131-2, 135.
2. Burkowitz, P. K. "User‘s note on digital audio standard code and procedures,
operational requirements.” Journal of the Audio Engineering Society. Apri1
1978, PP. 242, 244, 246.
3. Busby, E. S. "Digital audio recording on videotape: Some choices." SMPTE
Journal. vol. 89, no. 7, July 1980, pp. 508-12.
4. The Complete Guide to Stereo Equipment. New York: Simon and Schuster
Publishers, 1979.
5. Digital Audio Technical Committee. “Digital audio disk standardization
Conference.” Journal of the Audio Engineering Society. July/August 1980,
pp. 525-27.
6. "Digital music at home.” Computer Music Journal. vol. 5, no. 4, 1981, p.
6.
7. Doi, T. T. “General information on a compact digital audio disk.” Journal
of the Audio Engineering Society. January/February 1981, pp. 60, 62.
8. Doi, T. T., et al. "A long play digital audio disk system.” Journal of the
Audio Engineering Society. December 1979, pp. 975-81.
9. Doi, T. T. and Roger Lagadex. “An outline of the format jointly supported by
Sony and Studer for stationary-head digital audio recording.” Journal of the
Audio Engineering Society. September 1980, p. 624.
10. Feldman, L. "Digital audio for the 1980's." Radio Electron. June 1980, pp.
63-5, 87.
11. Hidaka, T. "JVC AHD digital audio disk system." Journal of the Audio
Engineering Society. January/February 1981, pp. 68, 70.
12. Illinois Audio Catalog. Chicago, Illinois Audio, 1982.
13. “JVC digital audio products.” Computer Music Journal. vol. 5, no. 4, 1981,
pp. 89-90.
14. Klein, Larry. “Audio fair in Tokyo (Oct. ‘81).” Stereo Review. February
1982, pp. 56-60.
15. McCracken, J. A. “A high-performance digital audio recorder.” Journal of
the Audio Engineering Society. July/August 1981, p. 560.
16. Meyer, E. Brad. "Digital.” Stereo Review. Apri1 1982, pp. 56-9.
44
17. "Minutes of the meeting of the Digital Audio Technical Committee." Journal
of the Audio Engineering Society. September 1981, pp. 620-24.
18. "New audio products at Las Vegas CES.” Stereo Review. May 1982, pp. 53-6.
19. PCM75: Designed for Audio. Tucson, Burr-Brown Research Corporation, July
1981.
20. Ranada, David. “Digital audio: A primer.” Stereo Review. February 1981, pp.
63-5.
21. Ranada, David. “Good news: Digital recording standards." Stereo Review.
March 1980, p. 20.
22. Ranada, David. "Superdiscs.” Stereo Review. December 1980, pp. 61-4, 67.
23. Sony‘s Digital Audio Technology - QA. Japan, Sony Corporation, nd.
24. Sony's Digital Audio - The Next Step. Japan, Sony Corporation, May 1981.
25. “Sony‘s small, portable digital audio adaptor." Stereo Review. May 1982, p.
13.
26. Stark, Craig. "Equipment test reports: Technics SV-P100 digital cassette
recorder." Stereo Review. March 1982, pp .40, 42, 46.
27. Traiman, Steve. “Digital decade: The audio/visual future.” Stereo Review.
January 1980, pp. 69-72.
28. Weisser, A. “The digital recording of sound in broadcasting.” SMPTE
Journal. July 1980, pp. 520-4.
29. Welland, K. and H. Redlich. “The MD (Mini Disk) system: A contribution to
the digital audio disk standard.” Journal of the Audio Engineering Society.
July/August 1980, pp. 510-14.

Weitere ähnliche Inhalte

Andere mochten auch

Unabridged encyclopaedia of human knowledge to date
Unabridged encyclopaedia of human knowledge to dateUnabridged encyclopaedia of human knowledge to date
Unabridged encyclopaedia of human knowledge to datethierry mortier
 
Total learning: The new future of work and learning
Total learning: The new future of work and learningTotal learning: The new future of work and learning
Total learning: The new future of work and learningBrightwave Group
 
A Review Of Design Process For Low Energy Solar Homes
A Review Of Design Process For Low Energy Solar HomesA Review Of Design Process For Low Energy Solar Homes
A Review Of Design Process For Low Energy Solar HomesBhisu
 
Simple SEO - SEO Made Simple - Do It Yourself SEO
Simple SEO - SEO Made Simple - Do It Yourself SEOSimple SEO - SEO Made Simple - Do It Yourself SEO
Simple SEO - SEO Made Simple - Do It Yourself SEOThe JAR Group
 
Slide to Unlock: learning design for the mobile learner
Slide to Unlock: learning design for the mobile learnerSlide to Unlock: learning design for the mobile learner
Slide to Unlock: learning design for the mobile learnerBrightwave Group
 
What lucky looks like
What lucky looks likeWhat lucky looks like
What lucky looks likeAlan Bassett
 
Os negros africanos no Brasil Colonial Monize e Hanna
Os negros africanos no Brasil Colonial Monize e HannaOs negros africanos no Brasil Colonial Monize e Hanna
Os negros africanos no Brasil Colonial Monize e Hannanice miranda
 
Red stripe 026a 16 lpb
Red stripe 026a 16 lpbRed stripe 026a 16 lpb
Red stripe 026a 16 lpbAlan Bassett
 
Zintegrowany System Gospodarki Odpadami dla Miasta Poznania - szanse i zagroż...
Zintegrowany System Gospodarki Odpadami dla Miasta Poznania - szanse i zagroż...Zintegrowany System Gospodarki Odpadami dla Miasta Poznania - szanse i zagroż...
Zintegrowany System Gospodarki Odpadami dla Miasta Poznania - szanse i zagroż...Ekokonsultacje
 
หนังสือเล่มเล็ก
หนังสือเล่มเล็กหนังสือเล่มเล็ก
หนังสือเล่มเล็กguest32f0fe9
 
Website analytics beyond your presentation
Website analytics beyond your presentationWebsite analytics beyond your presentation
Website analytics beyond your presentationpbrolley
 
Permutation and combination
Permutation and combinationPermutation and combination
Permutation and combinationmanishkanwar
 
Slayter on planning quant design for flc projects - may 2011
Slayter   on planning quant design for flc projects - may 2011Slayter   on planning quant design for flc projects - may 2011
Slayter on planning quant design for flc projects - may 2011Elspeth Slayter
 
Sociala medier - employer branding Senior & Young
Sociala medier - employer branding Senior & YoungSociala medier - employer branding Senior & Young
Sociala medier - employer branding Senior & YoungTobias Franzén
 
Multiqueue BPF support and other BPF feature
Multiqueue BPF support and other BPF featureMultiqueue BPF support and other BPF feature
Multiqueue BPF support and other BPF featureTakuya ASADA
 
Achieving large-scale organisational change through e-learning - Bupa Health ...
Achieving large-scale organisational change through e-learning - Bupa Health ...Achieving large-scale organisational change through e-learning - Bupa Health ...
Achieving large-scale organisational change through e-learning - Bupa Health ...Brightwave Group
 

Andere mochten auch (19)

Unabridged encyclopaedia of human knowledge to date
Unabridged encyclopaedia of human knowledge to dateUnabridged encyclopaedia of human knowledge to date
Unabridged encyclopaedia of human knowledge to date
 
Total learning: The new future of work and learning
Total learning: The new future of work and learningTotal learning: The new future of work and learning
Total learning: The new future of work and learning
 
A Review Of Design Process For Low Energy Solar Homes
A Review Of Design Process For Low Energy Solar HomesA Review Of Design Process For Low Energy Solar Homes
A Review Of Design Process For Low Energy Solar Homes
 
Simple SEO - SEO Made Simple - Do It Yourself SEO
Simple SEO - SEO Made Simple - Do It Yourself SEOSimple SEO - SEO Made Simple - Do It Yourself SEO
Simple SEO - SEO Made Simple - Do It Yourself SEO
 
Slide to Unlock: learning design for the mobile learner
Slide to Unlock: learning design for the mobile learnerSlide to Unlock: learning design for the mobile learner
Slide to Unlock: learning design for the mobile learner
 
What lucky looks like
What lucky looks likeWhat lucky looks like
What lucky looks like
 
Os negros africanos no Brasil Colonial Monize e Hanna
Os negros africanos no Brasil Colonial Monize e HannaOs negros africanos no Brasil Colonial Monize e Hanna
Os negros africanos no Brasil Colonial Monize e Hanna
 
Red stripe 026a 16 lpb
Red stripe 026a 16 lpbRed stripe 026a 16 lpb
Red stripe 026a 16 lpb
 
Zintegrowany System Gospodarki Odpadami dla Miasta Poznania - szanse i zagroż...
Zintegrowany System Gospodarki Odpadami dla Miasta Poznania - szanse i zagroż...Zintegrowany System Gospodarki Odpadami dla Miasta Poznania - szanse i zagroż...
Zintegrowany System Gospodarki Odpadami dla Miasta Poznania - szanse i zagroż...
 
หนังสือเล่มเล็ก
หนังสือเล่มเล็กหนังสือเล่มเล็ก
หนังสือเล่มเล็ก
 
Keynote
Keynote Keynote
Keynote
 
Website analytics beyond your presentation
Website analytics beyond your presentationWebsite analytics beyond your presentation
Website analytics beyond your presentation
 
5
55
5
 
Permutation and combination
Permutation and combinationPermutation and combination
Permutation and combination
 
Slayter on planning quant design for flc projects - may 2011
Slayter   on planning quant design for flc projects - may 2011Slayter   on planning quant design for flc projects - may 2011
Slayter on planning quant design for flc projects - may 2011
 
Sociala medier - employer branding Senior & Young
Sociala medier - employer branding Senior & YoungSociala medier - employer branding Senior & Young
Sociala medier - employer branding Senior & Young
 
Multiqueue BPF support and other BPF feature
Multiqueue BPF support and other BPF featureMultiqueue BPF support and other BPF feature
Multiqueue BPF support and other BPF feature
 
Achieving large-scale organisational change through e-learning - Bupa Health ...
Achieving large-scale organisational change through e-learning - Bupa Health ...Achieving large-scale organisational change through e-learning - Bupa Health ...
Achieving large-scale organisational change through e-learning - Bupa Health ...
 
R3 l+ module slides draft
R3 l+ module slides draftR3 l+ module slides draft
R3 l+ module slides draft
 

Ähnlich wie Digital Audio - Technical Writing Class Paper - OCR Reformat

Real-Time Vowel Synthesis - A Magnetic Resonator Piano Based Project_by_Vasil...
Real-Time Vowel Synthesis - A Magnetic Resonator Piano Based Project_by_Vasil...Real-Time Vowel Synthesis - A Magnetic Resonator Piano Based Project_by_Vasil...
Real-Time Vowel Synthesis - A Magnetic Resonator Piano Based Project_by_Vasil...Vassilis Valavanis
 
DkIT Audio Research Dissertation
DkIT Audio Research DissertationDkIT Audio Research Dissertation
DkIT Audio Research DissertationSeán Mc Keown
 
thesis_Zhiyuan Lin
thesis_Zhiyuan Linthesis_Zhiyuan Lin
thesis_Zhiyuan LinZhiyuan Lin
 
AndrewShapiro-SeniorThesis-ReedCollege
AndrewShapiro-SeniorThesis-ReedCollegeAndrewShapiro-SeniorThesis-ReedCollege
AndrewShapiro-SeniorThesis-ReedCollegeAndy Shapiro
 
Project_Report-E_Ballestero.compressed
Project_Report-E_Ballestero.compressedProject_Report-E_Ballestero.compressed
Project_Report-E_Ballestero.compressedEric Ballestero
 
Pulsed Fibre Optic Light Source for Optical Tomography
Pulsed Fibre Optic Light Source for Optical TomographyPulsed Fibre Optic Light Source for Optical Tomography
Pulsed Fibre Optic Light Source for Optical TomographyQasim Humayoun
 
Trg trc007-en fundamentals of hvac acoustics
Trg trc007-en fundamentals of hvac acousticsTrg trc007-en fundamentals of hvac acoustics
Trg trc007-en fundamentals of hvac acousticsVoVi Phap Danh
 
Recording and reproduction system
Recording and reproduction systemRecording and reproduction system
Recording and reproduction systemGirish Panchal
 
PROJ322 Roberto Weiser Report
PROJ322 Roberto Weiser ReportPROJ322 Roberto Weiser Report
PROJ322 Roberto Weiser ReportRoberto Weiser
 
Dafx (digital audio-effects)
Dafx (digital audio-effects)Dafx (digital audio-effects)
Dafx (digital audio-effects)shervin shokri
 
Tesi ph d_andrea_barucci_small
Tesi ph d_andrea_barucci_smallTesi ph d_andrea_barucci_small
Tesi ph d_andrea_barucci_smallAndrea Barucci
 
PhD-Thesis-ErhardRank
PhD-Thesis-ErhardRankPhD-Thesis-ErhardRank
PhD-Thesis-ErhardRankErhard Rank
 
Active noise control primer
Active noise control primerActive noise control primer
Active noise control primerCharlton Inao
 
Tieng anh chuyen_nganh_vien_thong_-_bai_tap
Tieng anh chuyen_nganh_vien_thong_-_bai_tapTieng anh chuyen_nganh_vien_thong_-_bai_tap
Tieng anh chuyen_nganh_vien_thong_-_bai_tapMinh Tự Nguyễn
 
Emona-based Interactive Amplitude Modulation/Demodulation iLab
Emona-based Interactive Amplitude Modulation/Demodulation iLabEmona-based Interactive Amplitude Modulation/Demodulation iLab
Emona-based Interactive Amplitude Modulation/Demodulation iLabHuynh MVT
 
A Survey of Modern Music Software Ecosystems
A Survey of Modern Music Software EcosystemsA Survey of Modern Music Software Ecosystems
A Survey of Modern Music Software EcosystemsEli Stine
 

Ähnlich wie Digital Audio - Technical Writing Class Paper - OCR Reformat (20)

Real-Time Vowel Synthesis - A Magnetic Resonator Piano Based Project_by_Vasil...
Real-Time Vowel Synthesis - A Magnetic Resonator Piano Based Project_by_Vasil...Real-Time Vowel Synthesis - A Magnetic Resonator Piano Based Project_by_Vasil...
Real-Time Vowel Synthesis - A Magnetic Resonator Piano Based Project_by_Vasil...
 
DkIT Audio Research Dissertation
DkIT Audio Research DissertationDkIT Audio Research Dissertation
DkIT Audio Research Dissertation
 
ThereminReport
ThereminReportThereminReport
ThereminReport
 
thesis_Zhiyuan Lin
thesis_Zhiyuan Linthesis_Zhiyuan Lin
thesis_Zhiyuan Lin
 
AndrewShapiro-SeniorThesis-ReedCollege
AndrewShapiro-SeniorThesis-ReedCollegeAndrewShapiro-SeniorThesis-ReedCollege
AndrewShapiro-SeniorThesis-ReedCollege
 
Project_Report-E_Ballestero.compressed
Project_Report-E_Ballestero.compressedProject_Report-E_Ballestero.compressed
Project_Report-E_Ballestero.compressed
 
Pulsed Fibre Optic Light Source for Optical Tomography
Pulsed Fibre Optic Light Source for Optical TomographyPulsed Fibre Optic Light Source for Optical Tomography
Pulsed Fibre Optic Light Source for Optical Tomography
 
shifas_thesis
shifas_thesisshifas_thesis
shifas_thesis
 
Trg trc007-en fundamentals of hvac acoustics
Trg trc007-en fundamentals of hvac acousticsTrg trc007-en fundamentals of hvac acoustics
Trg trc007-en fundamentals of hvac acoustics
 
Recording and reproduction system
Recording and reproduction systemRecording and reproduction system
Recording and reproduction system
 
PROJ322 Roberto Weiser Report
PROJ322 Roberto Weiser ReportPROJ322 Roberto Weiser Report
PROJ322 Roberto Weiser Report
 
Dafx (digital audio-effects)
Dafx (digital audio-effects)Dafx (digital audio-effects)
Dafx (digital audio-effects)
 
Compression
CompressionCompression
Compression
 
repport christian el hajj
repport christian el hajjrepport christian el hajj
repport christian el hajj
 
Tesi ph d_andrea_barucci_small
Tesi ph d_andrea_barucci_smallTesi ph d_andrea_barucci_small
Tesi ph d_andrea_barucci_small
 
PhD-Thesis-ErhardRank
PhD-Thesis-ErhardRankPhD-Thesis-ErhardRank
PhD-Thesis-ErhardRank
 
Active noise control primer
Active noise control primerActive noise control primer
Active noise control primer
 
Tieng anh chuyen_nganh_vien_thong_-_bai_tap
Tieng anh chuyen_nganh_vien_thong_-_bai_tapTieng anh chuyen_nganh_vien_thong_-_bai_tap
Tieng anh chuyen_nganh_vien_thong_-_bai_tap
 
Emona-based Interactive Amplitude Modulation/Demodulation iLab
Emona-based Interactive Amplitude Modulation/Demodulation iLabEmona-based Interactive Amplitude Modulation/Demodulation iLab
Emona-based Interactive Amplitude Modulation/Demodulation iLab
 
A Survey of Modern Music Software Ecosystems
A Survey of Modern Music Software EcosystemsA Survey of Modern Music Software Ecosystems
A Survey of Modern Music Software Ecosystems
 

Mehr von Paul Teich

PCA17 Augment Your Product Reality
PCA17 Augment Your Product RealityPCA17 Augment Your Product Reality
PCA17 Augment Your Product RealityPaul Teich
 
Digital Audio - Technical Writing Class Paper - Original scan
Digital Audio - Technical Writing Class Paper - Original scanDigital Audio - Technical Writing Class Paper - Original scan
Digital Audio - Technical Writing Class Paper - Original scanPaul Teich
 
SXSW Interactive 2016 panelpicker proposal AIandJobs
SXSW Interactive 2016 panelpicker proposal AIandJobsSXSW Interactive 2016 panelpicker proposal AIandJobs
SXSW Interactive 2016 panelpicker proposal AIandJobsPaul Teich
 
PCA15: Does Your Product Displace a Job or Augment a Skill?
PCA15: Does Your Product Displace a Job or Augment a Skill?PCA15: Does Your Product Displace a Job or Augment a Skill?
PCA15: Does Your Product Displace a Job or Augment a Skill?Paul Teich
 
PCA10 Heres a Scenario For You
PCA10 Heres a Scenario For YouPCA10 Heres a Scenario For You
PCA10 Heres a Scenario For YouPaul Teich
 
Which Customers are You Building the Right Product For PCA9
Which Customers are You Building the Right Product For PCA9Which Customers are You Building the Right Product For PCA9
Which Customers are You Building the Right Product For PCA9Paul Teich
 
Working and Collaborating Across Cultures
Working and Collaborating Across CulturesWorking and Collaborating Across Cultures
Working and Collaborating Across CulturesPaul Teich
 
Sizing, Segmenting, and Forecasting Markets
Sizing, Segmenting, and Forecasting MarketsSizing, Segmenting, and Forecasting Markets
Sizing, Segmenting, and Forecasting MarketsPaul Teich
 

Mehr von Paul Teich (8)

PCA17 Augment Your Product Reality
PCA17 Augment Your Product RealityPCA17 Augment Your Product Reality
PCA17 Augment Your Product Reality
 
Digital Audio - Technical Writing Class Paper - Original scan
Digital Audio - Technical Writing Class Paper - Original scanDigital Audio - Technical Writing Class Paper - Original scan
Digital Audio - Technical Writing Class Paper - Original scan
 
SXSW Interactive 2016 panelpicker proposal AIandJobs
SXSW Interactive 2016 panelpicker proposal AIandJobsSXSW Interactive 2016 panelpicker proposal AIandJobs
SXSW Interactive 2016 panelpicker proposal AIandJobs
 
PCA15: Does Your Product Displace a Job or Augment a Skill?
PCA15: Does Your Product Displace a Job or Augment a Skill?PCA15: Does Your Product Displace a Job or Augment a Skill?
PCA15: Does Your Product Displace a Job or Augment a Skill?
 
PCA10 Heres a Scenario For You
PCA10 Heres a Scenario For YouPCA10 Heres a Scenario For You
PCA10 Heres a Scenario For You
 
Which Customers are You Building the Right Product For PCA9
Which Customers are You Building the Right Product For PCA9Which Customers are You Building the Right Product For PCA9
Which Customers are You Building the Right Product For PCA9
 
Working and Collaborating Across Cultures
Working and Collaborating Across CulturesWorking and Collaborating Across Cultures
Working and Collaborating Across Cultures
 
Sizing, Segmenting, and Forecasting Markets
Sizing, Segmenting, and Forecasting MarketsSizing, Segmenting, and Forecasting Markets
Sizing, Segmenting, and Forecasting Markets
 

Kürzlich hochgeladen

GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 

Kürzlich hochgeladen (20)

GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

Digital Audio - Technical Writing Class Paper - OCR Reformat

  • 1. AN ANALYSIS OF THE ADVANTAGES, DISADVANTAGES, AND STANDARDIZATION OF DIGITAL AUDIO for Professor Eugene O. Young Technical writing Instructor Texas A&M University College Station, Texas by Paul Robert Teich Student Computing Science April 26, 1982
  • 2. i LETTER OF TRANSMITTAL 406 C Manuel Dr. College Station, Texas 77840 April 26, 1982 Dr. Eugene O. Young 217-D Academic and Agency Building Texas A&M University College Station, Texas 77843 Dear Dr. Young: I have completed the major report for English 301 and am submitting it for grading. The title of my report is: “An Analysis of the Advantages, Disadvantages, and Standardization of Digital Audio.” I am a computing science major, and my hobby is high fidelity sound reproduction. The research I did for this paper laid the groundwork for my research project this summer, which will be on replicating human speech inflections and accent with a digital speech synthesizer. The field of digital audio is very young, even when compared with the age of digital computer technology. I believe, though, that in the next decade, digital audio will take over a major portion of the consumer audio market. This report is written to audiophiles who have heard about digital audio products, but have had no exposure to literature on them. There are no books devoted to digital audio and most articles are very technical, requiring quite a bit of knowledge about digital computers. To get a fairly extensive knowledge of digital audio requires days of looking for sources and reading. Some of the information was not directly available at all, but was the result of written inquiries to various manufacturers. I have not yet found a more comprehensive report on digital audio than mine. Thank you for your advice on using note cards and on revision. This is the largest paper I have ever completed, and I am going to include it, along with my summer research on my resume. If you have any questions about my report, please call me at 693-5560. Sincerely, Paul R. Teich
  • 3. ii Table of Contents LETTER OF TRANSMITTAL...................................................... i LIST OF ILLUSTRATIONS..................................................... iv LIST OF TABLES............................................................ iv ABSTRACT .................................................................. v INTRODUCTION............................................................... 1 THE DIGITAL AUDIO PROCESS.................................................. 2 Digital Audio Recording .................................................. 2 Filtering of analog input signal ....................................... 2 Analog-to-digital conversion of signal ................................. 3 Digital Audio Playback ................................................... 6 Pickup of digital information from storage media........................ 6 Digital-to-analog conversion of binary code ............................ 6 Filtering of output signal ............................................. 7 Summary.................................................................. 7 DISADVANTAGES OF THE DIGITAL AUDIO PROCESS ................................. 7 Aliasing Distortion ...................................................... 8 Quantization Noise ...................................................... 10 ADVANTAGES OF THE DIGITAL AUDIO PROCESS ................................... 13 Specifications .......................................................... 13 Frequency response .................................................... 13 Channel separation .................................................... 13 Total harmonic distortion ............................................. 14 Wow and flutter ....................................................... 14 Signal-to-noise ratio ................................................. 14 Dynamic range ......................................................... 14 Error Detection ......................................................... 16 Parity ................................................................ 16 Cyclic redundancy check code (CREE) ................................... 17 Error Concealment ....................................................... 17 Error Correction ........................................................ 18 Other Advantages ........................................................ 20
  • 4. iii PROBLEMS WITH STANDARDIZING THE CONSUMER DIGITAL AUDIO FIELD .............. 21 Recording Formats ....................................................... 21 Sampling rate ......................................................... 22 Quantizing bits ....................................................... 24 Non-audio information ................................................. 26 Total format .......................................................... 28 Storage/Playback Systems ................................................ 28 Tape .................................................................. 29 Disc .................................................................. 29 SELECTED DIGITAL AUDIO SYSTEMS............................................ 32 Technics by Matsushita .................................................. 32 JVC..................................................................... 33 Sony.................................................................... 34 PCM-F1 ................................................................ 34 DAD-1X ................................................................ 34 FUTURE OF DIGITAL AUDIO................................................... 35 Digital Audio Discs ..................................................... 35 Advantages ............................................................ 35 Disadvantages ......................................................... 37 Support ............................................................... 37 Digital Audio Today ..................................................... 38 CONCLUSION................................................................ 39 APPENDIX ................................................................. 41 GLOSSARY OF TERMINOLOGY ................................................. 41 BIBLIOGRAPHY ............................................................ 43
  • 5. iv LIST OF ILLUSTRATIONS Figure 1 Typical analog music input ........................................ 3 Figure 2 Music signal being sampled ........................................ 4 Figure 3 Square waveform of sampled voltage levels ......................... 5 Figure 4 Aliasing Distortion............................................... 8 Figure 5 High Frequency Sampling........................................... 9 Figure 6 Dynamic range of magnetic recording tape ......................... 15 Figure 7 Sony Crossword code error correction method ...................... 19 Figure 8 Interfacing between 16-bit and 14-bit machines ................... 25 Figure 9 The 3M format.................................................... 28 Figure 10 Laser scanning.................................................. 31 Figure 11 Block diagram of a digital tape machine ......................... 36 LIST OF TABLES Table 1 A Sampling of the Current Digital Audio Market.................. 11 Table 2 Groups of Manufacturers with Compatible Formats................. 21 Table 3 Preferred Sampling Rates ....................................... 23 Table 4 Number of Quantizing Bits Used in Existing and Proposed Systems . 26 Table 5 Error Correction Methods ....................................... 27 Table 6 Storage/Playback Systems: Comparison of Characteristics......... 32
  • 6. v ABSTRACT This report presents the audiophile with a comprehensive view of the digital audio field, from a description of how digital audio works to current controversies in the field. The process of digital audio is described at a level that requires no prior understanding of digital computers. The advantages and disadvantages of the process are discussed, along with possible solutions and alternatives. The problem of standardization within the digital audio field is examined, and possible solutions are suggested. Finally, the future of digital audio and its possible repercussions in the audio field are presented.
  • 7. 1 INTRODUCTION The “Computer Revolution,” started in the mid-1960s, has been gradually edging into the audio field for years. It started with digital FM tuning circuits and electronically controlled turntable motors, which used quartz phase- locked loop (PLL) circuits. In the last three or four years, microprocessor control has been an integral part of tape decks, pre-amplifiers, and such; and it recently has been added to equalizers and remote control devices. However, the nature of recorded sound had not changed. Microprocessors could help get the best possible analog recordings, but could not improve on the actual method of recording. Analog recording, since its invention by Thomas Edison, translates music to an electronic equivalent. The amplitude of a voltage or current is made proportional to the loudness of the music, and the frequency matches that of the music on a one-to-one ratio (4:225). Digital recording takes an analog music signal and converts it into a computer readable code. The advantage of the digital process is that the music can be measured for computer coding with much greater precision than the magnetic field strength of a tape can be measured (1:132). This means that the computer code can be recorded on tape or disk; and, when it is played back, the music is not dependent on the physical characteristics of the tape for its reproduction. This report will present a detailed description of the digital audio process, as well as its advantages and disadvantages. The problem of standardization in the field of consumer digital audio equipment will be discussed, and some sample systems will be examined. This report assumes that the reader is well acquainted with most aspects of high fidelity sound systems, including specifications, measurements and technical terminology. Some working definitions will be included in the Appendix at the end of the report.
  • 8. 2 Most of the Sources for this paper are from the Journal of the Audio Engineering Society and Stereo Review Magazine, due to the fact that digital audio is a very young field with no books published as yet. The rest of the references are from other trade journals, informational brochures, and a passing reference to digital audio as a future development in a recent book. Few sources are over five pages long. THE DIGITAL AUDIO PROCESS Digital audio can be divided into two separate processes: digital recording and playback. The process of digital recording consists of: (1) Filtering of the analog input signal, (2) analog-to-digital conversion of the signal, and (3) transfer of the digital information to the recording media. The steps involved in digital playback are: (1) Pickup of the digital information from the recording media, (2) digital-to-analog conversion of the binary code, and (3) filtering of the output signal. Digital Audio Recording Filtering of analog input signal. First, an analog input signal, usually from a pre-amplifier (8:560), is routed through a very sharp-cutoff, high- frequency-cut filter. This filter, called an anti-aliasing filter, removes all frequencies above the highest frequency to be reproduced (20:64) (see Figure 1). In most systems, this frequency is 20 kHz.
  • 9. 3 Figure 1 (a) Typical analog music input from a pre-amplifier, containing very high frequency noise (20:63). (b) Music signal after anti-alias filtering (20:64). Analog-to-digital conversion of signal. The voltage of the filtered analog signal is next measured at very small, constant intervals of time. The frequency of this sampling is determined from a mathematical model, which states that the signal must be sampled at least at twice the highest frequency to be reproduced. Common sampling rates are between 44 kHz and 5l kHz (23:17). To measure the voltage of the signal at each time interval, a "sample-and-hold” circuit is used. This circuit samples the voltage of the signal at the beginning of a time interval, and then holds its own voltage equal to the sampled value for the
  • 10. 4 duration of the interval (see Figure 2). It repeats this process for each interval to be measured (20:64). Figure 2 Music signal being sampled (20:64). The output of the sample-and-hold circuit is a series of voltage levels in the form of a square wave (see Figure 3a). The voltage levels are now individually compared to a set of reference voltage levels and they are then translated, using an analog-to-digital (A/D) converter, to binary numbers (34:65). Binary is a number system that uses combinations of ones and zeroes, called "bits,” to represent numbers. The output of the converter is a string of bits, called “binary code”, which is an accurate representation of the sample- and-hold circuit output (see Figure 3c). The process of changing the samples from analog to binary form is known as "quantization" (2324).
  • 11. 5 Figure 3 (a) Square waveform of sampled voltage levels (20:64). (b) Waveform after being rounded off to the reference levels (20:65). (c) Binary code generated by analog-to-digital converter. The last step in the analog-to-digital conversion sequence is the encoding of the binary code. This procedure inserts timing and error correction information, in binary form, into the code. It also can merge two separate channels for stereo. The output of the encoding step is all of the digital information necessary for the reconstruction of the analog input signal (1:132). The final process is the actual recording of the digital information onto tape. This is done by translating the bits of the binary code into
  • 12. 6 electromagnetic pulses, which are recorded on tape in the same way an analog signal would be recorded. The tape may now be copied, transferred onto master audio discs, or transferred to other storage media. Digital Audio Playback Pickup of digital information from storage media. First, the digital information is translated from storage media back into the string of digital information needed to reconstruct the original analog signal (20:65). Digital-to-analog conversion of binary code. The first process that the digital information encounters is the removal of the timing and error correction information. The error correction information is used to restore the parts of the binary code that were not picked up properly, due to flaws in the storage media or pickup mechanism. The timing information is used to control how fast the digital information is being picked up from the storage media. As the error correcting is finished, the binary code is put into a waiting area, called a buffer, for temporary storage. The amount of material the buffer can hold is limited, however, so the timing information is used to control the pickup mechanism. For example, if digital information is being picked up and filling the buffer too fast or too slowly, the timing information corrects the speed deviation (20:65). As the binary code is put into the back of the buffer, the codes for the individual voltage levels are taken from the front. Then, they are removed at a rate equal to the original sampling frequency which is used in the recording process. As they are removed, they are translated, using a digital-to-analog (D/A) converter, back into their corresponding voltage levels. The output of the digital-to-analog conversion is a series of voltages in the form of a square waveform. This replicates the output of the sample-and-hold circuit, with the
  • 13. 7 small differences due to the rounding off process during quantization of the original waveform (20:65). Filtering of output signal. The final stage of this process is the changing of the square waveform into a constantly varying analog signal. This is accomplished by an "output-smoothing" filter, which, as the anti-aliasing filter did, removes all frequencies above the highest frequency to be reproduced. The effect is the smoothing of the square waveform into an almost exact replica of the input to the digital audio recording process (20:65). Summary A music signal to be recorded first encounters the anti-aliasing filter. Next, the signal is sampled, and the samples are rounded off. Then, the samples are quantized, encoded and, finally, transferred to the recording media. To restore the music to analog form, the digital information is picked up from the recording. It is then decoded and sent to the digital-to-analog converter. The signal from the D/A converter is sent through an output-smoothing filter, which results in a replica of the original music signal. DISADVANTAGES OF THE DIGITAL AUDIO PROCESS There are three disadvantages associated with the process of digital audio. Aliasing distortion and quantizing noise are the two main problems. Both result in distortion in a digital audio system, and neither of these forms of distortion are found in analog systems. While aliasing distortion is a correctable problem, quantizing noise is rooted in the theoretical capabilities of digital audio (20:65-6). The third disadvantage is the cost of a digital audio system. This is a temporary condition, however, as technological advances are already lowering the cost of digital audio equipment (14:57).
  • 14. 8 Aliasing Distortion As described earlier, there is a mathematical relationship which states that the sampling frequency of the music must be at least twice the highest frequency to be sampled (see Figure 4a). This assures that both the positive and the negative halves of even the highest frequency are sampled (see Figure 4b) (16:57). Figure 4 (a) Comparison of properly filtered system with improperly filtered system (3:511). (b) 20 kHz since wave (solid line) being sampled (dotted line) at 44 kHz (16:59).
  • 15. 9 If a frequency above half the sampling rate is sampled, it results in an incorrect voltage for that sample. During playback, the incorrect voltage will show up as a lower frequency distortion (see Figure 5). This distortion is called "aliasing" distortion, because the low frequency signal generated is a false representation, or "alias,” of the high frequency input (20:64). Aliasing distortion products are not harmonically related to the music, and so they are very noticeable (3:510). The effects of aliasing distortion also include the downgrading of the signal-to-noise ratio of the recording (23:5). Figure 5 (a) High frequency is sampled at low point. (b) High frequency is sampled at high point. (c) Filtered curve is sampled. To minimize aliasing distortion, the sampling rate must be at least twice the highest frequency sought. Equally as important is the filtering of the input to remove any unsought higher frequencies. This is the purpose of the anti- aliasing filter used in the digital recording process.
  • 16. 10 Quantization Noise During the quantizing step of digital recording, the sampled voltage levels are rounded off to the nearest reference level. The reference levels are a part of the analog-to-digital converter, and are used in the same way that the markings on a ruler are used to approximate a value that could have very many decimal places (3 inches compared to 3.002 inches) (20:65). "There is always a slight error between the original signal level and the quantized value.” (23:5). These errors are heard as noise, and are called “quantizing noise.” To minimize quantizing noise, more closely spaced reference levels must be used. However, this introduces two other problems. First, the finer the spacing between reference levels, the larger the binary number needed to represent the levels; therefore, more space is used to record the number. The more space each number takes up, the less music fits onto a tape or disc. The second problem is that, as the spacing gets closer, it becomes harder for the analog-to-digital converter to tell the difference between adjacent levels (20:65). This problem is rooted in the present limits of integrated electronic circuitry technology. To combat these problems, it is necessary to compromise accuracy for space and electronic considerations. How much compromise depends on the whims of the digital audio system designers. However, when better analog-to-digital converters and faster, cheaper, computer memory become available, there will be no need for a tradeoff. There are two very audible subdivisions of quantizing noise. The first is called “granulation noise.” As the signal gets weaker (near the bottom of the system's dynamic range), the rounding off process alters a larger percent of the input signal. This can add a harsh, gritty distortion to the music signal. "The subjective effect of this noise is much more unpleasant than ordinary tape hiss.”
  • 17. 11 To correct granulation noise, many digital recorders add a low level hiss to the music. This changes the granulation noise to a more benign form of hiss (16:58). The other problem is that when a signal is recorded that is larger than the highest reference level, it is rounded to the highest reference level. The analog recording equivalent of this is clipping, but the effect is much more noticeable in digital audio. The solution to digital clipping is, while recording the music, to set the recording level a few decibels lower, although this also decreases the dynamic range of the recording (16:58). Cost From the viewpoint of the audio consumer, the most serious disadvantage of digital audio is the cost. Digital system components are very expensive, though, as pointed out in The Complete Guide to Stereo Equipment, some digital components are “less expensive than some of the esoteric models some folks favor” (4:184). Table l gives a random sampling of the current digital audio market. Table 1 (30, 33, 43) A Sampling of the Current Digital Audio Market Model Type Cost Sony PCM-F1 Portable VCR audio adaptor $2,500 Hitachi PCMV100 Audio/video recorder $1,900 Sanyo VCR audio adaptor $2,900 Sharp VCR audio adaptor $2,950 Technics SVP100 Audio recorder $3,000 Sony PCM10 VCR audio adaptor $3,500 Toshiba VCR audio adaptor $3,750 JVC VCR audio adaptor $7,500
  • 18. 12 There are several reasons why digital audio reproduction is so expensive, all are by-products of digital computer technology. First, though not an equipment cost, is the price of video cassette recorder (VCR) tape. VCR tape is needed, because the frequency of the electronic pulse representing the binary code is typically measured in megahertz (MHz). In the case of the Sony PCM-10, the sampling frequency (44.056 kHz) is multiplied by 14 bits per sample and by two channels (for stereo reproduction) to obtain 1.234 MHz with error correction and synchronization, the resulting frequency may be as high as 2.643 MHz. Standard audio cassette tapes cannot record such high frequencies, which is why VCR tape is used (23:l0). Even when ordered from a warehouse supplier, VCR tape prices range from $12.00 to above $17.00 per tape (12:2). The reasons why digital audio equipment, itself, is so expensive deal with current computer and integrated circuitry technology. To reduce distortion caused by the elements in this circuitry (as happens in all audio systems), high- speed, high-precision circuit elements are needed (23:5). In digital audio systems, the sample-and-hold, analog-to-digital, synchronizing, and error correction circuitry are all implemented using integrated circuitry (see Appendix). Integrated circuits of the complexity needed for audio applications are very expensive to produce. Only the larger audio electronics corporations, such as Sony, have the resources needed to design and produce their own integrated circuitry (14:57). Other electronics companies, outside of the audio field, have produced chips to sell to audio manufacturers. An example of an integrated circuit produced for audio applications is the Burr-Brown Corporation's PCM75 analog-to-digital converter. There are several models, ranging from $198 to $249, depending on the speed and distortion of each model (19:1).
  • 19. 13 ADVANTAGES OF THE DIGITAL AUDIO PROCESS There are many more advantages to the digital audio process than disadvantages. The specifications of a digital system's ability to reproduce sound are much better than analog system specifications. Digital systems make use of signal error detection, concealment and correction, while analog systems do not. In addition, copying digital recordings does not degrade the sound. There is no print-through on digital tapes; and on audio discs (analogous to records), any wear that may occur will not affect the music. Specifications The specifications used to describe how well a digital audio system reproduces music are the same as used in analog audio. The major categories are: (1) frequency response, (2) channel separation, (3) total harmonic distortion, (4) wow and flutter, (5) signal-to-noise ratio, and (6) dynamic range. Frequency response. The frequency response of digital systems is limited only in the high-frequency response. This is because of the anti-aliasing filter. However, if the proper filter and sampling rate are used, the high-frequency response can be extended beyond the range of human hearing. The frequency response of a digital recording is also exceptionally flat, displaying none of the dips, peaks and roll offs associated with analog readings. It is not uncommon for digital recordings to display a frequency response of 0 - 20 kHz ± 1 dB (17:623). Channel separation. When recording music digitally, the two (stereo) channels are sampled separately. After they are converted to binary, they are merged, giving two values (for left and right channels) for each time interval, and then are encoded. Upon playback, the binary code is decoded, and the two
  • 20. 14 channels are separated and sent to separate buffers (10:65). This process insures that the two channels do not intermix, and it gives digital audio a channel separation specification equal to its dynamic range. For most digital audio systems, this is equal to or greater than 90 dB (17:623). Crosstalk is also eliminated. Total harmonic distortion. In a digital system, harmonic distortions are produced "only by the electronic circuits for coding and decoding” (28:520). Given the state of modern digital technology, in most digital systems, the total harmonic distortion can be kept equal to or below 0.05% (17:623). Wow and flutter. During the digital-to-analog translation of digital playback, the individual voltage levels are released from the buffer at essentially unvarying time intervals. A crystal oscillator timing mechanism, similar to those used in electronic watches and phase-locked turntables, implements this (8:66). Wow and flutter is virtually eliminated from digital systems by this process. If it does occur in greater than “unmeasurably small” quantities, something is drastically wrong (20:65). Signal-to-noise ratio. The signal-to-noise ratio (S/N ratio) of a system is basically attributable to the mechanical limitations of the recording media. The noise level in a digital recording, likened to tape hiss and surface noise in analog recordings, is at much lower level and consists mainly of granulation noise. There are no mechanical sources of noise in a digital system; so, for a typical digital system, the S/N ratio should be equal to or greater than 90 dB (again, approximating the value of the dynamic range). Dynamic range. The most noticeable difference in the sound quality of digital systems is in the dynamic range captured by digital recordings. In analog recordings, very soft sounds are often buried below the noise level of the storage media (tape hiss or record surface noise), while very loud sounds cannot
  • 21. 15 be recorded faithfully. Loud sounds may saturate the tape or cause the wiggles in record grooves to overrun adjacent grooves (10:63). As a result, music to be stored on records or tape must be "compressed”. Compression makes soft sounds louder, and loud sounds softer. While this enables the entire musical selection to fit onto the tape or record, "It detracts from the lifelike quality of musical reproduction during playback” (4:225). The music on records and tapes usually has a dynamic range of under 65 dB, compared to orchestra music, which has a dynamic range of 85 to 95 dB (10:63) (see Figure 6). Figure 6 Dynamic range of magnetic recording tape is approximately 60 dB, while orchestra music has a range of 85 to 95 dB (10:63). Because the electromagnetic pulses representing the binary code on the magnetic tape (or the dots on the audio discs) are just a representation of the sound, the level that they are recorded at is not important. On tape, a level well above the residual tape hiss and well below the saturation level can be chosen. The level of the pulses remains constant and, thus, eliminates the disadvantages of recording analog music on tape (24:10).
  • 22. 16 The accuracy of digital audio techniques has been summed up by E. Brad Meyer, of Point One Audio: "Digital sound will contain none of the frequency- response errors, distortion, noise, flutter, and so on that are inherent in even the best analog recording and reproducing systems" (16:56). Error Detection Even in the best digital systems, there will be mechanical causes for dropouts or errors in reading the recorded music during playback (2315). These situations, such as a scratch in a digital disc or a tape dropout, can be disastrous in a digital system. With some types of encoding, a dropout causes a burst of noise at the maximum possible output level on the machine. Clearly, some form of error detection, common in computer software packages, is needed. In fact, computer data error detection is entirely applicable to digital audio (16:58). There are numerous error detection systems; however, only two shall be mentioned here. Parity. As stated earlier, binary code is a string of ones and zeroes (bits). The binary code can be divided into groups of bits, which are called "words,” of arbitrary length. Each word represents the level of one sampling of the sound. All ones in a word then are counted. If the total is odd, a one is inserted into the binary code immediately after the last bit in the word; if the total is even, a zero is inserted in the same position. This last number is called the parity bit. When the playback device reads the binary code from tape or disc, it re-counts the ones in each word, comparing the results (odd or even) to the parity bit shown. If these do not match, then an error has been detected. Unfortunately, the parity procedure does not detect an even number of errors within the same word, because such an error leaves the count equal to the parity bit (even + even = even; odd + even = odd). This gives parity a 50%
  • 23. 17 chance of detecting an error. The single-parity-bit method is, at best, a rudimentary means of error detection (2316). Cyclic redundancy check code (CREE). CRCC is a method used by Sony Corporation to provide error detection in their digital systems. It can use any number of parity bits for each word. If the number of parity bits is "n”, then the probability of an error being detected is 1 minus 2-n . If "n" is 16 (as in the Sony PCM-1600), the probability of error detection is 1 minus 2-16 = .999985 (99.9985%). This is an almost perfect error detection capability. (This equation also describes one bit parity 1 minus 2-1 =1 minus .5 = .5). (23:6). Error detection is only the first step in managing errors in digital systems. Once an error has been detected, something must be done about it. Error Concealment To prevent the errors from affecting the quality of the sound, various methods of concealing the error exist. There are three typical methods of error concealment: 1. Muting: The word with the error is set to zero (muted) (23:7).
  • 24. 18 2. Previous word holding: The value of the word before the error is substituted for the error (23:7). 3. Linear interpolation: The average of the two words immediately before and after the error is substituted for the error. This is the most accurate method of error concealment, but it assumes that the words on either side of the error are not themselves erroneous (23:7). Though error concealment is a great improvement over playing music with errors in it, it is possible to go even further in retaining the quality of the music. Error Correction “An error correction code not only detects the code errors but also perfectly corrects them.” To do this, it is necessary to find out where and how the errors have occurred. The code required to do this is very complex. To explain how it works, a simplified description of Sony's Crossword Code is examined in Figure 7 (23:7).
  • 25. 19 Figure 7 This is a simplified description of the Sony Crossword code error correction method (23:7). To correct code errors, it is essential to find out where and how they have occurred. This is done as follows: (a) Data values are given for four samples of music. (b) Numbers are inserted which, when added, total 20 (an arbitrarily chosen number) both horizontally and vertically in each row. (c) The tape, itself, on which is recorded the original data values, plus the additional numbers from each horizontal and vertical column, respectively. (d) Playback data values. (Note that10 is shown, rather than the 12 originally assigned in the first position). (e) Both the top horizontal row and the left vertical row add up to only 18, rather than the 20 originally specified. Therefore, l0 must be an incorrect value. (f) The correct word is reproduced for playback. NOTE: All words and correction codes are expressed in ordinary decimal figures instead of binary codes to facilitate their understanding.
  • 26. 20 Another Sony method, the cross-interleave rectification code (CIRC) can fully correct code errors of up to 4000 bits (for a length of 2.5 mm on a digital audio disc (27:66). This is remarkable, for a scratch that large on a conventional record would not only destroy the musical passage, it would make the record unplayable. Other Advantages There are several more advantages to digital audio; however, all of them are by-products of the digital audio technology behind the specifications and error-resolving techniques. Some of these are: 1. The elimination of print-through on the recordings. This is related to the code being tape-recorded at any chosen level above the tape hiss and below the saturation level of the tape (28:520). 2. The possibility of adding any number of alternate channels. Each channel would still be totally separate from the others. This suggests the possibility of bringing back quadrophonic sound or of recording different instruments or vocals on different channels. A user could turn the vocals off and sing along with no disturbance. Similarly, a musician could blank out his particular instrument on the recording and play along (29:512). This also raises the possibility of recording comments, engineering notes (as to equalization, microphone and speaker placement, etc.), and other non-music information on a separate channel (29:512). 3. The possibility of mass producing tapes and audio discs with definitely superior musical reproduction (as compared to records and tapes currently on the market) (4:184) for a much lower cost than today's recordings (16:56).
  • 27. 21 4. ". . . copies can be made from one digital machine to another without the loss in quality that occurs with conventional recorders . . . an exact duplicate of the signal that emerged from the recording console at the original session" (16:58). This is because the binary code is being recorded -- not the music. PROBLEMS WITH STANDARDIZING THE CONSUMER DIGITAL AUDIO FIELD Recording Formats The standardization of recording formats is the most important issue deciding the future of digital audio. Without standardization, the equipment produced by one manufacturer will not directly be able to play the recordings produced by another. The consumer would have to buy a format converter to do so; and these, currently comprised mostly of integrated circuitry, would cost hundreds of dollars. Digital audio is already a costly addition to a sound system. Having to buy a converter for each format to be played would convince most consumers that digital audio is not worth the cost. Table 2 shows which manufacturers have equipment compatible with other manufacturers. Table 2 (27) Groups of Manufacturers with Compatible Formats 1 2 3 4 U.S. Pioneer Philips Compact JVC Mitsubishi Magnavox Built MCA Sony Compact Philips Sony Long Play
  • 28. 22 The recording format is a combination of sampling rate, quantization, and non-audio information (error correction and synchronization). The only standards set by industry for digital audio are from the electronics industry association of Japan (EIAJ). The EIAJ standard STC-007, labeled "Consumer Use PCM Encoder- Decoder”, sets three standards. First, it sets the number of bits used to represent one sampling (the number of “quantizing bits”) to 14 bits. Second, it sets the sampling rate to 44,056 Hz. Third, though it does not define what type of error detection is required or what the decoding stage of the playback device should do if it encounters an uncorrectable error, the standard requires that some form of error correction be used (21:20). Though these are the first industry standards in digital audio, they apply only to consumer equipment (not professional equipment), and they have little influence over U.S. or other overseas manufacturers. Without standardization of recording formats, it is difficult (at best) to transfer a recording from one format to another (28:520). Each part of the digital format will be examined here to provide the reader with an idea of the scope of the standardization problem. Sampling rate. The sampling rate has been the most widely differing factor of the digital industry. Because most high-fidelity applications require at least a frequency of 20 kHz, most of the sampling frequencies are above 40 kHz. However, because the European Broadcast Union (EBU) and satellite communications networks operate at 32 kHz, some manufacturers are considering this or some other compatible frequency (17:621). Table 3 shows a list of audio industry leaders and their preferred sampling rates.
  • 29. 23 Table 3 (3, 9, 17) Preferred Sampling Rates Organization Sample Rate Remarks Ampex 50 kHz Absolute synchronization with standard video film frame rates Audio Engineering Society (AES) 50.4 kHz High quality studio work 44.1 kHz Other studio applications 32 kHz Broadcasting applications European Broadcasting Union (EBU) 48 kHz Easy conversion to 32 kHz SMPTE Video Study Group 60 kHz Integer samples of audio data for frame JVC 47.25 kHz Audio High Density (AHD) Systems Sony/Philips 44.1 kHz Compact Audio Disc (CAD) Systems Telefunken 48.0 kHz Mini Disk (MD) Systems The hardest part of converting music from one digital format to another is taking "x" samples per second and converting them to "y" samples per second. “If system A and system B are to be directly interfaced digitally, they must both use the same sampling rates" (3:509). If they don't have the same sampling rates, then there must be some simple ratio between the two rates, so a digital rate changer can be used. However, if there is no simple ratio between the rates, then the recording must be converted back to analog, and then sampled at the new rate (3:510). In addition, "if the conversion is from a high sampling frequency to a lower one, a digital anti-aliasing filter must be used to take care of this new, lower, sampling rate.” That is, the higher frequencies that can be recorded with a higher sampling rate may be above the anti-aliasing filter's cutoff for the lower sampling rate, so the filtering must be done digitally (17:620). Every
  • 30. 24 time a switch is made from one sampling frequency to another, it is accompanied by a 4 to 6 dB loss in the S/N ratio (17:620). While differences in sampling rates are having little impact on the consumer digital recording market, they will have a profound effect on the pre- recorded music market, creating a need for expensive translating devices (since all of their circuitry is integrated). While the STC-007 sampling standard is needed, it is not compatible with other rates. A sampling rate of 50 kHz would satisfy all high fidelity needs, as well as mesh with standard video film frame rates. Quantizing bits. Digressing for a moment, a bit can be either a zero or a one, as explained previously. Inside a computer, bits form numbers of base two (binary). In the human number system of base l0 (decimal), each digit has the possibility of being one of ten numbers (0 through 9). A decimal number of "n" digits can represent any one of l0n numbers. (If "n" equals 3, then 1000 (103 ) can be represented by three digits -- 0 through 999). In binary, each digit has two possibilities, so a binary number of "n" bits could represent any one of Zn numbers. For a binary number of 14 bits, there would be 16,384 possible values; and for 16 bits, there would be 65,536 possible values. If a group of 14 to 16 bits is taken as one sample of a music signal, called a word, then the possible values of the binary number represented by the word could be equated to a set of reference voltage levels. This is the analog-to-digital converter's task in a digital recording system and is known as "linear quantization”. The number of bits in a word determines the dynamic range of a digital recording. The dynamic range equals roughly six times the number of bits, giving a theoretical upper limit of about 84 dB dynamic range for 14-bit words and 96 dB dynamic range for 16-bit words (10:64). Other kinds of quantization are available, but none are as easy to implement. A type of quantization called
  • 31. 25 “floating point quantization" can be used to make words smaller, but it decreases a system's S/N ratio (32508). All types of quantization use binary. To convert from a 14-bit word to a 16-bit word, two zeroes are added to the end of the 14-bit number. They are in the least significant places, because that is where the difference in accuracy is. To convert from 16 bits to 14, the last two bits are ignored, because they affect the music the least (see Figure 8) (3:509). No dynamic range is gained from converting 14 bits to 16; however, some dynamic range is lost converting from 16 to 14 bits. Figure 8 Interfacing between 16-bit and 14-bit machines (7:62). Top: Missing bits seen as zeroes. Bottom: Two extra bits ignored.
  • 32. 26 The STC-007 standard calls for 14 bit linear encoding to be used in all digital audio consumer use applications (21:20). However, most of the systems on the market now and planned for the immediate future, use the more accurate 16-bit words. The word size recommended by Sony and Studer is 16 bits (9:624); and the Audio Engineering Society recently recommended 20 to 24 bits (17:621), although the technology available for 20 or more bits is not currently feasible. For the immediate future, 16 bits, with its 90 dB of dynamic range, is the best choice and the one more companies are going to in their products (see Table 4). Table 4 (8, 11, 17, 19, 25, 26) Number of Quantizing Bits Used in Existing and Proposed Systems Manufacturer Model Quantizing Bits Dynamic Range *Technics SV-P100 14 84.3 *Sony PCM-F1 14/16 (switchable) 86/90 Sony DAD-1X 16 >95 Sony/Philips CAD 16 90 JVC AHD 16 90 Hitachi PCMVl00 14 85 Burr-Brown PCM75 (A/D chip) 16 90 *On the market now. Non-audio information. The information encoded into the string of binary code after the analog-to-digital conversion is the non-audio information. It includes the error correction and the synchronization coder. The EIAJ STC-007 standards "provide for various levels of error correction and detection, with
  • 33. 27 each manufacturer able to decide and select the degree and sophistication of the error collection used" (10:87). The standard, however, really doesn't standardize anything; and it does not define what is to happen when an uncorrectable error is encountered (81:20). Table 5 gives a survey of the different error correction methods used by manufacturers in their various models. Table 5 (23) Error Correction Methods Manufacturer Error Correction Methods Ampex Interleave (IL), parity word (PN), erasure (E) Columbia Parity (P), duplicated recording (DP) Hitachi P, Interpolation (IP) Matsushita P, DP Mitsubishi Adjacent code (AC), cyclic redundancy check code (CRCC) Sony DP, IL, crossword code (CC), AC, modified CC, E, P, IP Soundstream CRCC 3M, BBC IL, CRCC, PW, E Teac DP, IL, P Toshiba P, previous word holding Each manufacturer has its own synchronization, but synchronization information, as with error correction information, is relatively easy to; remove from the binary code and is really not a large concern in standardization. The binary code can be run through a short computer program that strips both of these from the words containing the audio samples.
  • 34. 28 Total format. The last item in format standardization is that each manufacturer takes all of the audio and non-audio information and puts it together in the binary code string as he sees fit. As an example, Figure 9 shows the 3M digital format. Conversion from one format to another requires stripping the non-audio information -- an easy job -- and then matching the number of quantizing bits and the sampling rates. Figure 9 The 3M format. 3M uses T6 bit linear quantizing, 50 kHz samp1ing, and 20 kHz anti-aliasing and smoother filters. Data is organized into frames of 400 bits. There are, arranged as shown above: 16 16-bit data words (D) 8 16-bit parity words (P) 1 12-bit frame good/bad check word (CRC) 1 4-bit synchronization pulse (SYNC) Storage/Playback Systems In digital audio, as with analog audio, there are two general methods of storing the music. The first is magnetic tape. As discussed earlier, the binary bits are translated into electromagnetic pulses and recorded in the material of the tape. The second is storing the music on a rotating disc. There are many methods of implementing disc storage in a digital audio system, unlike analog audio, in which records all operate on essentially the same format.
  • 35. 29 Tape. Standardization of digital tape recording formats is not very important, due to the fact that tape recording systems are more commonly used for home recording than for playing back pre-recorded material (4:226). This is supported by the recent addition of digital tape recorders to several companies‘ lines of consumer products. (The recordings from each format are not compatible with the other formats). Disc. Unlike tape, a disc-- whether analog or digital -- cannot be recorded on at home. Once a disc is mastered, nothing can be done to it to add more music. Different disc formats mean that several discs and playback units will not be compatible. Therefore, it is extremely important that storage and playback equipment be standardized. There are three classes of disc playback devices, each with its own type of disc. They are classed as to how the information is transferred from disc. The first type, and closest in principle of operation to the analog record uses piezoelectric detection. The digital information is recorded on a polyvinyl chloride (PVC) disc -- the same material used currently in analog records. The pickup device looks like an analog cartridge. However, the stylus is blunter, for better wear, and the cartridge contains a piezoelectric transducer instead of a magnetic transducer. The digital information is recorded as very small ridges in a groove, much like a conventional record. The ridges cause the stylus to vibrate; and the piezoelectric transducer in the cartridge translates the vibration to an electric signal, which can then be processed digitally. This method, like in analog records, is susceptible to wear, both to the disc and to the stylus (29:5l2). The second class of digital playback devices uses electrostatic detection. This type of device is a spin-off of computer data storage technology. The digitized music is stored by inducing small amounts of static electricity into
  • 36. 30 the disc. The discs are made of a mixture of PVC and carbon. The carbon holds the electrostatic charge. Two types of detection schemes are currently implemented. In the first, the electric charges are aligned within a groove. A blunt device rides in the groove, detecting the charges. 0f course, with the mechanical contact, there is wear to the disc and to the stylus. In the second type of detection, a detector rides over the surface of an ungrooved disc. The information is still recorded in a line that spirals toward the center of the disc but, when the disc is rotating, the detector is kept over the line by an electromechanical tracking device. The detector never touches the surface of the disc, so no wear is induced. More data can be stored on the second type, because the static charges can be confined to a width smaller than any groove needed to guide a pickup device. Computers use this method to access information from their disc drives (29:512). The third class of playback devices uses photoelectric detection. Photoelectric detection essentially consists of shining the light of a small, low-intensity laser onto the disc. The light from the laser is reflected from pits in the disc. These reflections can be decoded into the digital information necessary to rebuild the music. There are two methods of storing the information on a "laser disc.” The first method, now is use for video laser disc applications, uses small pits of varying depth etched into a PVC disc with a high-intensity laser. The playback laser is reflected, with varying degrees of intensity, from the pits. The digital information is decoded from the intensity of the reflections. The second method uses a PVC disc coated with a thin layer of metal. A high-intensity laser etches holes in the layer of metal. The holes vary in length, not depth. When the disc is being read, a low-intensity laser scans the disc, looking for the non-reflecting holes in it (24:12) (see Figure 10). The lengths of the non-reflecting portions can be translated directly into
  • 37. 31 bits. Both of the laser discs are coated in a clear plastic in order to prevent damage to the stored information (29:512). Figure 10 Laser scanning. The surface of the audio disc: (a) Is scanned by a beam of laser light; (b) Focused by a lens; (c) And the focused beam of light (d) Is reflected from the PVC surface and measured by sensors (7:62). There are no current industry standards, or even preferences, for anyone storage/playback system. For a comparison of characteristics favored by industry leaders, see Table 6.
  • 38. 32 Table 6 (5, 7, 11, 29) Storage/Playback Systems: Comparison of Characteristics Favored by Industry Leaders Item AES Digital Audio Technical Committee Telefunken/ Teledec Sony/Philips JVC Channels 2/4 4 2 4 Revolutions / Minute (R/M) Not given 278-695 Not given in R/M 900 Diameter 300 mm max. 135 mm 120 mm 260 mm Playing time 80 minutes (two sides) 120 minutes (two sides) 60 minutes (one side) 120 min Type Not given Piezoelectric Photoelectric Electrostatic SELECTED DIGITAL AUDIO SYSTEMS The following systems represent equipment manufactured by, or to be manufactured by, leading corporations in the digital audio field. Availability of information was the deciding factor in choosing the systems to be reviewed. No information is available on the organization of their digital formats. Technics by Matsushita The Technics SV-P100 Digital Cassette Recorder is the first piece of digital equipment to appear on the consumer market. Introduced in early 1982, it was the first to be reviewed by a major trade magazine (Stereo Review, March 1982). The cost of the SV-P100 is $3,000 (26:46). Craig Start, testing the cassette deck for Stereo Review, commented, ". . . It was the cleanest sound I have ever recorded, and I've recorded a lot. To me, that says a lot about the SV-P100‘s capabilities” (26:46).
  • 39. 33 Technics records on standard video tapes, and its specifications are (26:42): Sample rate: 44,045 Hz Quantization: 14 bit linear Dynamic range: 84.3 dB Distortion: <.01% Wow and flutter: Unmeasurable Signal—to-noise: >80.3 dB Frequency response: 2 Hz — 20 kHz + 0/-2.5 dB JVC JVC's audio high density (AHD) digital audio disc system is not yet on the market, but it is expected within two years. It uses a grooveless electro- tracking capacitance pickup system (11:70). The specifications of the disc and playback system are shown below. The JVC AHD system is unique among the forthcoming audio disc systems in that it has the capacity to playback visual information in the form of still pictures or slow motion. The disc can also be played on JVC s video high density (VHD) video playback system, which is now on the market. Some of the non-audio information will be used as a guide for a music search capability, locating the beginning of each music track. The AHD system provides four channels of information -- three for audio purposes, and one for video. Two of the audio channels will be used for stereo, leaving one free for special effects, album notes, engineering notes, or for any other as-yet- unthought-of purposes (11:68) JVC's ADH disc specifications are (11:70): Disc size: 260 mm (10 inches) Revolutions: 900 (R/M) Playing time: 2 Hours (one per side) Channels: 3 audio (2 stereo; 1 extra); 1 video Quantization: 16 bit linear Sample rate: 47.25 kHz
  • 40. 34 Sony Sony Corporation, being one of the front-runners in the digital audio "revolution" (to turn a phrase), has an entrant both in the tape recorder and the audio disc categories. The PCM-F1 Portable Digital-Audio Adaptor was introduced to the market in early 1982, while the Digital Audio Disc (DAD)-1X is awaiting further standardizations in the digital audio field. PCM-F1. The Sony PCM-F1 Portable Digital-Audio Adaptor turns any NTSC- standard video recorder into a two channel digital audio recorder costs $1,900, assuming it has a video recorder to plug into. If not, a video recorder will cost $800 to $1,200 more (12:2). The PCM-F1 has a switchable quantizer for either 14 or 16 bit linear quantizing, so that the consumer can decide the accuracy he needs, compared with how much music he wants to fit on a tape (25:13). The Sony PCM-F1 Portable Digital-Audio Adaptor specifications are (25:13): Quantization: 14 bit and 16 bit (both linear) Dynamic range: 86 dB/14 bit and 90 dB/16 bit Harmonic distortion: 0.01%/14 bit and 0.007%/16 bit Sampling rate: 44.056 kHz Frequency response: 10-20 kHz ± 0.5 dB Wow and flutter: Unmeasurable Weight: 8.75 lbs. DAD-1X. The DAD-1X Long-Play Digital Audio Disc System uses a photoelectric detection system on PVC discs coated with a reflective layer of metal. The specifications for this system are as follows (8:977): Disc size: 303 mm Playing time: 2.5 hours (single side) Revolutions: 450 R/M Channels: 2 Sampling rate: 44.056 kHz Quantization: 16 bit linear Dynamic range: >95 dB Harmonic distortion: 0.03% Frequency response: 2-20 kHz ± .25 dB Wow and flutter: Undetectable Error detection: Cross interleave
  • 41. 35 Note that both of the tape decks reviewed (the only two currently on the market for consumer use) abide by the EIAJ standards, having been manufactured in Japan. Sony's digital disc system chose the more accurate 16-bit quantization, being strictly a decoding system (only able to play the discs). For an illustration of the working of digital tape decks, see Figure 11. FUTURE OF DIGITAL AUDIO Digital audio is still a young science. The first digital cassette recorders have been on the market for only a few months (26:40), but their effect has been enormous. The digital recorder is now the most accurate component of consumer stereo systems -- so accurate that it has exposed handicaps in the response of most amplifiers and speakers that were not previously a matter of concern. The coming of the digital audio disc, delayed because of technological and standardization problems, is heralded by many as the end of the analog record. But, while the digital disc is superior to conventional records, some argue that the manufacturers have too much money invested in the production of the latter to make them obsolete overnight. Along with this argument, there is the possibility that a selection of non-compatible digital disc systems might turn the consumer away from digital audio. Digital Audio Discs Advantages. The digital audio disc is a vastly superior method of re- producing sound than on conventional records. All of the shortcomings of conventional turntables will be circumvented. There will be no inner-groove distortion, tracking error, tone-arm resonances and record and stylus wear. Digital discs will not suffer nearly as much as analog records from the effects
  • 43. 37 of poor vinyl, bad pressings, dust or any of the other problems associated with the manufacturing and use of the discs (16:56). The digital discs should last indefinitely, as the only wear that they encounter is being taken out of their cover to be put in the player. Another advantage is that digital discs may become cheaper to produce than conventional records (4:184). Disadvantages. When home digital disc playback systems appear on the market, possibly in early 1983 (16:56), they will be handicapped in two areas. One is that the record companies "have invested hundreds of millions of dollars in conventional record-making equipment.” They will not render their equipment obsolete without hard evidence that the digital discs will sell. Therefore, they will not issue a large selection of the digital discs until they are sure that "a large listening audience has the equipment to play them.” The second problem is that "the consumer is not likely to buy a digital disc player until there is a large selection of digital discs to play on it (4:184).” Therein lies the possibility of a stalemate, as when quadraphonic records were introduced. The quadraphonic records could be played on existing equipment, but few records were marketed, so the public became disenchanted with "quad" sound (4:226). There is little chance of this happening, however. Audio companies, such as Sony, have spent millions of dollars on digital 14:54). The situation is more like that of the audio manufacturers abandoning the 78 RPM record for the longer playing 45 and 33 RPM records. The better method -- in this case, the digital disc -- will win out, but there will be a long transition period (4:184). Analog recordings could survive up to twenty years in the presence of digital discs. Support. At the l98l audio fair in Tokyo, about twenty companies dis- played prototype compact digital-audio-disc (CDAD) players, although only a half- dozen or so actually had working models (14:57). Toshiba, Denon and Yamaha have
  • 44. 38 adopted the joint Sony/Philips "compact disc" (CD) format (12:53), perhaps signaling the beginning of informal standardization of the audio disc field. Digital Audio Today With digital recorders already on the market, audiophiles are re-examining previously acceptable performance standards for other components in the chain of audio reproduction. Because of digital audio‘s wide dynamic range, sharp peaks (transients from percussion and other sources) to which analog systems cannot react fast enough for them to record fully, can be recorded. These peaks, requiring up to 100 times the average power of the music to reproduce, will result in clipping by most amplifiers-and, sometimes, may even activate speaker protection circuitry (22:63). Some of the newest amplifiers on the market can handle these very sharp transients, but they are very expensive. However, safe headroom is vital to digital recording. Instantaneous true peak reading meters are necessary, therefore, on digital equipment in order to avoid amplifier overload (2:244). But, though existing equipment may not be ready for digital audio, Robert Berkovitz, director of research for Teledyne Acoustic Research (manufacturers of the AR speaker series) said, "Most of the analog recordings made during the past decade will be perfectly suited to reproduction on home digital playback equipment (once they are converted to a digital format)" (22.61). Though digital accuracy is a fact, there has been concern by some hard core audiophiles that digitally recorded music is unwholesome - that somewhere in the process, detrimental factors creep into the music. At the 1980 Spring Audio Engineering Society Convention in Los Angeles, Dr. John Diamond, a psychiatrist, attempted to prove that digitally recorded music produces stress and fatigue in listeners. Though his experiments were loosely controlled, if controlled at all,
  • 45. 39 they started some serious debate as to what kinds of affects digitizing adds to the music, even if any changes imparted to the music have no audible effects on the human ear (16:59). However, “A comparison test between all the major digital recorders and a professional half-inch-tape, two-track analog machine running at 30 inches per second that was conducted at the Sound Emporium in Nashville last summer confirmed that there do not seem to be any major degradations ascribable to the digital process" (16:59). CONCLUSION Digital audio reproduction is the most significant development in the audio field to occur in at least the last two decades. Toshit Doi, manager of Sony's Digital Audio Project, says, "Before digital audio was introduced, the quality bottleneck of the whole audio chain was in analog tape recorders and analog records.” The weakest points are now the transducers (microphones and loudspeakers) (22:63). Remember, however, that “the accuracy of the digital process depends on the sampling and quantization processes" (1:132). State-of-the-art integrated circuitry can be improved upon, and advances in technology can make digital audio even more accurate than it is now. If the field of digital audio is rigidly standardized, the benefits of these technological advantages could be severely delayed, and there would be no way to get more information out of earlier digital recordings (16:59). Of the possible technological advances, there is one that could make digital tape recorders and audio discs obsolete before they have a chance to gain a large acceptance. Static Systems Corporation, New York, has developed a system called the Sonadisc. It is a microprocessor based audio player that uses a coin-
  • 46. 40 sized disc, containing magnetic bubble memory (a state-of-the-art computer memory technology), to store music. The device has no moving parts, because all of the music is accessed electronically from the bubble memory disc. The memory disc is inserted into a receptacle in the player and currently stores only about a second's worth of music (one megabit's worth). A radical advance in magnetic bubble memory devices would definitely increase the amount of storage available (6:6). If, and when, this happens, it will certainly be a more efficient way to store music. I believe that the EIAJ STC-007 standard is already obsolete. Sixteen- bit systems, sampling at a higher rate than 41 kHz, have dramatically better specifications. Manufacturers realize this, so when the digital audio disc enters the market, it will not be in accordance with the EIAJ standards. I favor using an industrial coalition -- the type Sony and Philips have formed -- to informally standardize the field. This approach assures that technological advances would reach the marketplace faster than under stringent governmental industrial standards. I do believe that a standardization of digital tape deck formats is necessary to accommodate music exchanges between systems without the need for a converter. This will eventually happen; but, for now, just having digital tape decks available for consumer use is a big enough plus. Digital audio recording and playback are the future of audio reproduction, and the sooner they reach maturity, the sooner we can stop losing our heritage of music to the degradations of analog storage media.
  • 47. 41 APPENDIX GLOSSARY OF TERMINOLOGY (Taken from The Complete Guide to Stereo Equipment) Channel separation: Electrical or acoustical difference between left and right channels in stereo. Crosstalk: Signal leakage between two channels. Decibel (dB): Numerical expression of acoustic or electrical ratios, such as relative intensity of sound or relative strength of signal. Distortion: Unwanted noise, or sounds that did not exist in studio when original recording was made. All distortion is undesirable. Dropout: Momentary loss of signal during tape recording or playback caused by tape defects. Dynamic range: Span in volume, expressed in decibels, between loudest and softest sounds. Filter: A circuit which attenuates signals above, below or at a particular frequency. Frequency: Rule of repetition, in cycles per second (cps), or musical pitch and of electrical signals, expressed in Hertz (Hz). Low frequencies refer to bass tones; high frequencies to treble tones. Frequency range: Range from highest to lowest pitch sounds at usable volume level. Frequency response: Ability to reproduce given range of frequencies. Harmonic distortion: Disturbs original relationship between naturally related tones and is expressed in percentages. Head room: Margin between maximum levels indicated on recorder's volume indicator and actual level of severe tape overload. Hertz: Name given to number of vibrations or cycles per second in electrical signal of alternating current. Abbreviated Hz, with capital "H"; but when spelled out, only lower-case letters are employed. The name derives from Heinrich Hertz, early electrical scientist. Integrated circuit (IC): Combination of transistors, diodes and resistors assembled in pre-packaged circuit capable of providing high gain, low distortion, and easily controllable performance in extremely miniaturized form. Kilohertz (kHz): 1,000 hertz.
  • 48. 42 Megahertz (MHZ): 1,000,000 hertz Noise: Extraneous sound or signal that intrudes into original as result of environmental noise, distortion, hum, or defective parts and tubes in equipment. Saturation: Condition whereby tape has reached its maximum degree of magnetization. Separation: Degree to which stereo signals are kept apart in passing through reproducing systems. Signal: Electrical replica of actual sound; in broadcasting, carrier wave itself. Signal-to-noise ratio: Often abbreviated “S/N ratio"; relative amount of signal to undesired and extraneous noises in any device or its output. Transducer: Device for changing one form of energy into another. Microphone and phone pickup are sound-to-electrical energy transducers. Transient: Heavy burst of music, causing the amplitude of the music signal to increase sharply and then subside. Wow and flutter: Fluctuations in speed of turntable or tape transport that cause pitch distortions; wow refers to slow, repeated fluctuations; flutter to short, rapid fluctuations.
  • 49. 43 BIBLIOGRAPHY 1. Bayliff, R. W. "Digital techniques and the gramophone record.” Gramophone. vo1. 57, no. 673, June 1979, pp. 131-2, 135. 2. Burkowitz, P. K. "User‘s note on digital audio standard code and procedures, operational requirements.” Journal of the Audio Engineering Society. Apri1 1978, PP. 242, 244, 246. 3. Busby, E. S. "Digital audio recording on videotape: Some choices." SMPTE Journal. vol. 89, no. 7, July 1980, pp. 508-12. 4. The Complete Guide to Stereo Equipment. New York: Simon and Schuster Publishers, 1979. 5. Digital Audio Technical Committee. “Digital audio disk standardization Conference.” Journal of the Audio Engineering Society. July/August 1980, pp. 525-27. 6. "Digital music at home.” Computer Music Journal. vol. 5, no. 4, 1981, p. 6. 7. Doi, T. T. “General information on a compact digital audio disk.” Journal of the Audio Engineering Society. January/February 1981, pp. 60, 62. 8. Doi, T. T., et al. "A long play digital audio disk system.” Journal of the Audio Engineering Society. December 1979, pp. 975-81. 9. Doi, T. T. and Roger Lagadex. “An outline of the format jointly supported by Sony and Studer for stationary-head digital audio recording.” Journal of the Audio Engineering Society. September 1980, p. 624. 10. Feldman, L. "Digital audio for the 1980's." Radio Electron. June 1980, pp. 63-5, 87. 11. Hidaka, T. "JVC AHD digital audio disk system." Journal of the Audio Engineering Society. January/February 1981, pp. 68, 70. 12. Illinois Audio Catalog. Chicago, Illinois Audio, 1982. 13. “JVC digital audio products.” Computer Music Journal. vol. 5, no. 4, 1981, pp. 89-90. 14. Klein, Larry. “Audio fair in Tokyo (Oct. ‘81).” Stereo Review. February 1982, pp. 56-60. 15. McCracken, J. A. “A high-performance digital audio recorder.” Journal of the Audio Engineering Society. July/August 1981, p. 560. 16. Meyer, E. Brad. "Digital.” Stereo Review. Apri1 1982, pp. 56-9.
  • 50. 44 17. "Minutes of the meeting of the Digital Audio Technical Committee." Journal of the Audio Engineering Society. September 1981, pp. 620-24. 18. "New audio products at Las Vegas CES.” Stereo Review. May 1982, pp. 53-6. 19. PCM75: Designed for Audio. Tucson, Burr-Brown Research Corporation, July 1981. 20. Ranada, David. “Digital audio: A primer.” Stereo Review. February 1981, pp. 63-5. 21. Ranada, David. “Good news: Digital recording standards." Stereo Review. March 1980, p. 20. 22. Ranada, David. "Superdiscs.” Stereo Review. December 1980, pp. 61-4, 67. 23. Sony‘s Digital Audio Technology - QA. Japan, Sony Corporation, nd. 24. Sony's Digital Audio - The Next Step. Japan, Sony Corporation, May 1981. 25. “Sony‘s small, portable digital audio adaptor." Stereo Review. May 1982, p. 13. 26. Stark, Craig. "Equipment test reports: Technics SV-P100 digital cassette recorder." Stereo Review. March 1982, pp .40, 42, 46. 27. Traiman, Steve. “Digital decade: The audio/visual future.” Stereo Review. January 1980, pp. 69-72. 28. Weisser, A. “The digital recording of sound in broadcasting.” SMPTE Journal. July 1980, pp. 520-4. 29. Welland, K. and H. Redlich. “The MD (Mini Disk) system: A contribution to the digital audio disk standard.” Journal of the Audio Engineering Society. July/August 1980, pp. 510-14.