SlideShare ist ein Scribd-Unternehmen logo
1 von 48
Sound: Audio and Music




B.Sc. (Hons) Multimedia Computing      Media Technologies
Agenda

         History and overview of sound systems
         Digitised Sound
         Sound Formats
         Sound for Multimedia
         Software: Sound Editors




B.Sc. (Hons) Multimedia Computing                 Media Technologies
Edison Cylinder record
Earliest recorded sounds
   Thomas Edison’s      Au Clair de la
    voice (1890’s)        Lune
                         Vocal Scale
                      (Made by Scott, 1860)
Kymograph recording
How does it work?




A diagonal movement provides either the LEFT or the RIGHT channel.

A left-right movement provides the SUM of the L and R channels.

An up-down movement provides the DIFFERENCE between the L and R channels.
Compact Disc
What’s in a CD?

         A. A polycarbonate disc layer has the data
            encoded by using ‘pits’.

         C. A reflective layer reflects the laser back.

         E. A lacquer layer is used to prevent oxidation

         G. Artwork is screen printed on the top of the
            disc.

         I. A laser beam reads the polycarbonate disc, is
            reflected back, and read by the player.
Encoding format on an audio CD (1)                  (Courtesy Wikipedia)



   The smallest entity in a CD is called a frame.
   A frame consists of 33 bytes and contains six complete 16-bit
    stereo samples (2 bytes × 2 channels × six samples: equals 24
    bytes). The other nine bytes consist of eight Cross-Interleaved
    Reed-Solomon Coding (CIRC) error correction bytes and one
    subcode byte, used for control and display.
   Each byte is translated into a 14-bit word using Eight-to-
    Fourteen Modulation, which alternates with 3-bit merging
    words. In total there are 33 × (14 + 3) = 561 bits.
   A 27-bit unique synchronization word is added, so that the
    number of bits in a frame totals 588 (of which only 192 bits are
    music).
Encoding format on an audio CD (2)                  (Courtesy Wikipedia)




   These 588-bit frames are in turn grouped into sectors.
   Each sector contains 98 frames, totaling 98 × 24 = 2352 bytes
    of music.
   The CD is played at a speed of 75 sectors per second, which
    results in 176,400 bytes per second.
   Divided by 2 channels and 2 bytes per sample, this results in a
    sample rate of 44,100 samples per second.
Announced January 27th 2010…
Audio vs. Music

          Computer sounds can be digital (e.g. .mp3) or
           synthesised (e.g. MIDI)
          Digital sound is referred to as Audio
          Synthesised sound is referred to as Music
          Digital sounds are recordings of real sounds
          Synthesized sounds are programmed
           reproductions of sounds based on algorithms
           and hardware tone generators.


B.Sc. (Hons) Multimedia Computing                          Media Technologies
•Digitized Sound
          Digital sound involves sampling, which means
           the encoding of data in the form of ones and
           zeros.
          The computer converts an analogue data
           source to a digital data stream with an analog-
           to-digital converter.
          These converters are devices that exist on the
           computers sound cards/modules, and are
           controlled by the software you use to process
           the sound, SoundForge on the PC, Sound Edit
           on the MAC, for example.

B.Sc. (Hons) Multimedia Computing                        Media Technologies
What is the optimum sample rate for an
audio / video channel?

The sample rate for the transmission of data through a
noisy channel of restricted bandwidth was established by
Shannon and Weaver in 1948…

                                            Signal of amplitude 21



                                            Plus noise of amplitude 3



                                            Results in a noisy
                                            signal of amplitude 24.
How fast must we sample to accurately
represent the signal?
Sample rate and resolution?
   There is no point in sampling      There is no point in sampling
    at a frequency higher than          at a resolution greater than
    twice the bandwidth, that is,       the maximum error – in this
    at 2 * W samples per                case, the error is the noise,
    second. where W is the              which is 3 units in 24 total
    bandwidth of the channel.           units.
   Bandwidth could be thought         This gives us a resolution
    of as the frequency response        (R)of 3 in 24 == 1 in 8 == 3
    of the channel.                     bits.
                                       So R=3=log2 Signal / Noise
                                        (log2 of 8 is 3)
Nearly there…
   Let’s call the sample rate R
    (Bits / Sec)…
   Since there are 2 * W
    samples per second,
   R = 2 * W * log2 (S+N)/N
   However, it’s more useful to
    measure power rather than
    amplitude (cables restrict
    power, not amplitude), and
   Power is proportional to the
    square of the amplitude,
    so…     R = sqrt(2 * W * log2 (S+N)/N)
We’re there…



  R = sqrt(2 * W * log2 (S+N)/N)
An example…
   Let’s say we want to transmit human speech at a
    frequency range of 5Khz and a distortion (error) rate of
    0.1% (i.e. 1 in 1000)
   R = 2 * 5,000 * log2 (1000+1)/1 = 50,000 bits/sec
   If we can tolerate an error of 4% (1000:4), then:
   R = 40,000 bits / sec.
   So, a small increase in error (distortion) rate has
    allowed a 20% reduction in the bandwidth of the
    channel (hence cost).
Synthesized Sound and MIDI

         MIDI = Musical Instrument Digital Interface
         Synthesized sound isn't digitally recorded; its a
          mathematical reproduction of a sound based
          on a description. Synthesizers use hardware
          and algorithms to generate sounds on-the-fly
          from a description of the desired sound.




B.Sc. (Hons) Multimedia Computing                             Media Technologies
Sound Formats
         Audio
            .WAV (Developed by IBM and Microsoft):

             Uncompressed digital samples.
            .AU

            .AIFF (Audio Interchange File Format)

            .MP3 (MPEG)

            .SND (Mac)

         Music
            .MID (Musical Instrument Digital Interface)

            and other proprietary formats

B.Sc. (Hons) Multimedia Computing                          Media Technologies
.mp3         (courtesy Wikipedia)




   The use in MP3 of a lossy compression algorithm is
    designed to greatly reduce the amount of data required
    to represent the audio recording and still sound like a
    faithful reproduction of the original uncompressed
    audio for most listeners.
   An MP3 file that is created using the setting of 128
    kbits/s will result in a file that is about 1/11th the size of
    the CD file created from the original audio source. An
    MP3 file can also be constructed at higher or lower bit
    rates, with higher or lower resulting quality.
.mp3 (contd.)
   The compression works by reducing the accuracy of
    certain parts of sound that are deemed beyond the
    auditory resolution ability of most people.
   This method is commonly referred to as perceptual
    coding. It internally provides a representation of sound
    within a short-term time/frequency analysis window, by
    using psychoacoustic models to discard or reduce the
    precision of components less audible to human hearing,
    and recording the remaining information in an efficient
    manner (think about a handclap in a quiet room as
    oppose to in a noisy street –it’s masked by the noise).
Normal hearing thresholds…
Masked hearing thresholds…
Finally…
   Filters are used to split a signal into 32 bands, and a
    masking level for each band is computed.
   Signals that fall below the threshold can be
    discarded.
   Further compression is done, including resampling
    and lowering the bit rate of parts that don’t need full
    precision.
.mp3 file format
Sound for Multimedia
     Generate sounds for multimedia scenarios
     including:
      Sound effects

      Ambient sound for mood and effect

      Virtual Juke Box

      Voice-overs




B.Sc. (Hons) Multimedia Computing               Media Technologies
Software: Sound Editors
            Sound Editors - edit waveforms of
             digitised sounds
                  Trim
                  Apply effects - e.g. reverb, echo,
                   equalisation (EQ)
                  Insert silence
                  Fade-in, fade-out
                  Combine sounds
                  Adjust / Normalise volume (gain)

B.Sc. (Hons) Multimedia Computing                       Media Technologies
Software: Sound Sequencers
              Synthesized sounds played back via pre-
               recorded or live MIDI note event data -
               velocity, pitch, etc.
              Sound cards may also have prerecorded digital
               samples stored on them.
              These 'samples' can then be played by
               sequencing software that will play a number of
               tracks simultaneously in an ensemble.
              Software such as Cakewalk, Steinberg's
               Cubase, and Logic Pro.


B.Sc. (Hons) Multimedia Computing                        Media Technologies
B.Sc. (Hons) Multimedia Computing   Media Technologies
B.Sc. (Hons) Multimedia Computing   Media Technologies
Hardware: Creative X-Fi
•24-bit Analog-to-Digital conversion of analog inputs at 96kHz sample rate

•24-bit Digital-to-Analog conversion of digital sources at 96kHz to analog 7.1
speaker output

•24-bit Digital-to-Analog conversion of stereo digital sources at 192kHz to stereo
output

•ASIO (Audio Stream Input / Output) 2.0 support at 16-bit/44.1kHz, 16-bit/48kHz,
24-bit/44.1kHz 24-bit/48kHz and 24-bit/96kHz with direct monitoring

•16-bit to 24-bit recording sampling rates: 8, 11.025, 16, 22.05, 24, 32, 44.1, 48
and 96kHz

•Digital SPDIF interface support with 24bit/96kHz quality format

•Signal-to-Noise ratio: 109dB

•Total Harmonic Distortion + Noise at 1kHz = 0.004%
X-Fi board
X-Fi control panel
X-Fi software control panel
Please read…
   Chapman & Chapman Chapter 8 (in
    your bundle if you have it)

Weitere ähnliche Inhalte

Was ist angesagt?

Digital Audio & Signal Processing (Elad Gariany)
Digital Audio & Signal Processing (Elad Gariany)Digital Audio & Signal Processing (Elad Gariany)
Digital Audio & Signal Processing (Elad Gariany)
Ron Reiter
 
Chapter 02 audio recording - part i
Chapter 02   audio recording - part iChapter 02   audio recording - part i
Chapter 02 audio recording - part i
Nazihah Ahwan
 
Audio compression
Audio compressionAudio compression
Audio compression
Sahil Garg
 
multimedia chapter1
multimedia chapter1multimedia chapter1
multimedia chapter1
nes
 
Digitization of Audio.ppt
Digitization of Audio.pptDigitization of Audio.ppt
Digitization of Audio.ppt
Videoguy
 
Digital technology
Digital technologyDigital technology
Digital technology
Paula Mills
 
Digital Audio
Digital  AudioDigital  Audio
Digital Audio
surprisem
 

Was ist angesagt? (20)

Audio compression
Audio compressionAudio compression
Audio compression
 
Digital Audio & Signal Processing (Elad Gariany)
Digital Audio & Signal Processing (Elad Gariany)Digital Audio & Signal Processing (Elad Gariany)
Digital Audio & Signal Processing (Elad Gariany)
 
Chap65
Chap65Chap65
Chap65
 
MPEG/Audio Compression
MPEG/Audio CompressionMPEG/Audio Compression
MPEG/Audio Compression
 
3 Digital Audio
3 Digital Audio3 Digital Audio
3 Digital Audio
 
Digaudio
DigaudioDigaudio
Digaudio
 
Audio compression
Audio compression Audio compression
Audio compression
 
Audio compression 1
Audio compression 1Audio compression 1
Audio compression 1
 
Chapter 02 audio recording - part i
Chapter 02   audio recording - part iChapter 02   audio recording - part i
Chapter 02 audio recording - part i
 
Audio compression
Audio compressionAudio compression
Audio compression
 
Sampling rate bit depth
Sampling rate bit depthSampling rate bit depth
Sampling rate bit depth
 
Multimedia seminar ppt
Multimedia seminar pptMultimedia seminar ppt
Multimedia seminar ppt
 
Multimedia elements
Multimedia elementsMultimedia elements
Multimedia elements
 
Sample rate
Sample rateSample rate
Sample rate
 
Audio and Video Compression
Audio and Video CompressionAudio and Video Compression
Audio and Video Compression
 
multimedia chapter1
multimedia chapter1multimedia chapter1
multimedia chapter1
 
Ch05
Ch05Ch05
Ch05
 
Digitization of Audio.ppt
Digitization of Audio.pptDigitization of Audio.ppt
Digitization of Audio.ppt
 
Digital technology
Digital technologyDigital technology
Digital technology
 
Digital Audio
Digital  AudioDigital  Audio
Digital Audio
 

Ähnlich wie Soundpres

Ig2 task 1 no2
Ig2 task 1 no2Ig2 task 1 no2
Ig2 task 1 no2
Kenyon101
 
Analog to digital conversion
Analog to digital conversionAnalog to digital conversion
Analog to digital conversion
Firman Bachtiar
 
Ig2 task 1 work sheet
Ig2 task 1 work sheetIg2 task 1 work sheet
Ig2 task 1 work sheet
JoshCollege
 

Ähnlich wie Soundpres (20)

Sound.pptx
Sound.pptxSound.pptx
Sound.pptx
 
M1L1-2.ppt
M1L1-2.pptM1L1-2.ppt
M1L1-2.ppt
 
Sound recording glossary preivious
Sound recording glossary preiviousSound recording glossary preivious
Sound recording glossary preivious
 
Digital Audio in Multimedia
Digital Audio in MultimediaDigital Audio in Multimedia
Digital Audio in Multimedia
 
Multimedia and-system-design-sound-images by zubair yaseen& yameen shakir
Multimedia and-system-design-sound-images by zubair yaseen& yameen shakirMultimedia and-system-design-sound-images by zubair yaseen& yameen shakir
Multimedia and-system-design-sound-images by zubair yaseen& yameen shakir
 
MultiMedia-Lecture-4.pptx
MultiMedia-Lecture-4.pptxMultiMedia-Lecture-4.pptx
MultiMedia-Lecture-4.pptx
 
Optical recording and reproduction
Optical recording and reproductionOptical recording and reproduction
Optical recording and reproduction
 
Sound recording glossary
Sound recording glossarySound recording glossary
Sound recording glossary
 
Digital audio
Digital audioDigital audio
Digital audio
 
Ig2 task 1
Ig2 task 1Ig2 task 1
Ig2 task 1
 
add9.5.ppt
add9.5.pptadd9.5.ppt
add9.5.ppt
 
Audio-1
Audio-1Audio-1
Audio-1
 
Audio
AudioAudio
Audio
 
Ig2 task 1 no2
Ig2 task 1 no2Ig2 task 1 no2
Ig2 task 1 no2
 
Analog to digital conversion
Analog to digital conversionAnalog to digital conversion
Analog to digital conversion
 
Sound recording glossary
Sound recording glossarySound recording glossary
Sound recording glossary
 
Compression
CompressionCompression
Compression
 
AUDIO VOICE RECORDER AND TRANSMISSION THROUGH A WIRELESS MEDIUM BETWEEN PC AN...
AUDIO VOICE RECORDER AND TRANSMISSION THROUGH A WIRELESS MEDIUM BETWEEN PC AN...AUDIO VOICE RECORDER AND TRANSMISSION THROUGH A WIRELESS MEDIUM BETWEEN PC AN...
AUDIO VOICE RECORDER AND TRANSMISSION THROUGH A WIRELESS MEDIUM BETWEEN PC AN...
 
Ig2 task 1 work sheet
Ig2 task 1 work sheetIg2 task 1 work sheet
Ig2 task 1 work sheet
 
Audio Mastering
Audio MasteringAudio Mastering
Audio Mastering
 

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 

Soundpres

  • 1. Sound: Audio and Music B.Sc. (Hons) Multimedia Computing Media Technologies
  • 2. Agenda  History and overview of sound systems  Digitised Sound  Sound Formats  Sound for Multimedia  Software: Sound Editors B.Sc. (Hons) Multimedia Computing Media Technologies
  • 4. Earliest recorded sounds  Thomas Edison’s  Au Clair de la voice (1890’s) Lune  Vocal Scale (Made by Scott, 1860)
  • 6.
  • 7.
  • 8.
  • 9.
  • 10. How does it work? A diagonal movement provides either the LEFT or the RIGHT channel. A left-right movement provides the SUM of the L and R channels. An up-down movement provides the DIFFERENCE between the L and R channels.
  • 11.
  • 12.
  • 13.
  • 14.
  • 16. What’s in a CD? A. A polycarbonate disc layer has the data encoded by using ‘pits’. C. A reflective layer reflects the laser back. E. A lacquer layer is used to prevent oxidation G. Artwork is screen printed on the top of the disc. I. A laser beam reads the polycarbonate disc, is reflected back, and read by the player.
  • 17. Encoding format on an audio CD (1) (Courtesy Wikipedia)  The smallest entity in a CD is called a frame.  A frame consists of 33 bytes and contains six complete 16-bit stereo samples (2 bytes × 2 channels × six samples: equals 24 bytes). The other nine bytes consist of eight Cross-Interleaved Reed-Solomon Coding (CIRC) error correction bytes and one subcode byte, used for control and display.  Each byte is translated into a 14-bit word using Eight-to- Fourteen Modulation, which alternates with 3-bit merging words. In total there are 33 × (14 + 3) = 561 bits.  A 27-bit unique synchronization word is added, so that the number of bits in a frame totals 588 (of which only 192 bits are music).
  • 18. Encoding format on an audio CD (2) (Courtesy Wikipedia)  These 588-bit frames are in turn grouped into sectors.  Each sector contains 98 frames, totaling 98 × 24 = 2352 bytes of music.  The CD is played at a speed of 75 sectors per second, which results in 176,400 bytes per second.  Divided by 2 channels and 2 bytes per sample, this results in a sample rate of 44,100 samples per second.
  • 19.
  • 20.
  • 21.
  • 23. Audio vs. Music  Computer sounds can be digital (e.g. .mp3) or synthesised (e.g. MIDI)  Digital sound is referred to as Audio  Synthesised sound is referred to as Music  Digital sounds are recordings of real sounds  Synthesized sounds are programmed reproductions of sounds based on algorithms and hardware tone generators. B.Sc. (Hons) Multimedia Computing Media Technologies
  • 24. •Digitized Sound  Digital sound involves sampling, which means the encoding of data in the form of ones and zeros.  The computer converts an analogue data source to a digital data stream with an analog- to-digital converter.  These converters are devices that exist on the computers sound cards/modules, and are controlled by the software you use to process the sound, SoundForge on the PC, Sound Edit on the MAC, for example. B.Sc. (Hons) Multimedia Computing Media Technologies
  • 25. What is the optimum sample rate for an audio / video channel? The sample rate for the transmission of data through a noisy channel of restricted bandwidth was established by Shannon and Weaver in 1948… Signal of amplitude 21 Plus noise of amplitude 3 Results in a noisy signal of amplitude 24.
  • 26. How fast must we sample to accurately represent the signal?
  • 27. Sample rate and resolution?  There is no point in sampling  There is no point in sampling at a frequency higher than at a resolution greater than twice the bandwidth, that is, the maximum error – in this at 2 * W samples per case, the error is the noise, second. where W is the which is 3 units in 24 total bandwidth of the channel. units.  Bandwidth could be thought  This gives us a resolution of as the frequency response (R)of 3 in 24 == 1 in 8 == 3 of the channel. bits.  So R=3=log2 Signal / Noise (log2 of 8 is 3)
  • 28. Nearly there…  Let’s call the sample rate R (Bits / Sec)…  Since there are 2 * W samples per second,  R = 2 * W * log2 (S+N)/N  However, it’s more useful to measure power rather than amplitude (cables restrict power, not amplitude), and  Power is proportional to the square of the amplitude, so… R = sqrt(2 * W * log2 (S+N)/N)
  • 29. We’re there… R = sqrt(2 * W * log2 (S+N)/N)
  • 30. An example…  Let’s say we want to transmit human speech at a frequency range of 5Khz and a distortion (error) rate of 0.1% (i.e. 1 in 1000)  R = 2 * 5,000 * log2 (1000+1)/1 = 50,000 bits/sec  If we can tolerate an error of 4% (1000:4), then:  R = 40,000 bits / sec.  So, a small increase in error (distortion) rate has allowed a 20% reduction in the bandwidth of the channel (hence cost).
  • 31. Synthesized Sound and MIDI  MIDI = Musical Instrument Digital Interface  Synthesized sound isn't digitally recorded; its a mathematical reproduction of a sound based on a description. Synthesizers use hardware and algorithms to generate sounds on-the-fly from a description of the desired sound. B.Sc. (Hons) Multimedia Computing Media Technologies
  • 32. Sound Formats  Audio  .WAV (Developed by IBM and Microsoft): Uncompressed digital samples.  .AU  .AIFF (Audio Interchange File Format)  .MP3 (MPEG)  .SND (Mac)  Music  .MID (Musical Instrument Digital Interface)  and other proprietary formats B.Sc. (Hons) Multimedia Computing Media Technologies
  • 33. .mp3 (courtesy Wikipedia)  The use in MP3 of a lossy compression algorithm is designed to greatly reduce the amount of data required to represent the audio recording and still sound like a faithful reproduction of the original uncompressed audio for most listeners.  An MP3 file that is created using the setting of 128 kbits/s will result in a file that is about 1/11th the size of the CD file created from the original audio source. An MP3 file can also be constructed at higher or lower bit rates, with higher or lower resulting quality.
  • 34. .mp3 (contd.)  The compression works by reducing the accuracy of certain parts of sound that are deemed beyond the auditory resolution ability of most people.  This method is commonly referred to as perceptual coding. It internally provides a representation of sound within a short-term time/frequency analysis window, by using psychoacoustic models to discard or reduce the precision of components less audible to human hearing, and recording the remaining information in an efficient manner (think about a handclap in a quiet room as oppose to in a noisy street –it’s masked by the noise).
  • 37. Finally…  Filters are used to split a signal into 32 bands, and a masking level for each band is computed.  Signals that fall below the threshold can be discarded.  Further compression is done, including resampling and lowering the bit rate of parts that don’t need full precision.
  • 39. Sound for Multimedia Generate sounds for multimedia scenarios including:  Sound effects  Ambient sound for mood and effect  Virtual Juke Box  Voice-overs B.Sc. (Hons) Multimedia Computing Media Technologies
  • 40. Software: Sound Editors  Sound Editors - edit waveforms of digitised sounds  Trim  Apply effects - e.g. reverb, echo, equalisation (EQ)  Insert silence  Fade-in, fade-out  Combine sounds  Adjust / Normalise volume (gain) B.Sc. (Hons) Multimedia Computing Media Technologies
  • 41. Software: Sound Sequencers  Synthesized sounds played back via pre- recorded or live MIDI note event data - velocity, pitch, etc.  Sound cards may also have prerecorded digital samples stored on them.  These 'samples' can then be played by sequencing software that will play a number of tracks simultaneously in an ensemble.  Software such as Cakewalk, Steinberg's Cubase, and Logic Pro. B.Sc. (Hons) Multimedia Computing Media Technologies
  • 42. B.Sc. (Hons) Multimedia Computing Media Technologies
  • 43. B.Sc. (Hons) Multimedia Computing Media Technologies
  • 44. Hardware: Creative X-Fi •24-bit Analog-to-Digital conversion of analog inputs at 96kHz sample rate •24-bit Digital-to-Analog conversion of digital sources at 96kHz to analog 7.1 speaker output •24-bit Digital-to-Analog conversion of stereo digital sources at 192kHz to stereo output •ASIO (Audio Stream Input / Output) 2.0 support at 16-bit/44.1kHz, 16-bit/48kHz, 24-bit/44.1kHz 24-bit/48kHz and 24-bit/96kHz with direct monitoring •16-bit to 24-bit recording sampling rates: 8, 11.025, 16, 22.05, 24, 32, 44.1, 48 and 96kHz •Digital SPDIF interface support with 24bit/96kHz quality format •Signal-to-Noise ratio: 109dB •Total Harmonic Distortion + Noise at 1kHz = 0.004%
  • 48. Please read…  Chapman & Chapman Chapter 8 (in your bundle if you have it)