View our poster presentation! - clear

Download Report

Transcript View our poster presentation! - clear

AUDIOFILES
Harika Basana ([email protected] ), Elizabeth Chan ([email protected] ), Nikolai Sinkov([email protected] ), Frank Zhang ([email protected] ) 6100 Main Street, Rice University, Houston, Texas 77005
GOAL
To explore the MP3 technology and
to implement various audio data compression
algorithms.
Analyze This
 Audio compression is to compress an audio
Ding.wav before compression
Ding.wav with frequencies within 1 std
from the mean
Original signal sampled at 44100Hz
After linear quantization
The x-axis DT sample and the y-axis is the amplitude
file into a smaller-sized file.
 People cannot differentiate between these
two files by just hearing.
 Due to its smaller size, the new file can be easily
transferred via the Internet.
 People try to find better audio compression
algorithms that retain satisfying audio quality.
Ding.wav with frequencies within 2 std
from the mean
Ding.wav with frequencies within 3 std
from the mean
Psycho Acoustic Algorithm
Algorithms
Linear, tangent or arctangent quantization of the
signal.
Average Energy Algorithm
Zeroes out selected high and low frequencies of
Procedure
 Perform the Discrete Cosine Transform (DCT)
the audio file.
After tangent quantization
After arctangent quantization
Masking Algorithm
The presence of a signal at a particular frequency can
raise the perceptual threshold of signals close to the
the masking frequency.
Procedure
 Go through every sample and remove the following
samples if they are below a certain threshold.
 Quantize the signal in one of the following ways :
Procedure
 Perform the Discrete Cosine Transform (DCT).
Diagram of the quantization “buckets” for the three
methods
Results
 No significant improvement. Need a better way of
implementing to get good results.
 Calculate the signal’s energy.
 Find the mean and the standard deviation of
from the energy spectrum.
Conclusion
 Keep all frequencies with energies within 1
standard deviation (std) from the mean.
 Used the underlying concepts.
 Zero out frequencies with energies outside this range.
 Similarly, keep frequencies with energies within 2
and 3 stds from the mean.
 We didn’t create MP3 files.
 Produced much smaller files.
 Give certain frequency bands more bits
(1000 – 5100 Hz and 12500 - 15200Hz).
 Throw away frequencies below 20Hz and above
20,000Hz.
 Perform the Inverse DCT and get the output.
 Perform the Inverse DCT.
Results
 Amount of compression is insignificant.
 Algorithm would probably work better if the signal
is very short, has monotonous tones, and has little
noise.
Results
 Compression is very significant.
 Quality is good for the amount of compression.
 Arctangent quantization yields the best quality.
 Psycho Acoustic Algorithm is the best, in terms of
- amount of compression
- sound quality of the output.
Improvements
 Implement windowing
 Implement temporal masking
Bibliography:
http://www.sospubs.co.uk/sos/may00/articles/mp3.htm
http://www.besar.dcs.gla.ac.uk/labs/audiolab/real_site/
tutorials/mp3/mp3how.php
and more…