CONCEALMENT2.PPT
Download
Report
Transcript CONCEALMENT2.PPT
From Error Control to Error
Concealment
Dr Farokh Marvasti
Multimedia Lab
King’s College London
Outline
Philosophical Issues
Previous Work on Forward Error
Control/Concealment
Error Concealment in the Source
Encoded Coefficient Domain
Error Concealment in the Time
(Speech) or Space (Image & Video)
Domain
Suggestions for UMTS and Future Work
Philosophical Issues
•
•
•
•
The Better the Source Encoder, the Worse
the Error Concealment (EC)
Trade off Between FEC and EC: The More
Powerful the FEC, the Less Frequent the
EC
Measure of Performance: 1-FEC-Prob of
Error or SNR. 2-EC- Subjective Evaluation
FEC is a science and EC is an art!
C Shannon
•
“However, any redundancy in the
source will usually help if it is utilized
at the receiving point. In particular, if
the source has already redundancy and
no attempt is made to eliminate it in
matching to the channel, this
redundancy will help combat noise”
J Hagenauer
•
•
“We claim that whenever some form of
concealment of the decoded source signal is
applicable, which means that some
redundancy is left in the source signal, “The
Source-Controlled Channel Decoding” can
also be applied. It is better to avoid errors
rather than to conceal them.”
“One can use more sophisticated error
concealment by using the channel decoder
soft output…i.e., hard decision and its
Previous Work: Vary, et al
•
•
MAP Estimation: C= Max P(Ci |C
recived
Min-MSE:
i
i
C P C
givenCreceived
CMS
i
hard decision
Ch
) over all i
A Posteriori
soft output:
Reliability
P(Ci|Crec)
Estimator
Cestimate
My Previous Work in FEC/EC
FEC Using GF of Real Numbers
Oversampling and Nonuniform
Sampling Th for Error Correction
Works for Erasure Channels and
Impulsive Channels
Extension to Error Concealment
Brief Explanation and Examples of
Multimedia Signals
Summary of the EC Work at
Lucent
Image & Video EC: Presentation by Two
of my Ph.D... Students on July 23rd at
1:30pm
1- EC for compressed Video (H.263 &
MPEG 4) by Hasan
2- Robust Video Codecs for UMTS by
Nick
EC for FR GSM
Direct FFT Approach does not work too
well for prediction. However, it does
show correlation of coefficients within
frames
The Best Linear Prediction is derived
from Yule Walker equations… (Alexis)
Ad-Hoc Methods: averaging the
previous and the future frame
coefficients works better than simple
substitution
EC for FR GSM
Combining the previous N frame and
the future M Frame Coefficients with a
weight factor works best
Mathcad Examples:
Comparison of Yule-Walker with
Substitution
p 0
p 5
coefficient
33
n 3
coefficient
22
n 2
coefficient
2
n 3
coefficient
2
n 2
coefficient
27
n 1
coefficient
13
n 1
yulewakern 26.977
yulewakern 10.563
p 4
p 15
coefficient
7
n 3
coefficient
7
n 2
coefficient
4
n 3
coefficient
4
n 2
coefficient
7
n 1
coefficient
3
n 1
yulewakern 5.652
yulewakern 3.039
Enhanced GSM, G.729, etc.
It is not always a good idea to do
interpolation/prediction in the source
encoded domain (e.g., the 76 GSM
coefficients).
If previous frames are stored in time
domain or its frequency transform (FFT,
DCT, Wavelet, etc.), extrapolation
seems to be easier to perform. See next
slide.
*This approach does not depend on
Prediction of 18 future samples
from the past 54 samples of a real
female speech signal
17.277689
Y
k
Y1
2.752105 10
20
10
3
0
0
10
1
20
30
40
k
50
60
70
80
72
trace 1
4
1.5 10
n
( e)
4
1 10
j
( e1 )
q
j
5000
0
0
10
20
trace 1
trace 2
30
j
40
50
60
70
80
72
2
n
Problems with FFT
High accuracy of computation is
needed. Otherwise, the prediction
becomes unstable, see next slide
Research on other exponential or other
kernels are under study.
Algorithm for Extrapolation
2
XXj
ttj
m=
exp
j
ii
m
1
q
DD XX
hh
1
s
90
1.05
120
hh
tt
60
120
0.79
150
mag ( s )
ii
30
0.53
180
150
mag ( s )
0
0
210
240
180
0
1 2
300
330
240
Er
j ( q )
trace 1
Er
r
t h2
t
Er
n inn
2
1 1
2
t
E( t
h2
300
270
( q )
1
0
210
trace 1
t
30
0.53
270
lk
60
0.26
j
330
ii
90
1.05
0.79
0.26
lk
CFFT ( XX)
Err
rr
n inn
2
rr ) h2
t
h 2
t
E( rr
1
n inn
2 )
A different Kernel: exp(j)
120
90
1.05
1.05469
60
0.79
150
mag ( s )
ii
30
0.53
0.26
lk
180
0
0
210
1
330
240
300
270
ii ( q )
lk
trace 1
The above kernel was implemented on a DSP.
It is good for burst error correction but not for EC.
Frequency Domain Approach
Take Karhunen Loeve Transform of
Previous Speech Frames and Try to do
Prediction in the Transform Domain.
DCT, FFT, etc are Suboptimal Solutions.
The Advantage is that Both Types of
Correlation (Intra and Inter Frame) are
Taken into Account
Future Work
• Complete the Yule
Walker Prediction
• Develop the Time
Domain Extrapolation
• Study the Frequency
Domain Approach
• Joint Optimization of
Channel Coding,
Source Coding and
EC.
Conclusion: Lessons for UMTS
VAD Should Send A Code for Silence
Frames Immediately. This is Essential
for EC.
Smaller Frame Sizes are Better for EC
at the Expense of Less Compression.
More Frame Delay Should be Accepted
in the Standards for Better EC.
Silence Frames Can be Used for the
Retransmission of Previous Frames if