Transcript Slide 1
Image Compression with a Geometrical Entropy Coder Onur G. Guleryuz and Arthur L. da Cunha DoCoMo USA Labs Palo Alto, CA University of Illinois at Urbana-Champaign, Champaign, IL 1 Overview In a nutshell: This work is about compressing images with geometrical singularities (~ edges along curves). We utilize a known suboptimal transform (the wavelet transform) and try to improve its performance on such images by adding singularity intelligence (~directional prediction). Summary of transform compression. Optimality in 1-D vs. in 2-D. Illustration of the problems for wavelet transforms in 2-D. Our approach with examples. Algorithm outline. Results Conclusion 2 Transform Compression of Signals Type of signal: Stationary, Gaussian signals Optimal Transform Compression: Karhunen-Loeve transform + “classical” coefficient coder [Gersho&Gray] 1-D Piecewise smooth with point singularities at random locations Wavelet transform 2-D Piecewise smooth with singularities along “geometric” curves Curvelet/Contourlet/... transform Classical Coefficient Coder Syntax: Send number of “significant” coefficients Send each significant coefficient + “modern” coefficient coder [Falzon&Mallat, Donoho-Vetterli-DeVore-Daubechies, CohenDaubechies-Guleryuz-Orchard] + modern coefficient coder [Candes&Donoho, Do&Vetterli, Peyre&Mallat, Cohen, Simoncelli and others] Modern Coefficient Coder Syntax: Send number of significant coefficients Send which coefficients are significant. Send each significant coefficient [JPEG,SPIHT,JPEG2000,...] 3 Practical Algorithms in TwoDimensions There are many practical difficulties that newly designed transforms must surmount. New, compression targeted image-adaptive representations and transforms try to overcome these issues Wavelet footprints, Bandelettes, Directionlets, h.264/AVC INTRA mode, directional lifting based approaches … Simple Observation: While wavelet based algorithms are not good over geometric singularities, they are very good away from geometric singularities (it is difficult to beat wavelet coders on images with few singularities). Our approach: Take something that works well (wavelets) and add singularity intelligence to it (directional prediction using geometric flow). Try to get the best of both worlds. 4 Why? Practical issues that motivate this work: General design steps for new transform compression algorithms: • Design new representation • Design associated modern coefficient coder • Test against the state-of-the-art (SPIHT, JPEG2000, ...) Our approach: Keep old representation but add geometric prediction Optimally (R-D) incorporate inside state-of-the-art, modern coefficient coder (SPIHT in our case) Always do equal or better than the state-of-the art Hard Hard Hard Easy Conceptual issues that motivate this work: Can we exploit regularity in wavelet domain using simple, first principal approaches rather than using sophisticated mathematical constructs (such as approaches based on complex wavelets, Alpert wavelets, etc.) ? 5 Simple Illustration of the Problem Faced by the Wavelet Transform TOO MANY SIGNIFICANT COEFFICIENTS! 2-D Discrete Wavelet Transform (DWT) LHLH HLHL HHHH LH Band Legend for wavelet transform image: gray : zero valued coefficient, light : large positive value, dark : large negative value HL Band HH Band 6 Our Solution: New Geometrical Representation Image Wavelet Transform Wavelet Coefficients Image adaptive geometrical transform New Coefficients Image adaptive “Flow” New Syntax: Send geometric flow. Send number of “significant” coefficients Send which coefficients are significant. Send each coefficient. (i.e., send flow then use a modern coefficient coder to code new coefficients.) 7 What is a “Flow” The flow field is a field along which a function is regular (~ two points connected by a flow do not have a singularity between them along the flow.) Image Image with superimposed flow ... 8 Algorithm Outline & Properties • Calculate wavelet transform coefficients. • Compute a set of helper variables to use in flow based prediction. • Compute optimal flow (each wavelet band has its own flow). • Encode flow. • Generate a new set of coefficients conditioned on the flow (do directional prediction in wavelet domain based on the flow and generate prediction errors). • Encode the new coefficients with a modern coefficient coder (SPIHT). 9 Example • Our algorithm amounts to doing directional prediction in wavelet domain. Computed flow on LH band ... LHLH LH Band New LHLH New LH Band Too easy! 10 Directional prediction in wavelet domain is hard! 11 UDWT as Helper in Prediction • We causally compute undecimated wavelet transform coefficients and use them as helper variables in prediction. DWT (NxN) (Flow direction is unclear) UDWT (2Nx2N) (Flow direction is clear) 12 Harder Example LH Band Computed flow on LH Band (superimposed on the image) New LH Band (~4dB better in R-D) 13 The picture faced by our predictor We generate UDWT coefficients, i.e., auxiliary variables (128x128) (256x256) Difficult prediction Easier prediction 14 Views of Data to be Predicted (Profile depends on wavelet filter and singularity structure) We optimize over a discretized set of directions using simple interpolation. We use AR prediction. 15 Example Flows Computed on LH Band 16 Algorithm Details Separate flow for each band. Flow on a quad-tree. Optimized using bottomup dynamic programming. Causal UDWT computation (this is detailed, please look a the paper if interested): Two buffers, spatial domain buffer of current image approximation UDWT buffer Encode next DWT coefficient. Update spatial buffer. Update UDWT buffer. (Decoder does the same). For each DWT coefficient: Form a 1D causal sequence of coefficient values along the flow (UDWT). Use these to predict DWT coefficient 17 Two Coders “Geometrical Entropy Coder” Image DPCM Image Wavelet Transform Wavelet Coefficients Wavelet Transform Wavelet Coefficients Quantization Quantization + Flow based prediction Quantized Coefficients (Invertible) New Coefficients Flow based prediction New Coefficients (The utilized flow is the one that is rate optimized for the geometric entropy coder.) 18 Some Rate-Distortion Results (Computer generated image) Better SPIHT mode (better by ~ 1+ dB) First order entropy (better by ~ 2+ dB) 19 Some Rate-Distortion Results SPIHT mode (better by ~ 1 dB) First order entropy (better by ~ 220dB) Some Rate-Distortion Results SPIHT mode (better by ~ 0.6 dB) First order entropy (better by ~ 1.521dB) Conclusion • Flow bits < %5 (we need to encode the flow better to be more competitive). • Current state of our work: • Currently “up to” ~ %40 rate improvements on toy images and ~ %10 rate improvements on some natural images. • Lena, boat, etc., ~ small improvements • Better prediction and interpolation will improve our results. • Issues with bit-plane coders (255=128). • Our technique keeps working at low rates (the results in the paper stop predicting after a certain DWT level) . • Advantages of our work: • We do directional prediction along lines but we can also do so along curves easily. • Our work can be used to design sophisticated context based arithmetic coders. 22 [1] E. J. Cand`es and D. L. Donoho, “New tight frames of curvelets and optimal representations of objects with piecewise C2 singularities,” Comm. Pure and Appl. Math, vol. 57, no. 2, pp. 219–266, February 2004. [2] E. Le Pennec and S. Mallat, “Sparse geometric image representation with bandelets,” IEEE Trans. Image Proc., vol. 14, no. 4, pp. 423–438, April 2005. [3] M. B. Wakin, J. K. Romberg, H. Choi, and R. G. Baraniuk, “Wavelet-domain approximation and compression of piecewise smooth images,” in IEEE Trans. Image Proc., to appear, 2005. [4] R. Shukla, P. L. Dragotti, M. N. Do, and M. Vetterli, “Rate-distortion optimized tree structured compression algorithms for piecewise smooth images,” IEEE Transactions on Image Processing, vol. 14, pp. 342–359, 2005. [5] A. Said and W.A. Pearlman, “A new fast and efficient image codec based on set partitioning in hierachical trees,” IEEE Trans. on Circuits and Systems for Video technology, vol. 6, no. 3, pp. 243-250, June 1996. … (please contact me if you would like more references) 23