PARALLELIZATION OF WAVELET FILTERS USING SIMD EXTENSIONS
Abstract
Much work has been done to optimize wavelet transforms for the SIMD extensions of modern CPUs. However, these approaches are mostly restricted to the vertical part of 2-D transforms with line-wise organized memory layouts, because this case leads to a rather straightforward SIMD implementation. Using a common wavelet filter as an example, this work presents new approaches that apply SIMD operations to 1-D transforms and achieve reasonable speedups. As a result, the performance of algorithms that use wavelet transforms, such as JPEG2000, can be increased significantly. Several parallelization variants are presented and compared, and their advantages and disadvantages for general filters are discussed.


