DENSE MATRIX-VECTOR MULTIPLICATION ON THE CUDA ARCHITECTURE
Abstract
Recently GPUs have acquired the ability to perform fast general purpose computation by running thousands of threads concurrently. This paper presents a new algorithm for dense matrix-vector multiplication on the NVIDIA CUDA architecture. The experiments are conducted on a PC with GeForce 8800GTX and 2.0 GHz Intel Xeon E5335 CPU. The results show that the proposed algorithm runs a maximum of 11.19 times faster than NVIDIA's BLAS library CUBLAS 1.1 on the GPU and 35.15 times faster than the Intel Math Kernel Library 9.1 on a single core x86 with SSE3 SIMD instructions. The performance of Jacobi's iterative method for solving linear equations, which includes the data transfer time between CPU and GPU, shows that the proposed algorithm is practical for real applications.
References
- Computer Graphics Forum 26, 80 (2007), DOI: 10.1111/j.1467-8659.2007.01012.x. Crossref, ISI, Google Scholar
- ACM Queue 6, 40 (2008), DOI: 10.1145/1365490.1365500. Crossref, Google Scholar
S. Sengupta , Scan Primitives for GPU Computing, Proc. Graphics Hardware 2007 (2007) pp. 97–106. Google Scholar- ACM Transactions on Graphics 23, 777 (2004), DOI: 10.1145/1015706.1015800. Crossref, ISI, Google Scholar
- ACM Transactions on Graphics 22, 908 (2003). Crossref, ISI, Google Scholar
D. Tarditi , S. Puri and J. Oglesby , Accelerator: Using Data Parallelism to Program GPUs for General-Purpose Uses, Proc. 12th International Conference on Architectural Support for Programming Languages and Operating Systems (2006) pp. 325–335. Google Scholar- NVIDIA, CUDA Programming Guide 1.1, http://www.nvidia.com/object/cuda_develop.html, Nov. 2007 . Google Scholar
- NVIDIA, CUDA CUBLAS Library 1.1, http://www.nvidia.com/object/cuda_develop.html, Sept. 2007 . Google Scholar
- NVIDIA, Technical Brief: NVIDIA GeForce 8800 GPU Architecture Overview, http://www.nvidia.com/page/8800_tech_briefs.html, Nov. 2006 . Google Scholar
- IEEE Micro 28, 39 (2008), DOI: 10.1109/MM.2008.31. Crossref, ISI, Google Scholar
-
G. Strang , Linear Algebra and Its Applications , 3rd edn. ( Harcourt Inc , 1988 ) . Google Scholar


