Implementation and Performance Analysis of CPU-GPU Parallel Matrix Multiplication
2010; East China Computer Technology Research Institute; Linguagem: Inglês
ISSN
1000-3428
Autores Tópico(s)Advanced Data Storage Technologies
ResumoThe implementation of the CPU-GPU hybrid DGEMM is carried out on the ATI platform to improve the computing performance by computing both on GPU and CPU.Experimental results show that when matrix size is large,its performance on AMD Phenom II X4 940 and ATI FireStream 9270 platform,compared with using GPU alone,can be improved 16% on average.The evaluation method is verified along with the discussion of the factors which impact the hybrid DGEMM performance.
Referência(s)