Performance Analysis and Tuning for General Purpose Graphics Processing Units (GPGPU)
2012; Morgan & Claypool Publishers; Volume: 7; Issue: 2 Linguagem: Inglês
10.2200/s00451ed1v01y201209cac020
ISSN1935-3243
AutoresHyesoon Kim, Richard Vuduc, Sara S. Baghsorkhi, Jee Choi, Wen‐mei Hwu,
Tópico(s)Interconnection Networks and Systems
ResumoGeneral-purpose graphics processing units (GPGPU) have emerged as an important class of shared memory parallel processing architectures, with widespread deployment in every computer class from high-end supercomputers to embedded mobile platforms. Relative to more traditional multicore systems of today, GPGPUs have distinctly higher degrees of hardware multithreading (hundreds of hardware thread contexts vs. tens), a return to wide vector units (several tens vs. 1-10), memory architectures that deliver higher peak memory bandwidth (hundreds of gigabytes per second vs. tens), and smaller caches/scratchpad memories (less than 1 megabyte vs. 1-10 megabytes).
Referência(s)