Accelerating kernel density estimation on the GPU using the CUDA framework

Artigo

Accelerating kernel density estimation on the GPU using the CUDA framework

2013; Volume: 7; Linguagem: Inglês

10.12988/ams.2013.13133

ISSN

1314-7552

Autores

Panagiotis D. Michailidis, Konstantinos G. Margaritis,

Tópico(s)

Advanced Data Compression Techniques

Resumo

The main problem of the kernel density estimation methods is the huge computational requirements, especially for large data sets. One way for accelerating these methods is to use the parallel processing. Recent advances in parallel processing have focused on the use Graphics Processing Units (GPUs) using Compute Unied Device Architecture (CUDA) programming model. In this work we discuss a naive and two optimised CUDA algorithms for the two kernel estimation methods: univariate and multivariate. These optimised algorithms are based on the use of shared memory tiles and loop unrolling techniques. We also present exploratory experimental results of the proposed CUDA algorithms according to the several values of parameters such as number of threads per block, tile size, loop unroll level, number of variables and data (sample) size. The experimental results show signicant performance gains of all proposed CUDA algorithms over serial CPU version and small performance speed-ups of the two optimised CUDA algorithms over naive GPU algorithms. Finally, based on extended performance results are obtained general conclusions of all proposed CUDA algorithms for some parameters.

Ver no editor

Altmetric

PlumX

Entrar

Lembrar minha senha

Receber meu e-mail de confirmação

Accelerating kernel density estimation on the GPU using the CUDA framework