Artigo Revisado por pares

An object-oriented implementation of a solver of the time-dependent Schrödinger equation using the CUDA technology

2011; Elsevier BV; Volume: 183; Issue: 3 Linguagem: Inglês

10.1016/j.cpc.2011.11.026

ISSN

1879-2944

Autores

Tomasz Dziubak, Jacek Matulewski,

Tópico(s)

Spectroscopy and Quantum Chemical Studies

Resumo

Abstract We present a set of C++ classes which allow one to use the graphics card processorʼs cores for quantum ab initio simulations, i.e. a direct solving of the time-dependent Schrodinger equation, gaining the benefits from the parallel architecture of the graphical processor units. We use the Chebyshev polynomial and FFT algorithm. The solution is based on NVIDIA CUDA technology. The speed-up factor in the test runs of our classes performed using the graphics card processor can even be of order of 300 in comparison with the test runs using only the single core of CPU. Not only the Schrodinger equation can be integrated using the presented solver. With only small changes it can be used for solving the nonlinear Gross–Pitaevskii equation of BECʼs dynamics, the heat equation, the diffusion equation or other parabolic partial differential equations of second order. 1 Program summary Program title: QnDynCUDA Catalogue identifier: AELE_v1_0 Program summary URL: http://cpc.cs.qub.ac.uk/summaries/AELE_v1_0.html Program obtainable from: CPC Program Library, Queenʼs University, Belfast, N. Ireland Licensing provisions: Standard CPC licence, http://cpc.cs.qub.ac.uk/licence/licence.html No. of lines in distributed program, including test data, etc.: 101 359 No. of bytes in distributed program, including test data, etc.: 3 165 228 Distribution format: tar.gz Programming language: C++, C for CUDA Computer: Graphics card with CUDA technology recommended Operating system: No limits (tested on 32-bit and 64-bit Windows and 64-bit Linux) Has the code been vectorized or parallelized?: Yes, number of processors used – one CPU core and all CUDA cores of the selected processor of graphics card RAM: Dependent on userʼs parameters, typically between several tens of megabytes and several gigabytes (this concerns also the graphics cardʼs memory) Supplementary material: Test input and output files (approx. 3.4 Gigabytes) are available Classification: 2.7, 6.5 Nature of problem: Solving the time-dependent Schrodinger equation. Solution method: FFT and Chebyshev polynomial algorithm, CUDA technology. Running time: Every test example included in the distribution package takes approximately an hour or so if the GPU is engaged and a day or so if only CPU is used.

Referência(s)
Altmetric
PlumX