A New Vectorization Technique for Expression Templates in C++
2012; Volume: 10; Issue: 4 Linguagem: Inglês
10.33697/ajur.2012.003
ISSN2375-8732
AutoresJ. Progsch, Yves Ineichen, Andreas Adelmann,
Tópico(s)Embedded Systems Design Techniques
ResumoVector operations play an important role in high performance computing and are typically provided by highly optimized libraries that implement the Basic Linear Algebra Subprograms (BLAS) interface. In C++ templates and operator overloading allow the implementation of these vector operations as expression templates which construct custom loops at compile time and providing a more abstract interface. Unfortunately existing expression template libraries lack the performance of fast BLAS implementations. This paper presents a new approach - Statically Accelerated Loop Templates (SALT) - to close this performance gap by combining expression templates with an aggressive loop unrolling technique. Benchmarks were conducted using the Intel C++ compiler and GNU Compiler Collection to assess the performance of our library relative to Intel's Math Kernel Library as well as the Eigen template library. The results show that the approach is able to provide optimization comparable to the fastest available BLAS implementations, while retaining the convenience and flexibility of a template library.
Referência(s)