Auto-Vectorization of Loops on Intel 64 and Intel Xeon Phi: Analysis and Evaluation
2017; Springer Science+Business Media; Linguagem: Inglês
10.1007/978-3-319-62932-2_13
ISSN1611-3349
AutoresOlga V. Moldovanova, Mikhail G. Kurnosov,
Tópico(s)Numerical Methods and Algorithms
ResumoThis paper evaluates auto-vectorizing capabilities of modern optimizing compilers Intel C/C++, GCC C/C++, LLVM/Clang and PGI C/C++ on Intel 64 and Intel Xeon Phi architectures. We use the Extended Test Suite for Vectorizing Compilers consisting of 151 loops. In this work, we estimate speedup by running the loops in scalar and vector modes for different data types and determine loop classes which the compilers used in the study fail to vectorize. We use the dual CPU system (NUMA, 2 x Intel Xeon E5-2620v4, Intel Broadwell microarchitecture) with the Intel Xeon Phi 3120A co-processor for our experiments.
Referência(s)