Auto-Vectorization of Loops on Intel 64 and Intel Xeon Phi: Analysis and Evaluation

Book chapter, peer-reviewed

2017; Springer Science+Business Media; Language: English

10.1007/978-3-319-62932-2_13

ISSN

1611-3349

Authors

Olga V. Moldovanova, Mikhail G. Kurnosov

Topic(s)

Numerical Methods and Algorithms

Abstract

This paper evaluates the auto-vectorizing capabilities of modern optimizing compilers (Intel C/C++, GCC C/C++, LLVM/Clang, and PGI C/C++) on the Intel 64 and Intel Xeon Phi architectures. We use the Extended Test Suite for Vectorizing Compilers, which consists of 151 loops. We estimate speedup by running the loops in scalar and vector modes for different data types, and we identify the loop classes that the compilers in the study fail to vectorize. Our experiments use a dual-CPU NUMA system (2 x Intel Xeon E5-2620 v4, Intel Broadwell microarchitecture) with an Intel Xeon Phi 3120A co-processor.
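As a rough illustration of the kind of comparison the abstract describes, the sketch below contrasts a loop with independent iterations against one with a loop-carried dependence, and computes speedup as the ratio of scalar to vector run time. The function names (`add_mul`, `running_sum`, `speedup`) are hypothetical and are not taken from the test suite or the paper. The exact build settings used in the study are not given here.

```c
#include <stddef.h>

/* Hypothetical example, not a loop from the test suite.
   Independent iterations: each a[i] depends only on b[i] and c[i], so a
   compiler is free to process several elements per SIMD instruction.
   `restrict` promises the compiler that the arrays do not overlap. */
void add_mul(float *restrict a, const float *restrict b,
             const float *restrict c, size_t n)
{
    for (size_t i = 0; i < n; i++)
        a[i] = b[i] + 2.0f * c[i];
}

/* Hypothetical example of a loop-carried dependence: a[i] needs the
   a[i - 1] written one iteration earlier, which typically prevents
   straightforward vectorization. */
void running_sum(float *a, const float *b, size_t n)
{
    for (size_t i = 1; i < n; i++)
        a[i] = a[i - 1] + b[i];
}

/* Speedup of the vector build over the scalar build of the same loop,
   measured on the same data: scalar time divided by vector time. */
double speedup(double t_scalar, double t_vector)
{
    return t_scalar / t_vector;
}
```

With GCC, for instance, one way to obtain the two modes would be to build once with `-O3` and once with `-O3 -fno-tree-vectorize`, and to add `-fopt-info-vec` to see which loops were vectorized. This is an assumption about a possible setup, not a description of the authors' configuration.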
