Artigo Revisado por pares

Efficient and retargetable SIMD translation in a dynamic binary translator

2018; Wiley; Volume: 48; Issue: 6 Linguagem: Inglês

10.1002/spe.2573

ISSN

1097-024X

Autores

Sheng‐Yu Fu, Ding‐Yong Hong, Yuping Liu, Jan‐Jan Wu, Wei‐Chung Hsu,

Tópico(s)

Advanced Data Storage Technologies

Resumo

Summary The single‐instruction multiple‐data (SIMD) computing capability of modern processors is continually improved to deliver ever better performance and power efficiency. For example, Intel has increased SIMD register lengths from 128 bits in streaming SIMD extension to 512 bits in AVX‐512. The ARM scalable vector extension supports SIMD register length up to 2048 bits and includes predicated instructions. However, SIMD instruction translation in dynamic binary translation has not received similar attention. For example, the widely used QEMU emulates guest SIMD instructions with a sequence of scalar instructions, even when the host machines have relevant SIMD instructions. This leaves significant potential for performance enhancement. We propose a newly designed SIMD translation framework for dynamic binary translation, which takes advantage of the host's SIMD capabilities. The proposed framework has been built in HQEMU, an enhanced QEMU with a separate thread for applying LLVM optimizations. The current prototype supports ARMv7, ARMv8, and IA32 guests on the X86‐64 AVX‐2 host. Compared with the scalar‐translation version HQEMU, our framework runs up to 1.84 times faster on Standard Performance Evaluation Corporation 2006 CFP benchmarks and up to 6.81 times faster on selected real applications.

Referência(s)
Altmetric
PlumX