Capítulo de livro Revisado por pares

Debugging Distributed Shared Memory Applications

2006; Springer Science+Business Media; Linguagem: Inglês

10.1007/11946441_75

ISSN

1611-3349

Autores

Jeffrey Olivier, Chih-Ping Chen, Jay Hoeflinger,

Tópico(s)

Distributed systems and fault tolerance

Resumo

A debugger is a crucial part of any programming system, and is especially crucial for those supporting a parallel programming paradigm, like OpenMP. A parallel, relaxed-consistency, distributed shared memory (DSM) system presents unique challenges to a debugger for several reasons: 1) the local copies of a given variable are not always consistent between distributed machines, so directly accessing the variable in the local memory by the debugger won’t always work as expected; 2) if the DSM and debugger both modify page protections, they will likely interfere with each other; and 3) since a large number of SIGSEGVs occur as part of the normal operation of a DSM program, a program error producing a SIGSEGV may be missed. In this paper, we discuss these problems and propose solutions.

Referência(s)