Peer-reviewed article

A hexagonal orthogonal-oriented pyramid as a model of image representation in visual cortex

1989; Institute of Electrical and Electronics Engineers; Volume: 36; Issue: 1; Language: English

10.1109/10.16453

ISSN

1558-2531

Authors

Andrew B. Watson, A. J. Ahumada

Topic(s)

Visual perception and processing mechanisms

Abstract

Retinal ganglion cells represent the visual image with a spatial code, in which each cell conveys information about a small region in the image. In contrast, cells of the primary visual cortex use a hybrid space-frequency code in which each cell conveys information about a region that is local in space, spatial frequency, and orientation. A mathematical model for this transformation is described. The hexagonal orthogonal-oriented quadrature pyramid (HOP) transform, which operates on a hexagonal input lattice, uses basis functions that are orthogonal, self-similar, and localized in space, spatial frequency, orientation, and phase. The basis functions, which are generated from seven basic types through a recursive process, form an image code of the pyramid type. The seven basis functions, six bandpass and one low-pass, occupy a point and a hexagon of six nearest neighbors on a hexagonal lattice. The six bandpass basis functions consist of three with even symmetry and three with odd symmetry. At the lowest level, the inputs are image samples. At each higher level, the input lattice is provided by the low-pass coefficients computed at the previous level.
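
The recursive structure described in the abstract can be illustrated with a minimal Python sketch. It is not the HOP transform itself: the abstract does not give the kernel coefficients, so a plain seven-point average stands in for the low-pass basis function and the six bandpass outputs are omitted. The axial coordinates (q, r), the coarse-lattice generators (2, 1) and (-1, 3), and the helper names (is_centre, to_coarse, lowpass_level, build_pyramid, hex_disc) are assumptions made for illustration. What the sketch does show is how seven-point neighborhoods (a point plus its six nearest neighbors) tile a hexagonal lattice, and how the low-pass outputs of one level form the input lattice of the next.

```python
# Axial coordinates (q, r) on a hexagonal lattice; offsets of the six nearest neighbours.
NEIGHBOURS = [(1, 0), (0, 1), (-1, 1), (-1, 0), (0, -1), (1, -1)]


def is_centre(q, r):
    """True if (q, r) lies on the coarse lattice spanned by (2, 1) and (-1, 3).

    This sublattice keeps one point in seven, so the seven-point
    neighbourhoods around its points tile the plane without overlap.
    """
    return (q - 2 * r) % 7 == 0


def to_coarse(q, r):
    """Re-index a centre (q, r) = a*(2, 1) + b*(-1, 3) as coarse coordinates (a, b)."""
    return (3 * q + r) // 7, (2 * r - q) // 7


def lowpass_level(samples):
    """One pyramid step on a dict {(q, r): value}.

    ASSUMPTION: a uniform average is used as a placeholder low-pass kernel;
    the actual HOP kernels are not specified in the abstract.
    Neighbourhoods that are incomplete at the border are skipped.
    """
    out = {}
    for (q, r), centre_value in samples.items():
        if not is_centre(q, r):
            continue
        pts = [(q + dq, r + dr) for dq, dr in NEIGHBOURS]
        if all(p in samples for p in pts):
            total = centre_value + sum(samples[p] for p in pts)
            out[to_coarse(q, r)] = total / 7.0
    return out


def build_pyramid(samples, levels):
    """Apply lowpass_level repeatedly; level 0 holds the image samples."""
    pyramid = [samples]
    for _ in range(levels):
        pyramid.append(lowpass_level(pyramid[-1]))
    return pyramid


def hex_disc(radius, value=1.0):
    """Hypothetical test image: constant samples on a hexagonal patch."""
    return {
        (q, r): value
        for q in range(-radius, radius + 1)
        for r in range(-radius, radius + 1)
        if abs(q + r) <= radius
    }


if __name__ == "__main__":
    pyramid = build_pyramid(hex_disc(4), levels=2)
    print([len(level) for level in pyramid])
```

Because the coarse lattice is again hexagonal (scaled by the square root of 7 and rotated), the same neighbor offsets apply at every level, which is what lets a single routine be reused recursively.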

Reference(s)