Stabilized Structure from Motion without Disparity Induces Disparity Adaptation

Artigo Acesso aberto Revisado por pares

Stabilized Structure from Motion without Disparity Induces Disparity Adaptation

2004; Elsevier BV; Volume: 14; Issue: 3 Linguagem: Inglês

10.1016/j.cub.2004.01.031

ISSN

1879-0445

Autores

Fang Fang, Sheng He,

Tópico(s)

Ophthalmology and Visual Impairment Studies

Resumo

3D structures can be perceived based on the patterns of 2D motion signals [1.Rogers B. Graham M. Motion parallax as an independent cue for depth perception.Perception. 1979; 8: 125-134Crossref PubMed Scopus (435) Google Scholar, 2.Wallach H. O'Connell D.N. The kinetic depth effect.J. Exp. Psychol. 1953; 45: 205-217Crossref PubMed Scopus (685) Google Scholar]. With orthographic projection of a 3D stimulus onto a 2D plane, the kinetic information can give a vivid impression of depth, but the depth order is intrinsically ambiguous, resulting in bistable or even multistable interpretations [3.Hol K. Koene A. van Ee R. Attention-biased multi-stable surface perception in three-dimensional structure-from-motion.J. Vis. 2003; 3: 486-498Crossref PubMed Scopus (38) Google Scholar]. For example, an orthographic projection of dots on the surface of a rotating cylinder is perceived as a rotating cylinder with ambiguous direction of rotation [4.Andersen R. Bradley D. Perception of three-dimensional structure from motion.Trends Cogn. Sci. 1998; 2: 222-228Abstract Full Text Full Text PDF PubMed Scopus (58) Google Scholar]. We show that the bistable rotation can be stabilized by adding information, not to the dots themselves, but to their spatial context. More interestingly, the stabilized bistable motion can generate consistent rotation aftereffects. The rotation aftereffect can only be observed when the adapting and test stimuli are presented at the same stereo depth and the same retinal location, and it is not due to attentional tracking. The observed rotation aftereffect is likely due to direction-contingent disparity adaptation, implying that stimuli with kinetic depth may have activated neurons sensitive to different disparities, even though the stimuli have zero relative disparity. Stereo depth and kinetic depth may be supported by a common neural mechanism at an early stage in the visual system. 3D structures can be perceived based on the patterns of 2D motion signals [1.Rogers B. Graham M. Motion parallax as an independent cue for depth perception.Perception. 1979; 8: 125-134Crossref PubMed Scopus (435) Google Scholar, 2.Wallach H. O'Connell D.N. The kinetic depth effect.J. Exp. Psychol. 1953; 45: 205-217Crossref PubMed Scopus (685) Google Scholar]. With orthographic projection of a 3D stimulus onto a 2D plane, the kinetic information can give a vivid impression of depth, but the depth order is intrinsically ambiguous, resulting in bistable or even multistable interpretations [3.Hol K. Koene A. van Ee R. Attention-biased multi-stable surface perception in three-dimensional structure-from-motion.J. Vis. 2003; 3: 486-498Crossref PubMed Scopus (38) Google Scholar]. For example, an orthographic projection of dots on the surface of a rotating cylinder is perceived as a rotating cylinder with ambiguous direction of rotation [4.Andersen R. Bradley D. Perception of three-dimensional structure from motion.Trends Cogn. Sci. 1998; 2: 222-228Abstract Full Text Full Text PDF PubMed Scopus (58) Google Scholar]. We show that the bistable rotation can be stabilized by adding information, not to the dots themselves, but to their spatial context. More interestingly, the stabilized bistable motion can generate consistent rotation aftereffects. The rotation aftereffect can only be observed when the adapting and test stimuli are presented at the same stereo depth and the same retinal location, and it is not due to attentional tracking. The observed rotation aftereffect is likely due to direction-contingent disparity adaptation, implying that stimuli with kinetic depth may have activated neurons sensitive to different disparities, even though the stimuli have zero relative disparity. Stereo depth and kinetic depth may be supported by a common neural mechanism at an early stage in the visual system. Ambiguous structure from motion generated from orthographic projection of 3D moving objects can be disambiguated by information (e.g., disparity, speed, contrast, etc.) that specifies the depth order to the moving elements [5.Braunstein M.L. Perception of rotation in depth: a process model.Psychol. Rev. 1972; 79: 510-524Crossref PubMed Scopus (17) Google Scholar, 6.Longuet-Higgins H.C. Prazdny K. The interpretation of a moving retinal image.Proc. R. Soc. Lond. B. Biol. Sci. 1980; 208: 385-397Crossref PubMed Scopus (887) Google Scholar, 7.Nawrot M. Blake R. Neural integration of information specifying structure from stereopsis and motion.Science. 1989; 244: 716-718Crossref PubMed Scopus (129) Google Scholar, 8.Schwartz B. Sperling G. Luminance controls the perceived 3-D structure of dynamic 2-D displays.Bull Psychon Soc. 1983; 21: 456-458Crossref Scopus (48) Google Scholar]. Multiple ambiguous stimuli tend to covary [9.Eby D.W. Loomis J.M. Solomon E.M. Perceptual linkage of multiple objects rotating in depth.Perception. 1989; 18: 427-444Crossref PubMed Scopus (31) Google Scholar, 10.Gillam B. Grouping of multiple ambiguous contours: towards an understanding of surface perception.Perception. 1976; 5: 203-209Crossref PubMed Scopus (15) Google Scholar, 11.Grossmann J.K. Dobbins A. Differential ambiguity reduces grouping of metastable objects.Vision Res. 2003; 43: 359-369Crossref PubMed Scopus (26) Google Scholar], suggesting the possibility that the perception of an ambiguous stimulus could be influenced by its spatial context. Sereno and Sereno (1999) demonstrated that motion of the 2D surround of an ambiguously rotating stimulus can bias the oppositely moving dots to be perceived as the front surface of a 3D kinetic sphere as a result of a 2D motion contrast effect, thus partially stabilizing the ambiguous rotation in a subset of the observers [12.Sereno M.E. Sereno M.I. 2-D center-surround effects on 3-D structure-from-motion.J. Exp. Psychol. Hum. Percept. Perform. 1999; 25: 1834-1854Crossref PubMed Scopus (20) Google Scholar]. Stabilization could also be achieved through temporal manipulations, such as intermittent presentation of the stimulus [13.Leopold D.A. Wilke M. Maier A. Logothetis N.K. Stable perception of visually ambiguous patterns.Nat. Neurosci. 2002; 5: 605-609Crossref PubMed Scopus (297) Google Scholar, 14.Maier A. Wilke M. Logothetis N.K. Leopold D.A. Perception of temporally interleaved ambiguous patterns.Curr. Biol. 2003; 13: 1076-1085Abstract Full Text Full Text PDF PubMed Scopus (96) Google Scholar]. We observed that information presented in the context of the ambiguous stimulus could almost completely stabilize the ambiguous stimuli. The stimulus used in our study is a typical rotating cylinder generated from an orthographic projection of dots on a rotating 3D cylinder and is similar to stimuli used in previous psychophysical [3.Hol K. Koene A. van Ee R. Attention-biased multi-stable surface perception in three-dimensional structure-from-motion.J. Vis. 2003; 3: 486-498Crossref PubMed Scopus (38) Google Scholar, 7.Nawrot M. Blake R. Neural integration of information specifying structure from stereopsis and motion.Science. 1989; 244: 716-718Crossref PubMed Scopus (129) Google Scholar] and physiological [4.Andersen R. Bradley D. Perception of three-dimensional structure from motion.Trends Cogn. Sci. 1998; 2: 222-228Abstract Full Text Full Text PDF PubMed Scopus (58) Google Scholar, 15.Bradley D. Chang G. Andersen R. Encoding of three-dimensional structure-from-motion by primate area MT neurons.Nature. 1998; 392: 714-717Crossref PubMed Scopus (237) Google Scholar, 16.Dodd J. Krug K. Cumming B. Parker A. Perceptually bistable three-dimensional figures evokes high choice probabilities in cortical area MT.J. Neurosci. 2001; 21: 4809-4821PubMed Google Scholar] studies. The ambiguous stimulus, perceived as a rotating cylinder with its rotation direction switching every few seconds, was presented to only one eye, (Figure 1A). (The percepts of two concave or convex sheets, moving across each other, are also possible [3.Hol K. Koene A. van Ee R. Attention-biased multi-stable surface perception in three-dimensional structure-from-motion.J. Vis. 2003; 3: 486-498Crossref PubMed Scopus (38) Google Scholar] but were rarely seen by our observers; hence, they are not discussed in this paper and not depicted in figures.) When disparity information was added to the two ends of this bistable cylinder (i.e., a whole cylinder was presented to one eye, and only two ends of the cylinder were presented to the other eye), the whole cylinder was perceived to rotate in the direction specified by the disparity in the two ends, although the middle section contained no information to specify the depth order (Figure 1B). For the four observers tested, all perceived the cylinder as rotating unambiguously, 100% of the time, over multiple 1 min test periods. The spatial contextual cue was very effective in disambiguating the ambiguous motion. Our observation differs from earlier reports of contextual biases on ambiguous rotation. The contextual bias due to simple 2D motion contrast simply enhances the opposite direction of motion in the central region and thus biases dots moving in such a direction to be perceived as being in front [12.Sereno M.E. Sereno M.I. 2-D center-surround effects on 3-D structure-from-motion.J. Exp. Psychol. Hum. Percept. Perform. 1999; 25: 1834-1854Crossref PubMed Scopus (20) Google Scholar]. In the case of linkage between multiple bistable stimuli, the coupling tends to break down between unambiguous and ambiguous stimuli [11.Grossmann J.K. Dobbins A. Differential ambiguity reduces grouping of metastable objects.Vision Res. 2003; 43: 359-369Crossref PubMed Scopus (26) Google Scholar]. The key reason that the ambiguous and unambiguous sections in our stimulus remain strongly linked is that monocular presentation of the ambiguous section of the stimulus reduced the disparity contrast between nonzero relative disparity in the unambiguous sections and zero relative disparity in the ambiguous section. Additionally, unlike in earlier studies in which the ambiguous and unambiguous stimuli appeared as separate and distinct objects, we made the ambiguous and unambiguous sections of the stimulus appear to be parts of the same object and thus enhanced the effectiveness of the disambiguation. Occlusion in general is a strong cue to depth relationships. The occlusion cue has been shown to be somewhat effective in disambiguating ambiguous kinetic depth perception [17.Proffitt D.R. Bertenthal B.I. Roberts Jr., R.J. The role of occlusion in reducing multistability in moving point-light displays.Percept. Psychophys. 1984; 36: 315-323Crossref PubMed Scopus (36) Google Scholar, 18.Braunstein M.L. Anderson G.J. Riefer D.M. The use of occlusion to resolve ambiguity in parallel projections.Percept. Psychophys. 1982; 31: 261-267Crossref PubMed Scopus (41) Google Scholar]. We also tested if an occlusion cue can disambiguate the surface assignment of the bistable cylinder and, hence, disambiguate its direction of rotation. First, we simply removed a vertical section of dots moving in one direction, our intention being to create a subjective occluder in the middle of the cylinder that blocks part of the back surface (Figure 1C). However, with this manipulation, the stimulus remained bistable. Observers perceived alternations between two percepts, as depicted in Figure 1C: two partial cylinders alternating with a missing section, either on the front or the back surface. We then sought to enhance the occluder by making it explicit. A checkered rectangle was placed behind the front surface and blocked part of the back surface. This manipulation was very effective in eliminating the ambiguity of surface assignment (Figure 1D). The perceived rotation became completely unambiguous for three of the four observers (see Experimental Procedures) over multiple 2 min test periods and became almost completely unambiguous for the observer S.H., who occasionally (less than 10% of the time) saw the dots traveling behind a semitransparent occluder. Prolonged exposure to unambiguous rotating stimuli [7.Nawrot M. Blake R. Neural integration of information specifying structure from stereopsis and motion.Science. 1989; 244: 716-718Crossref PubMed Scopus (129) Google Scholar, 19.Petersik J.T. Build-up and decay of a three-dimensional rotational aftereffect obtained with a three-dimensional figure.Perception. 2002; 31: 825-836Crossref PubMed Scopus (16) Google Scholar], but not to an ambiguously rotating stimulus [20.Webster W.R. Panthradil J.T. Conway D.M. A rotational stereoscopic 3-dimensional movement aftereffect.Vision Res. 1998; 38: 1745-1752Crossref PubMed Scopus (5) Google Scholar], can lead to rotation aftereffects. Can we observe an aftereffect from a stimulus that is perceptually stabilized by its context? Note that in the current study the adapting properties, direction of rotation or the sets of dots that are in front, are not specified in the local adapting stimulus but are perceptually stabilized by context. Immediately after 1 min of adaptation to one of the four adapting stimuli, observers were presented with a bistable test cylinder for 15 s (Figure 2A). As shown in Figure 2B, consistent with earlier studies [7.Nawrot M. Blake R. Neural integration of information specifying structure from stereopsis and motion.Science. 1989; 244: 716-718Crossref PubMed Scopus (129) Google Scholar, 20.Webster W.R. Panthradil J.T. Conway D.M. A rotational stereoscopic 3-dimensional movement aftereffect.Vision Res. 1998; 38: 1745-1752Crossref PubMed Scopus (5) Google Scholar], adapting to the cylinder that was disambiguated by full disparity resulted in a very strong aftereffect. However, adapting to the context-stabilized ambiguous rotating cylinder also resulted in a very strong aftereffect. All four observers perceived the test stimulus rotating in the direction opposite the adapting direction for most of the 15 s testing period. In addition to the two stabilized rotation stimuli included as adaptors (full disparity unambiguous, context-stabilized, ambiguous), two control conditions were also included. In one control (context only), observers adapted to the two end units alone, without the middle ambiguous section. This was to test whether the aftereffect could simply be a spreading of adaptation from adjacent regions as a result of, for example, large receptive fields of the underlying neurons. Another control condition (bistable) was simply the extended bistable cylinder. This was to test whether merely being exposed to a bistable rotating cylinder for 1 min would lead to some stabilization during the test phase. After adaptation in both control conditions, observers perceived the testing cylinder as a bistable one, alternatively rotating in either direction with close to 50% chance (Figure 2B). When adapted to the two end units alone, the two naive observers (J.M. and L.W.) showed a weak aftereffect, likely due to less stable fixation during adaptation. However, the small aftereffect is much weakerthan that generated by the stabilized, ambiguous adaptor. When the ambiguous cylinder was stabilized with an occluder, the adaptation effect was also very strong (Figure 3). Three of the four observers always perceived the test stimulus to be rotating in the direction opposite the adapted direction. Observer S.H. was the only one who saw occasional reversals in rotation direction during adaptation and, consequently, showed a slightly weaker adaptation effect (test stimulus rotating in the aftereffect direction 88% instead of 100% of the time). For a control condition, we took advantage of the observation that when the occluder was not explicitly depicted (subjective occluder), perception was not stable, but alternated between the two interpretatations of depth (see Figure 1C). The 2D motion in the control condition was the same as motion with the explicit occluder. However, after adaptation to the control stimulus for 2 min, none of the observers showed any evidence of an aftereffect (Figure 3B). Note that, in both the test and the control condition, there was only one direction of motion signal in the middle section, which could and did lead to a simple 2D motion aftereffect. However, the simple 2D motion aftereffect could not influence the assignment of dots to the front or the back surface of the ambiguous test cylinder, as demonstrated by the absence of a rotation aftereffect in the control condition (Figure 3). The adaptation effect found here is retinotopically specific. It requires that the test pattern be presented at the same retinal location as the adapting pattern [21.Nawrot M. Blake R. The interplay between stereopsis and structure from motion.Percept. Psychophys. 1991; 49: 230-244Crossref PubMed Scopus (81) Google Scholar, 22.Nawrot M. Blake R. On the perceptual identity of dynamic stereopsis and kinetic depth.Vision Res. 1993; 33: 1561-1571Crossref PubMed Scopus (40) Google Scholar]. This retinotopic specificity is evident after adaptation to a rotating cylinder that has been disambiguated by disparity or stabilized by context or occluder. For example, in Figure 2, the context-only condition did not generate the adaptation effect. In further tests, the aftereffect was not observed as long as there was no spatial overlap between the adapting and testing stimuli. More surprisingly, this adaptation effect also requires that the test pattern be placed at the same stereo depth plane as the adapting pattern. The aftereffect disappeared if the adapting and test stimuli were presented with different absolute disparities (Figure 4A). Under such conditions, all observers perceived that the test pattern alternated direction of rotation, with each direction being observed for nearly the same amount of time (black bars in Figure 2, Figure 3). The retinotopic and disparity specificity of this aftereffect implies that this adaptation occurs relatively early in the visual system when one considers that rotation-sensitive neurons have quite large receptive fields [23.Andersen R. Neural mechanisms of visual motion perception in primates.Neuron. 1997; 18: 865-872Abstract Full Text Full Text PDF PubMed Scopus (86) Google Scholar]. It is interesting to note that the stabilization of rotation direction, over intermittent presentations [13.Leopold D.A. Wilke M. Maier A. Logothetis N.K. Stable perception of visually ambiguous patterns.Nat. Neurosci. 2002; 5: 605-609Crossref PubMed Scopus (297) Google Scholar, 14.Maier A. Wilke M. Logothetis N.K. Leopold D.A. Perception of temporally interleaved ambiguous patterns.Curr. Biol. 2003; 13: 1076-1085Abstract Full Text Full Text PDF PubMed Scopus (96) Google Scholar], seems to be somewhat retinotopic specific but not disparity specific [24.Chen X. He S. What factors determine the stabilization of bi-stable stimulus?.Journal of Vision. 2003; 3: 254aCrossref Scopus (2) Google Scholar]. The aftereffect could originate in mechanisms encoding depth together with translational motion. Alternatively, the aftereffect could be a rotation aftereffect [19.Petersik J.T. Build-up and decay of a three-dimensional rotational aftereffect obtained with a three-dimensional figure.Perception. 2002; 31: 825-836Crossref PubMed Scopus (16) Google Scholar]. In the latter case, because the aftereffect was observed only when the test stimuli and adapting stimuli were presented at the same disparity and location, our data suggest that, at the same retinal location, there are separate rotation-sensitive neurons of different disparities. This requirement makes the rotation adaptation model less parsimonious, although theoretically possible. However, additional considerations argue against this model. First, an opponent mechanism tuned to rotation would predict that after prolonged adaptation to an unambiguous rotation, one would perceive a static cylinder to rotate in the opposite direction. However, this is not the case [7.Nawrot M. Blake R. Neural integration of information specifying structure from stereopsis and motion.Science. 1989; 244: 716-718Crossref PubMed Scopus (129) Google Scholar]. We failed to observe a rotation aftereffect with a static test pattern. Second, neurons responsible for complex motion perception show a large degree of position and scale invariance [23.Andersen R. Neural mechanisms of visual motion perception in primates.Neuron. 1997; 18: 865-872Abstract Full Text Full Text PDF PubMed Scopus (86) Google Scholar, 25.Sakata H. Shibutani H. Ito Y. Tsurugai K. Mine S. Kusunoki M. Functional properties of rotation-sensitive neurons in the posterior parietal association cortex of the monkey.Exp. Brain Res. 1994; 101: 183-202Crossref PubMed Scopus (47) Google Scholar], but, here, the aftereffect observed was quite specific in location and size. Third, the aftereffect is not tied to the structure of the adapting [21.Nawrot M. Blake R. The interplay between stereopsis and structure from motion.Percept. Psychophys. 1991; 49: 230-244Crossref PubMed Scopus (81) Google Scholar, 22.Nawrot M. Blake R. On the perceptual identity of dynamic stereopsis and kinetic depth.Vision Res. 1993; 33: 1561-1571Crossref PubMed Scopus (40) Google Scholar] or testing stimulus. We observed that, after adaptation to the stabilized rotating cylinder, two flat sheets of oppositely moving dots with zero relative disparity showed a depth order consistent with the prediction of the disparity adaptation contingent on motion direction. We favor the interpretation that the aftereffect is a motion direction-contingent disparity aftereffect, similar to that proposed by Nawrot and Blake [7.Nawrot M. Blake R. Neural integration of information specifying structure from stereopsis and motion.Science. 1989; 244: 716-718Crossref PubMed Scopus (129) Google Scholar] (see Figure 4B). However, the key difference between our results and the results of Nawrot and Blake is that Nawrot and Blake found nonzero relative disparity between the two sets of dots moving in opposite directions, whereas in our experiment the two sets of dots had zero relative disparity. In other words, we believe that the kinetic depth adapted disparity-sensitive neurons as if they had nonzero relative disparities. This interpretation implies that, within certain limits, kinetic depth indeed is equivalent to the disparity depth in the sense that the disparity-tuned neurons are selectively responsive to depth signals defined by motion. Nawrot and Blake (1993) showed that disparity and kinetic depth could be perceptually metameric [22.Nawrot M. Blake R. On the perceptual identity of dynamic stereopsis and kinetic depth.Vision Res. 1993; 33: 1561-1571Crossref PubMed Scopus (40) Google Scholar]. Here, our experiments suggest that the two mechanisms can cross-adapt, which is a stronger indication that the two have shared neural mechanisms. In 2D motion, attentional tracking can induce a motion aftereffect when tested with a dynamic or flicker stimulus [26.Culham J. Verstraten F. Ashida H. Cavanagh P. Independent aftereffects of attention and motion.Neuron. 2000; 28: 607-615Abstract Full Text Full Text PDF PubMed Scopus (68) Google Scholar]. Attention was also shown to modulate the adaptation to 3D rotation [27.Shulman G.L. Attentional modulation of mechanisms that analyze rotation in depth.J. Exp. Psychol. Hum. Percept. Perform. 1991; 17: 726-737Crossref Scopus (32) Google Scholar]. Can attentional tracking account for our observation? We tested this possibility by reducing the number of dots in the disparity-defined, unambiguous rotating cylinder while preserving the perception of a rotating cylinder. The logic is that the attention system tracks the direction of rotation, whether there are 600 or 30 dots, but a system that depends on the energy of the motion and disparity signal would be much less stimulated by the 30 dots than the 600 dots. If the aftereffect were due to attentional tracking, then we would expect that tracking 30 dots should also generate an aftereffect. However, we failed to observe an aftereffect when we reduced the number of dots, suggesting that the aftereffect was not due to attentional tracking. Contextual and pictorial information can disambiguate and stabilize an ambiguous kinetic stimulus. The stabilized ambiguous motion can generate a consistent aftereffect. The aftereffect observed is likely to be a motion direction-contingent disparity aftereffect, originated from the neuronal equivalence between disparity and motion parallax.

Ver no editor

Altmetric

PlumX

Entrar

Lembrar minha senha

Receber meu e-mail de confirmação

Stabilized Structure from Motion without Disparity Induces Disparity Adaptation