Long-range interactions in visual perception

Transcription

Long-range interactions in visual perception
REVIEW
L. Spillmann and J.S. Werner
– Long-range interactions
Long-range interactions in visual perception
Lothar Spillmann and John S. Werner
Classical
receptive-field
conceptshavebeenusedto explainlocalperceptualeffectssuchasborder
contrastand Machbands,but are not suficientto explainglobalperceptualeffects.Examplesare
the perceptionof illusorycontours,areacontrast,colorconstancy,
depthplanes,coherentmotion
and texture contrast.Thesediverseeffectsrequire neurophysiological
mechanismswithinthe
visualpathwayswith long-rangeinteractions.Candidatemechanismsare suggested,including
convergingfeedforwardprojectionto accountfor the emergenceof newresponsepropertiesat
higherlevels,recruitmentof lateralconnectionsto compensatefor lossof afferenceand explain
filling-in,andre-entrantprojectionsfrom higherlevelsusingsynchronization
of neuronalresponses
to accountfor binding.
Trends Neurosci. (1996) 19, 428-434
I
Lothar Spillmcmnis
at the Instituteof
Biophysicsand
RadiationBiology,
Brain Research
Unit,LJniversi~of
Freiburg,
Hansastrasse9,
D-79104 Freiburg,
Germany,and
John S. Werner
is at the
Deptof Psychology,
Universityof
Colorado,Boulder,
CO80309-0345,
USA.
428
N THEIR PIONEERINGSTUDIES of contrast
phenomena, Mach’ and Hering23in the last century
proposedthat each region of the retina might interact
with many other distant regions (’Wechselwirkung
der Sehfeldstellen’).Fiftyyears later, the Gesaltists, in
their attempt to understand phi motion (that is,
apparent motion between two successivestimuli presented in differentlocations) and figure-groundsegregation’, called for similar interactions in the realm
of form and motion perception. In many demonstrations they showedthat the appearanceof a given
part of the stimulus pattern depends on the context
within which it is embedded.It is only recently that
the anatomicaland physiologicalmechanismsfor these
figure-groundrelations(Wertheimer’s‘Querfunktionen’
or lateral functions) have come under scrutiny.
The classical receptive-field concept originating
with the work of Kuffler5and Barlow6provided the
basis for contemporary ideas about neural representations of visual form. Yet, correlations between perception and center-surround antagonism of retinal
receptive fields have been limited to short-range
effects such as Mach bands7and illusory spots in the
Hermann grid (for further examples, see Refs 8-10).
Such short-range interactions cannot explain more
complex aspectsof perception involving spatiallyseparated areas in the domains of illusory contours,
brightness, color, depth, motion and texture. Longrange interactions, primarily in the cortex, are
required.
It is now accepted that in many visual neurons,
stimuli presented outside the classical receptive field
exert a strong and selective influence on the response
to stimuli presented within the classical receptive
field”. Among the earliest examples of long-range
interactions was the peripheryeffect12,which showed
that the response of retinal ganglion cells in the cat
can be facilitated by moving a stimulus in the far
peripheryof the visual field. A similarinteraction was
demonstratedin retina and lateral geniculate nucleus
with a shifting grating surround13’14.
The perceptual
significance of these effects is unclear, although they
might play a role in visual attention and suppression
during saccadic eye movements15.The problems disTIM Vol. 19, No. 10, 1996
cussed here, however, are not with physiological
mechanisms in search of perceptual correlates, but
with perceptual phenomena in quest of neuronal
mechanisms.
In this paper, we present six phenomena requiring
an explanation in terms of long-rangeinteractions in
the visual field. The mechanisms proposedto account
for these interactions are: (1) converging feedforward
projectionsleadingto the generationof new receptivefield propertiesat higher levels; (2) horizontal intrinsic connectionsprovidingfor interactionwithina single
visual area; and (3) re-entrant feedback projections
divergingto cells at lower levels which respond selectively to specific properties of a stimulus. It has been
argued that the divergent feedback associated with
this latter mechanism might utilize synchronization
of neuronal responses as the basis for binding. The
six phenomena, all of which can be observedunder
naturalconditions,and their hypotheticalexplanations
by long-range interactions constitute a challenge for
neurophysiologists as they shift their attention
increasinglyfrom local to global mechanisms of visual
perception.
Illusorycontours
andbrightness
enhancement
In nature we rarely experience an unobstructed
viewof an object; most objects are seen only partially,
due to occlusion by other objects. Despiteincomplete
views, however, there is seldom confusion about
which parts belong together. A partially occluded
object is perceivedas unitary accordingto the Gestalt
factors of good continuation, closure and the principle of ‘Pragnanz’ (figural goodness). Similar principles applyto the perception of an illusoryfigurein a
seeminglyincomplete stimulus configuration such as
the Kanizsa triangle (Fig. 1), where an area Of uniformly enhanced brightnessbounded by illusorycontours is generated by the corners of a hypothetical
triangle.There is no luminance differencebetweenthe
inside and outside of the perceived figure, yet the
triangle appearsto be lighter than the background.
Although it is tempting to assign illusory contours
(and surfaces)a role in everydayperception, they are
in fact rarely if ever seen in a natural setting, Also,
Copyright 431996, Elsevier Science Ltd. All rights reserved. 0166-
2236/96/$15.00
PII: S0166-2236(96)1
OO38-2
L. Spillmann and J.S. Werner
unlike recognition of a partially hidden object which
will occur even if the occlusion is ‘jagged’, illusory
contours requiresmooth gapsand closely alignedsegments. Neurons with classical receptive fields cannot
account for such contours; long-range connections
within a larger‘responsefield’ able to mediate perceptual completion acrossgapsof severaldegreesof visual
angle are needed.
Peterhans, von der Heydt and Baumgartner16’17
found that approximately one third of the magnocells in area V2 of the macaque monkey, the second
largest cortical area for visual processing, are responsive to cues eliciting the perception of subjective
contours in human observers. End-stoppedreceptive
fields of these cells are hypothesizedto fill-in the gap
and restore the (incomplete) Gestalt by connecting
the parts with an illusory border. The parallel behavior between these cells and the percept suggeststhat
one is a neural substrate for the other. Related
phenomena implying long-distance interactions
might be similarly explained. For example, illusory
stripes (’phantoms’) can be observedmoving through
an empty horizontal gap in phase with two vertical
gratings on top and on the bottom’g. The uniform
brightness enhancement observed in the Kanizsa
triangle is an example of filling-in as discussedbelow.
REVIEW
– Long-range interactions
(1)
(2)
Areacontrast
andfilling-inof brightness
andcolor
The fundamental cues defining figure-ground segregation in a stationary stimulus configuration are
brightness, color, binocular disparity,motion and texture. In this section, we concentrate on the first two of
these cues. Areaspossessingthe same brightness and
color are usually seen as one, figure or ground. This
segregationis supportedby contrast mechanisms that
enhance the difference between adjacent regions of
the visual field. However,border contrast alone is not
sufficient to describeour visual experienceunder normal conditions; enhancement of surfacesby area contrast is equally important. For example, a grey square
of given luminance presented on different shades of
greywill be perceivedas having greatlydifferentlightnesses. Or, a yellow disk if surroundedby an achromatic annulus of variable luminance will turn brown
and ultimatelyblack as the luminance of the surround
is progressivelyincreased above the luminance of the
center (’Schwarzverhullung’
23’19).A striking example
of area contrast, known as colored shadows, was
describedby Goethezo. His illustration is presentedin
Fig. 2. Here, an area of broad-bandillumination, that
would appear white under ordinary viewing conditions, looks bluish when surrounded by yellow.
Unlikeborder contrast, which enhances the difference
between figure and ground, area contrast serves to
homogenize regions of different color and brightness
within the figure.
What might be the processes and mechanisms
responsiblefor brightnessand color perceptionwithin
uniformly stimulatedareasboundedby an edge?Most
neurons in areaVI and V2 do not respondto constant
illumination within their receptive fields. Absence of
response is ambiguous- it occurs in the presence of
many stimulusconditions. Yet we do not see outlines,
we see uniform surfaces bordered by contours. The
need for signalsfrom contours to prevent a perceptual
void becomes apparent when the image is stabilized
on the retinazl.Within seconds the entire perceptwill
Fig. 1. Kanizsa Triangle. This pattern gives rise to a subjective triangle (left), which is obo/ished when the inducing cuesare closedby thin curves(right). End-stopped cellsin area VI are
assumed to mediate the subjective contours based on receptive fields shown schematically by
an elongated excitatory region and an inhibitory end zone (depicted in gray). They are ideally
suited for detecting terminations of edges and lines, such as at corners. The bottom scheme
shows a hypothetical feedforward circuit to account for the illusion. The contour cell (2) in V2
is assumed to sum the signals of two para//e/ paths: (1) an edge-detecting path responsiveto
real contours of a given orientation; and (2) a grouping path integrating responsesto stimuli
oriented orthogonally to the contour. It is assumed that the receptive fields of the grouping
path are distributed along the receptive field of the edge-detecting mechanism and that their
inputs are mukiplied (X) before being summed (X). As this mechanism cannot distinguish
between signak from either path, we perceive a faint edge where in fact there is none.
Modified from Ref. 16.
Fig. 2. Coloredshadaws (simultaneous color contrast). A white light
is used to illuminate a surface from the left whi/ea yellowlight i//uminates it from the right. An object placed in the center will cast shadows
of the object that differ in color. In the area that receivesno white light,
the shadow is a saturated yellow, while the area receiving only white
light appears bluish. Colored shadows are frequently observed on the
snow and under intense spectral light sources. Modified from Ref. 20.
TINSVol. 19, NO. 10,1996
429
REVIEW
L. Spillmann and J.S. Werner
– Long-range interactions
disappearor fade (Troxler-effect;local adaptation). If,
however,the stimulus contours are displaced,as with
involuntary eye movements, the percept reappears.
We conclude that in order to sustain the brightness
and color of the central area, the signals corresponding to the edgesmust be constantly revived.
That these interactions dependon global, not local,
mechanismswasshownby Gerritsand Vendrik22,who
observed that colors in stabilized retinal images
encompassing the blind spot were perceived as uniform disks. This is a process known as filling-in. The
scotoma correspondingto the physiologicalblind spot
can be madevisible only with specialtechniques, such
as using small perimetrictargets. If a scotoma is mimicked by a large uniform disk presented on a colored
or textured background it will also tend to fill-in by
acquiring the spatio–temporal properties of the surround. For example, a largered diskshown on a green
background will rapidly disappearfrom vision to be
replacedby a uniformly green surface if the border is
blurred or stabilized on the retina23.Similarly, a uniformly grey target on a dynamic noise background
will temporarily fade from view to be filled-in by
noisez’z’.
Recent evidence shows a neurophysiologicalcorrelate of this adaptation in area V3 (and to a lesser
degree area V2) of the behaving monkey: within seconds of being presentedwith a textured stimuluscontaining an ‘artificial scotoma’ (producedby a white
square), the initially lowered activity of a cell will
recoverand approachthe activity elicited by the same
stimulus,but no scotoma26.The deficit of neural activity within the scotomatous area (and its presumed
compensationby lateralinteractions)is in a wayanalogous to filling-in of the blind spotz’.To explain these
percepts, long-range signals arising at the edge and
propagatedfrom the boundary to the silent neurons
representing the enclosed region are required. The
ambiguity associated with the absence of neuronal
response makes it possible for lateral interactions to
impart a signal to those neurons that is consistent
with the stimulusas encoded at the edge.
Colorconstancy
Color constancy refers to our ability to perceive a
given color as the same despite changes in ambient
illumination. This ability is essential for learning the
identity of objects. It has long been known that the
brightness and color of an area depend crucially on
the illumination of the surround, particularlyat the
edges as was discussedfor area contrast. In complex
scenes, overall increasesin illumination might be discounted by reciprocal lateral connections: the effects
of light in the center are offset by the effects of light
in the surroundsuch that the net effect is to promote
brightnessand color constancy23’28.
The importanceof
integrating information over large regions of space to
maintain color constancy was emphasizedby Landzg,
who showedthat spatiallycomplex stimuluspatterns
(patchwork of color called ‘Mondrians’ after the
works of the Dutch painter Piet Mondriaan) tend to
maintain their appearance despite large changes in
the intensity and spectral distribution of the light
source. Shown in isolation, individualregions of the
stimulus pattern would assume a different hue with
changes in the intensity or spectraldistributionof the
illumination.
430
TINS Vol. 19, No. 10, 1996
Land proposed an algorithm for computing colors
that wouldsupportconstancy. This algorithm is based
on a comparison of three receptor-specific lightness
values from each surface in the scene with those of
other surfaces. One way of implementing this algorithm would be to move a photosensitive ‘mouse’
along arbitrarytrajectories from the perimeter of the
Mondrian to the surface under consideration. By
crossing a number of borders, a comparison can be
madebetweenthe reflectanceratios for a given surface
relative to that of neighboring surfaces. Alternative
schemes have been proposed (for example, Ref. 30),
but most retain the important notion that constancy
depends on long-distance comparisons between different regionswithin a given pattern.
Long-rangecomputations involved in color induction and filling-in of large areas might be generated
alreadyat the level of the retina31or lateral geniculate
nucleu532,
but
the
global
computations required for
color constancy most likely involve cortical interactions between both hemispheres. The importance of
transcortical connections in color constancy was
demonstratedin a patient whose corpuscallosum had
been transected surgically to control his epileptic
seizuresjj. For this individual, color constancy for
stimuli falling on different hemifields was severely
impairedrelativeto normal observers.This case study
suggeststhat, given the appropriatestimuli, color constancy can be mediated by communication between
the two hemispheres. Where might these long-range
computationsbe carriedout in the visual system?The
answeris not known, but it is likely that they depend,
at least in part, on area V4, where cell responses are
modifiedby both local and global chromatic surround
information includinginformation coming from deep
inside the ipsilateral visual hemifield34.In primate
area V4, Zeki35has described‘perceptive’neurons that
respond in a context-dependent manner, and seemingly consistent with the perceived color of a stimulus, rather than its wavelength. The inputs from the
complex surroundsof these cells have not been quantified, but Zekinotes that they are long-rangeand defy
descriptionin terms of the classicalreceptivefield.
Theperception
ofdepthplanes
A phenomenon analogousto the uniform filling-in
of boundedsurfacesis the uniform induction of depth
from the edges onto an enclosed area. This is called
‘depth capture’. However,rather than by luminance
or color contrast, the perception of depth planes is
defined by the lateral disparities at the borders. For
example, an outline squarein which the vertical lines
for one eye are displaced laterally (so as to produce
crossed or uncrossed disparity) is perceived as uniformlyelevatedor recessedin depth in spiteof the fact
that the area between the lines providesno disparity
information. It appearsthat the entire enclosed uniform surface assumes a different depth plane based
only on the cues,providedby the edges. This ‘depth
capture’ is conceptually similar to the filling-in
processes discussed earli&. To explain these effects,
one has to assumelong-distanceinteractions.
This assumptionis also supportedby the occurrence
of depth contours in random-dotstereograms36.Here,
none of the monocular images contains contours
to define a figure. The figure emerges only after
binocular fusion has taken place (form-from-depth).
L. Spillmann and J.S. Werner
REVIEW
- Long-rangeinteractions
To extract depth from such a stimuluswould seem to
require a global comparison of the two monocular
imagess7.While there is evidence suggestingthat the
correspondence problem might alreadybe solved by
VI neurons within local regions of the random-dot
stereogram38, the contour separatingthe depth Planes
appearsto be made explicit only by cells in area V2
(Ref. 39). This contour almost certainly requires a
global mechanism as it does not connect individual
dots to make an irregular or jagged contour, but
instead follows a perfectly smooth straight or curved
line. Little is known about the algorithm used by the
brain to perform this smoothing operation.
Coherentmotion
Groupingof stimuli by coherent motion is a powerful effect to be attributedto binding. Figure3 servesto
illustratehow an invisible figurecomposedof dots on
a dotted background will ‘pop out’ at the slightest
movement by virtue of the Gestalt factor of common
fate. The figurewill also be perceived,but not as well,
when only the background dots move or when both
the dots representingthe figure and those representing the ground move in different directions or at different speeds.Despitethe coarsenessand irregulardistribution of the dots the word MOTION emerging
from the noise field tends to have crisp, well-defined
edges. It also is seen as lying slightly above the background.This is similarto the stratifiedsheets of noise
sliding past each other when two groups of random
dots moving in different directions are spatiallysuperimposed’”.The high spatial resolution needed for the
edge extrapolation and the low resolution needed for
the stratification require mechanisms analogous to
those that produce smoothing and filling-in of depth
planes in random-dot stereograms.Single-cell studies
in the macaque monkey have yielded neuronal correlates of motion segregationin areaV1 (Ref. 41) and of
motion transparency in the medial temporal area
(area MT)Q. Motion filling-in also OCCS.US when ‘ne
observesa large figure such as a homogeneous white
square moving over a black background. In this
example, motion signals originate only at the boundary of the figurewhile no signalsare generatedby the
interior. Yet, we see the whole figure moving instead
of just accretion at the leading edge and decretion at
the trailing edge. This might be called motion capture
by analogy to depth capture.
Using both psychophysical and electrophysiological methods, Niedeggenand Wist43showedthat only
a small percentageof spatiallydistributed,coherently
moving dots in a dynamic random-dotfield were sufficient to support figure-ground segregation. That
coherent motion does not require identical directions
and speeds for all elements in an array is demonstrated by biological motion such as observed when
small lights are attached to the main joints of the
body of a dancer in the dark44.One can easily group
the varyingtrajectories of the individuallights to perceive correctly the movements of the dancer. What
might be the cellular basis of motion coherence?
Britten and colleagues’sidentified cells in area MT of
the monkey that respondin a manner that is directly
parallel to the behavioral findings. These cells have
large receptive fields and are highly sensitive to the
coherent translation of dots within a random-dot
kinematogram. The ability of neurons from area MT
Fig. 3. Coherent motion. If one makes a transparency af the word
MOT/OIV in a photocopying machine and over/ays it onto the rcmdomdot background, one perceives figureqround segregation through
coherent motion. The individual letters will be invisible as long as the
foi/ is kept stationay, but will immediate/y pop-out when it is moved
relative to the background. This is an example of grouping by the
Gestalt factor of common fate and might be attributed to synchronization of neuronal responsesof movement-detecting cells in the medial
temporal area (MT), a process called binding.
and the medial superiortemporal (MST)area46to integrate motion information (including constriction,
expansion and rotation) across large portions of the
visual field requiressubstantialconvergenceof inputs
from earlier cortical levels, such as V1 and V2. The
generation of receptive fields that can mediate this
long-range integration is likely to be accomplished
through the convergenceof feedforwardconnections.
Convergingfeedforwardprocessingis also likely to
mediate the perception of apparent motion as
described by Wertheimer4. Thresholds for apparent
motion in humans and monkeys were measured by
Newsomeand colleagues47,and their psychophysical
resultswerecomparedwith cell recordingsin areasV1
and MT. The authors conclude that MT neurons are
responsiblefor the perception of fast apparentmovement, whereasboth VI and MT neurons are responsible for slow apparent movement. They suggestthat
the neuronal substratefor apparentmotion might be
distributedover multiple cortical areas, dependenton
the speedand the spatial interval of the stimulus.
Texturecontrast
There are severalstrikingexamplesof texture segregation that suggest the involvement of long-range
neuronal interactions across the visual field. The sudden emergence or ‘pop-out’ of a stimulus element
embedded within a matrix of elements differing in
orientation, color, polarity, size, shape and texture
requires fast long-distance comparisons without any
effort or conscious attention by the observer. These
processesare independentof the numberof distracters
and have been attributed to parallel (versus serial)
search. They are, therefore, called preattentive4&50.
It has recently been shown that neurons in area VI
respond dependingon the perceptualcontext within
TINSVO1. 19, No. 10,1996
431
REVIEW
L. Spillmann and J.S. Werner
- Long-rangeinteractions
A
B
Receptivefields
Corticalcells
Fig. 4. Filling-in of o retinal scotonxz (A) Shaded areos illustrate the receptive fields of cortical cellsA, B and C. The crrrowsrepresent intracortical connections from cellsA and C onto
cell B. The inputs provided by these /otera/ connections are assumed to be relatively weak and
not detectable under normal conditions ofihmrination. (B) Followingloss of primary afference
due too retinal lesion, cell B will cease to respond. However, after only a few minutes it can be
reactivated by stimulation of the shaded oreas surrounding the /esion. This enlargement of the
effective receptive field is mode possible through ‘recruitment’ of horizontal connections with
cc//s A and C (thick arrows). The perceptual consequences of the recruitment might be a
fi//ing-in of the scotoma from the surround such that a uniform field is perceived despite the
absence of stimulation within the /esioned area.
which the stimulus is placed. For example, the
responsesof many neurons were suppressedmarkedly
by line textures extending beyond the classicalreceptive field51-53.
These regions are normally silent, but
will inhibit or facilitate the neuronal response if
stimulatedin conjunction with the classicalreceptive
field. Accordingly, the response to an optimally
oriented bar was enhanced if the surroundingtexture
was oriented orthogonally. In other words, responses
of these cells were consistent with an increased perceptual salience, analogouslyto ‘pop-out’.Pop-outby
orientation contrast is paralleledby a similarenhancement due to motion contrast when a cell’s responseis
facilitated by antiphase movement in its receptivefield surroundSA,SS.
Such neurons might be inS.tmmental in mediating figure+groundsegregation.
Potential
mechanisms
Neurophysiological and anatomical studies have
provided evidence that the visual system processes
global information in at least three different ways:
(1) converging feedforwardprojections from lower to
higher levels; (2) recruitment of lateral connections
within the cortex to compensate for loss of afference
in deafferented cells; and (3) synchronization of
neuronal responses and ‘tagging’ of lower-level cell
activity by re-entrant signals.All of these mechanisms
are relevant to the understandingof the phenomena
describedhere.
Traditionally, hierarchical feedforward processing
has been assumedwhereby new receptive-fieldproperties evolve from convergent inputs from lower-level
neuronss~ss.The idea here is that widely separated
neurons at lower levels become interacting neighbors
at higher levels through remapping. We now know
that the projection from areas VI and V2 to area MT
and the inferotemporalcortex (IT) is not strictly topographic;spatially,the receptivefieldsof these areasare
quite large with a substantial overlap. Barlow has
arguedthat these non-topographicprojections are not
random, but occur in an orderlypattern accordingto
stimulus dimensions (see also Ref. 59). He assumed
that all information on one aspect of the stimulus(for
example, motion in one direction, one velocity) is
brought together from a wide region of the visual
432
TINS VO1. 19, No. 10, 1996
field. The large number of specializedpathways and
cortical areas with their anatomically identified patterns of connectivity known to servevision60,61
would
supportthis bottom-up approach.
Amongthe six phenomena described,the filling-in
of illusory contours and area brightness lends itself
most clearlyto an explanation in terms of converging
feedforward projections. Peterhans, von der Heydt
and Baumgartner16’17propose that multiplicative
inputs from end-stopped cells in area VI, tuned
orthogonally, are required for generating the illusory
contours of the Kanizsatriangle. A subjective contour
is signaledwhen a higher-ordercell in area V2 is activated concurrently by two or more of these distant
inputs. Since the same neuron might also respond to
a physical line in the same location as the subjective
contour, the output of the cell is similar for the two
types of stimuli, as is their appearance.Furthermore,
when the sectors inducing a subjective contour are
closed by thin curves, as illustrated in Fig. 1 (right),
the neuronal response breaks down and so does the
illusion. In the context of the model by Peterhansand
colleagues,this wouldbe expectedbecausethe closing
lines fall on the inhibitory end-zones of the endstoppedcells and thus reducetheir activation.
A second kind of neural processing to explain
filling-in involves horizontal intrinsic connections
within a single visual area. It is now known that
dramatic dynamic changes occur in the size and location of cortical receptivefields when retinal activity
is preventedfrom reachingits target areaby deafferentation. For example,when a scotoma is inducedin cat
retinae by photocoagulation, receptive fields of cells
in the lateralgeniculatenucleus62and striatecortex63’64
that had previouslyreceived input from the lesioned
areabecome displacedand enlarged.This is illustrated
in Fig.4. The finding that the initially silencedneuron
can be re-activated,within minutes, by stimuli falling
onto the edge of the lesion65)66
suggestsa mechanism
that mediates input through lateral connections. An
alternative explanation of short-term plasticity in
terms of an overall increase in cell responsiveness,
rather than a dynamic alteration of receptive-field
structure,has been proposed67.The functional potentiation resultingin either case wouldlead to a perceptual filling-in of a gap, such as a retinal scotoma,
although the much slower recovery in patients
suggeststhat sproutingmust also take place.
If deafferentationfollowinga lesion leads to fillingin by lateral interaction, it would also be expected to
occur when there is no patterned input due to a mask
covering a certain region of the retina (artificial scotoma). Indeed,in the cat it has been shown that when
the area from which a given neuron can be driven is
maskedby a black diskand a gratingpattern is moved
in the immediatesurround,the receptivefield expands
up to five times its original size, enabling cells to
become responsiveagain68.Similarly,followingpartial
occlusion of a dynamic noise stimulus,cells inarea V3
of the monkey will compensate rapidly for the
reduced retinal stimulation with an increase in
response,presumablydue to an unmasking(disinhibition) of previously ineffective excitation from well
outsidethe classicalreceptivefield26.
An even larger expansion has been reported by
Fiorani and colleagues69,who obtained responses in
area VI of the monkey when stimulating two regions
L. Spillmann and J.S. Werner
on either side of the optic disk that were 15 degrees
apart. These distances are up to IOtimes the lengths
of conventional receptivefields,implyinga functional
(and structural) convergence much larger than
hitherto thought. Although the blind spot is present
from birth and thus represents a special case of a
scotoma, the physiological findings suggestthat our
unawareness of it might be based on a filling-in
process that uses mechanisms similar to the intracortical connections underlying filling-in of other
kinds of scotomas.
Filling-in by horizontal interaction might also be
responsible for uniform brightness and color perception in extended areas. One possible manifestation of
this type of neural processingis illustratedin Fig. 5 for
the Craik–O’Brien-Cornsweet illusion. Contrary to
the luminance profile, the perceivedbrightnesswithin
the central area is uniformly low, whereas in the surround it is uniformly high. Clearly, these two brightness levels must be defined by the relative neural
activities at the border. Horizontalconnections of up
to 6mm between spatially separated hypercolumns
have been found”71 that could mediate propagation
of edge signals and thereby ‘assign’ different brightnesses and colors to regions of the visual field several
degreesaway. Polysynaptic cascades of local connections could carry these signals even further. It is
important to add that the distribution of neural activity measured by microelectrode recording is only
about 5% of that found by optical recording,in agreement with what might be needed to explain many
long-range perceptual effects’z. A similar interpretation applies to the segregation into smoothly delineated depth planes. Modelsassumingconnectionist
networks73and boundary-triggereddiffusion74have
been proposedto account for these effects.
A third neural process that supports global integration involveslong-rangefeedbackprojections from
higher areas to group cell responsesat lower levels or
across hemispheres” by virtue of synchronization of
neural activity among separateareas. These top-down
projections need not be confined to the cortex. It is
known, for example, that the feline cortex has massive
projections to the lateral geniculate nucleus”. It
should also be noted that top-down feedback from
higher areas does not necessarilyrequire synchronization, and conversely synchronization does not
necessarily involve feedback from higher areas, but
might be accomplishedby feedforwardor lateralinteractions in some yet unknown manner.
While binding by re-entrant signals might contribute to more than just one of the phenomena
described,it is most easilyunderstoodwith respectto
coherent motion. We know that spike activities
between remote brain areas can interact by way of
synchronization within a cell assembly, thereby producing a possibleneuronal basis for perceptualcoherence and connectivity7778.Such a scheme would serve
to enhance the salience of a figurewith a minimum of
neural connections: the significance of a neuron’s
activity would depend on the context of activity in
other neurons7g,a0.
For the multitude of random dots
comprisingthe wordMOTIONin Fig.3, the synchronization of activity in cells with separatereceptivefields
might provide the binding necessary to obtain
structure-from-motion. To confirm this hypothesis
an experiment combining single-cell recording and
REVIEW
– Long-rangeinteractions
Craik–
O’Brien–
Cornsweet
illusion
Luminance
profile
Receptive
fields
Cortical
cells
Fig. 5. Area contrast. The Craik-O’Brien–Cornsweet illusion (see
Ref. 70) is shown on the top with its corresponding luminance profile.
The Iuminances of the center and surround are identical, as can be
verified by placing thin strips of paper over the discontinuitiesindicated
by the luminance profile. Without the strips, the perceived brightness
within the central area is uniformly /ow while in the surround it is
uniformly high (center panel). It is assumed that the brightness in each
area is ‘assigned’ according to the brightness at the surrounding edges
with the gradual luminance change rectified. This homogenization of
brightness across a large surface might be attained in analogy to deafferentation by cells that propagate the edge signal across adjacent
areas where input is weak or lacking (cell B). The spread of uniform
brightness is hypothesized ta be due to recruitment of /ong-ronge /ateral connections from cellsA and C.
behavioraltesting is neededto be able to correlatethe
extent of synchronizationwith the perceptualsalience
of a stimulus.This experiment remains to be done.
What is the evidence for such a proposal?On average, each cortical area in monkey possesses connections t. 10 others81.These connections are between
areas at the same level as well as above and below in
the cortical hierarchy, giving cells in any tissue patch
distributedparallel access to information different in
both retinotopic location and sensory specialization.
Successful spike transmission to exploit these pathwaysrequiressynchronous dischargein widelyseparated areas. Striate and extrastriate neurons of both
cat82and monkef13 stimulated by coherent motion
show synchronizeddischarges(short bursts occurring
at 40-90 Hz), even though the individual stimulus
elements are spaced over areas much larger than
Hubel and Wiesel’s hypercolumns. This theoretical
framework might also account for the long-range
facilitation underlying the Gestalt factor of good
continuation84’85.
Global integration might sometimes be based on
more than one neural mechanism to produce a given
phenomenon. For example, in a neural-network
model, texture contrast has been attributedto feedforwardprocessingwith inhibitory interneuronsbetween
TINS Vol. 19, No. 10, 1996
433
REVIEW
L. Spillmann and J.S. Werner
– Long-rangeinteractions
orientation detectors8bas well as to lateral modulation
by orientationally tuned cells via horizontal connections8788.However,the same phenomenon could also
be interpretedin terms of binding through synchronization. Similarly,while coherent motion and motion
contrast were explained by synchronization’g, it
might equally be accounted for by convergent feedforwardprojections4b89.
To explain many of the phenomena of the Gestalt
school that have eludedneuroscientist for more than
50years, it appearsthat we must deal with the possibility that simple perceptsare the achievement of distributedprocessingby morethan one corticalarea.This
idea requirestwo tasksfrom neuroscience:to establish
mechanisms for the synchronization of information
and binding among multiple areas, and to give reciprocal projections a more central role in spike transmission. The real challenge, however,lies in the problem of demonstratingthat perceptualsalienceparallels
the pattern of cell responseswithin an assemblyin the
human brain. The perceptualphenomena discussedin
this paper require visual neurons that respond not
solelyto stimulusfeatureswithin a localizedarea,but,
equally importantly, to the context in which they
appear.This is consistent with the Gestalt credo that
the whole is different from the sum of its parts.
Selectedreferences
1 Mach, E. (1865) Sitzungsberichte der Wiener Akademie der
Wissenschaften, Mathematisch-Natrcrwissenschaftliche Classe,
ZweiteAbteilung, 52, 303-322
2 Hering, E. (1878) Grcmdziigeder Lehre vom Lichtsinn (Hurvich,
L.M. and Jameson, D., trans., 1964, HarvardUniversityPress),
Springer
3 Hering, E. (1907) in Graefe-Saemisch Handbuch der gesamterr
Augenheilkunde(Vol.3, 2nd edn) (Saemisch,Th., cd.), pp. 1-194,
W. Engelmann
4 Wertheimer, M. (1912) Zschr. Psychol. 61, 161-265
5 Kuffler,S.W. (1953)1. Neurophysiol.16, 37-68
6 Barlow, H.B. (19S3) J. PhysioL 119, 69-88
7 Ratliff, F. (1965) Mach Bands: Quantitative Studies on Neural
Networksin the Retina,HoldenDay
8 Jung, R. (1973) in Handbook of Sensory Physiology: Central
Processingof Visual Information (Vol. VII/3A)(Autrum,H. et aL,
eds), pp. 1-152, Springer-Verlag
9 Spillmann, L. and Werner, J. S., eds (1990) Visual Perception:
The Nercrophysiolog”cal
Foundations, AcademicPress
10 Spillmann, L. and Ehrenstein, W. (1996) in Comprehensive
Human Physiology. From Cellular Mechanisms to Integration
(Vol. 1) (Greger, R. and Windhorst, U., eds), pp. 861-893,
Springer-Verlag
11 Allman, J., Miezin, F. and McGuinness, E. (1985) Annu. Rev.
Neurosci. 8, 407-430
12 McIlwain, J.T. (1964) J. Neurophysio2.27, 1154-1173
Acknowledgements
13 Kriiger,J. and Fischer, B. (1973) Exp. Brain Res. 18, 316-318
We aregratefil for 14 Fischer, B., Barth, R. and Stemheim, C. (1978) Exp. Brain Res.
the valuable
COf?lr?WiltSby
H.B. Barlow,
R. Eclchorn,
A.K. Errgel,
C.D. Gilbert,
S. Lehar,
\.H.R.Maunsell,
[.1.Nelson,
P.H.SchilIerand
W. Singer.JSW is a
recipientof a Senior
ScientistAward
from the
Alexander-vonHumboldt
Foundation.
434
31, 235-248
Breitmeyer, B. and Valberg,A. (1979) Science203, 463-464
16 Peterhans, E. and von der Heydt, R. (1989) J. NeuroscL9,
1749-1763
17 Peterhans, E., von der Heydt, R. and Baumgartner, G. (1986)
in Visual Neuroscience (Pettigrew, J.D., Sanderson, K.J. and
Levick,W.R., eds), pp. 343–351, CambridgeUniversityPress
18 Tynan, P. and Sekuler,R. (1975) Science188, 951-952
981-986
19 Werner, J.S. et aL (1984) J Opt.Soc. Am. A 1,
20 von Goethe, J.W. (1810) Zur Farberdehre (J.G. Cotta, cd.),
Tiibingen
21 Riggs,L.A. et aL (1953) J. Opt. Soc. Am. 43, 495-501
22 Gerrits, H.J.M. and Vendrik, A.J.H. (1970) EXP.Brain Res. 11,
411-430
23 Krauskopf,J. (1963) J. Opt.Soc. Am. 53, 741-744
24 Ramachandran, V.S. and Gregory, R.L. (1991) Nature 350,
699-702
25 Spillmann, L. and Kurtenbach, A. (1992) Vis. Res. 32,
1941-1946
26 DeWeerd,P. et al. (1995) Nature 377, 731-734
27 Durgin, F.H., Tripathy, S.P. and Levi, D.M. (1995) Perception
24, 827-840
15
TINS Vol. 19, No. 10, 1996
28 Jameson, D. and Hurvich, L.M. (1961) Science133, 174–179
29 Land, E.H. (1959) Proc. Natl. Acad. Sci. U.S. A. 45, 115–129
30 Shapley, R. and Reid, R.C. (1985) Proc. NatL Acad. Sci. U. S. A.
82, 5983-5986
31 PCippel,E. (1986) Nature 320, 523-526
32 Creutzfeldt, O.D. et aL (1991) Exp. Brain Res. 87, 2245
33 Land, E.H. et al. (1983) Nature 303, 616-618
34 Schein, S.J. and Desimone, R. (1990) J. Neurosci. 10,3369-3389
35 Zeki, S.M. (1983) Neuroscience9, 767-781
36 Julesz, B. (1971) Foundations of Cyclopean Perception, University
of ChicagoPress
37 Nelson,J.L (1975) J. T’heor.Bio2.49, 1-88
38 Qian, N. (1994) Neural Computation 6, 390-404
39 Friedman, H. et aL (1996) Invest. Ophtha2moLVis. Sci. 37, S424
40 van Doom, A.J. and Koenderink,J.J. (1982) Exp. BrainRes.45,
179-188
41 Lamme, V.A.F. (1993) Nature 363, 541-543
42 Qian, N. and Andersen,R.A. (1994) J. Neurosci. 14, 7367-7380
43 Niedeggen,M. and Wist, E.R. (1992) Perception21 (Suppl.2), 32
44 Johansson, G. (1975) Sci. Am. 232, 76-88
45 Britten, K.H. et al. (1992) J. Neurosci.12, 4745-4765
46 Saito, H. et al. (1986) J. Neurosci. 6, 145-157
47 Newsome, W.T., Mikami, A. and Wurtz, R.H. (1986)
J. hJearophysioL55, 1340-1351
48 Beck, J. (1982) Textural Segmentation in Organization and
Representationin Perception(Beck,J., cd.), pp. 285-317, Erlbaum
49 Julesz, B. (1984) in SensoryExperience,Adaptation, and Perception
(Spillmann,L. and Wooten, B.R., eds),pp. 37-52, Erlbaum
50 Treisman, A. (1986) Sci.Am. 255, 114-125
51 Knierim, J.J. and van Essen, D.C. (1992) J. Neurophysiol. 67,
961-980
52 Sillito, A.M. et al. (1995) Nature 378, 492-496
53 Lamme, V.A.F. (1995) J. Neurosci. 15, 1605-1615
54 Hammond, P. and MacKay,D.M. (1981) J. Physiol.319,431442
55 Allman,J., Miezin,F. and McGuinness,E. (1985) Perception14,
105-126
56 Hubel, D.H. and Wiesel, T.N. (1962) J. Plrysio2.160, 106-154
57 Barlow, H.B. (1981) Proc.R. Soc.London Ser.B 212, 1-34
58 Barlow, H.B. (1985) Quart.J. Exp. PsychoL 37, 121-146
59 Nelson, J.L (1985) in Modek of the VisualCortex(Rose,D. and
Dobson,V., eds),pp. 108-122, Wiley
60 van Essen, D.C., Anderson, C.H. and Felleman, D.J. (1992)
Science255, 419-423
61 Maunsell,J.H.R. (1995) Science 270, 764-769
62 Eysel, U.Th. (1982) Nature 299, 442444
63 Gilbert, C.D. and Wiesel, T.N. (1992) Nature 356, 150-152
64 Chino, Y.M. et aL (1992) Vis.Res. 32, 789-796
65 Gilbert, C.D. and Wiesel, T.N. (1990) Vis.Res. 30, 1689-1701
66 Gilbert, C.D. (1992) Neuron9, 1-13
67 DeAngelis, G.C. et al. (1995) Proc. NatL Acad. Sci. U. S. A. 92,
9682-9686
68 Pettet, M.W. and Gilbert, C.D. (1992) Proc. NatL Acad. Sci.
U.S. A. 89, 8366-8370
69 Fiorani, M. et aJ. (1992) Proc. NatL Acad. Sci. U. S. A. 89,
8547-8551
70 Fiorentini, A. et aL (1990) in Visual Perception: The
Neurophysiologfca2Foundations (Spillmann, L. and Werner, J.S.,
eds),pp. 129-161, AcademicPress
71 Lund, J.S., Yoshioka, T. and Levitt, J.B. (1993) Cereb. Cortex3,
148-162
72 Das, A. and Gilbert, C.D. (1995) Nature 375, 735-736
73 Marr, D. and Poggio, T. (1979) Proc.R. Soc. London Ser. B 204,
301-328
74 Grossberg,S. and Todorovic, D. (1988) Percept.Psychophys.43,
241-277
75 Engel, A.K.et al. (1991) Science 252, 1177-1179
76 Sherman, S.M. and Koch, C. (1986) Exp. Brain Res. 63, 1-20
77 Eckhom, R. et aL (1988) Bio2.Cyberrr.60, 121-130
78 Gray, C.M. et al. (1989) Nature 338, 334–337
79 Singer,W. (1993) Annu. Rev. Physiol. 55, 349-374
80 Singer,W. (1995) Science 270, 758-764
81 Young, M.P. (1992) Nature 358, 152-155
82 Eckhom, R. et al. (1989) in Models of Brain Functioning
(Cotterill, R.M.J., cd.), pp. 255-272, CambridgeUniversityPress
83 Frien, A. et al. (1994) NeuroReport5, 2273-2277
84 Field, D.J., Hayes, A. and Hess, R.F. (1993) Vis. Res. 33,
173–193
85 Polat, U. and Sagi, D. (1994) Proc. Natl. Acad. Sci. U. S. A. 91,
1206-1209
86 Stemmler, M., Usher, M. and Niebuhr, E. (1995) Science 269,
1877-1880
87 Ts’o, D.Y., Gilbert, CD. and Wiesel, T.N. (1986) J. Neurosci. 6,
1160-1170
88 Kapadia,M.K. et al. (1995) Neuron 15, 843-856
89 Barlow, H. (1995) in Large Scale Neuronal Theories of the Brain
(Koch,C. and Davis,J.L., eds),pp. 1-22, MITPress