Intersensory Interactions and Human Computer Interaction

Transcription

Intersensory Interactions and Human Computer Interaction
Intersensory Interactions and Human
Computer Interaction (HCI)
Intersensory Interactions
• Intro and metacognitive gap
• Integrating cogsci theory with design
• Cognitive Architecture
Brian Fisher
– Modularity and multimodal interaction
• Information hiding-- conflict resolution
• Cognitive impenetrability
• Performance differences between modules
• Recalibration
– Spatial indexes in complex environments
THE UNIVERSITY
OF BRITISH COLUMBIA
Key Points for HCI Practice
• The Metacognitive Gap: The need for a
grounded Cognitive Science approach
• Reflective design practice methods
– Integrating CogSci with interaction design
– Iterative design cycle
• Multimodal cue matching within modules
Fels: Design of Interactive Multimodal Graphics
Introduction
History -- why we need to adapt HCI
methods for multimodal interaction
• Examples of extended HCI
– Cognitive architecture: Multimodal displays and
how they are understood
– Situated cognition: embodied interaction with
complex displays
Fels: Design of Interactive Multimodal Graphics
Fels: Design of Interactive Multimodal Graphics
1
HCI History
Ergonomic HCI domain limits
• “Classic” HCI: User/Task/Tool Model
User
– Task, Protocol & GOMS/keystroke analysis
• Command-line, menus, workplace systems
• Theoretical underpinnings
– Cogsci of conscious thought
– Learning, Memory, Reasoning
– Sequences of operations
Fels: Design of Interactive Multimodal Graphics
Evolution towards multimodal displays
Task
Tool
What if I am doing this for fun?
What if I want new insights?
What if I want to communicate with someone?
What if I am exploring a complex environment?
Fels: Design of Interactive Multimodal Graphics
Iterative development cycle
• WIMP interface: visual semantics
– Metaphorical tool icons on desktop
– Direct manipulation
• Information visualization: Information
processing in the visual system
– Visual analogs of information
– Spatial Instruments
Walkthrough or experiment
Design
Test
How?
What?
Implement prototype
Spiral with increasing detail and test specificity
Fels: Design of Interactive Multimodal Graphics
Fels: Design of Interactive Multimodal Graphics
2
Future challenges
• Perception, cognition, and action in an immersive
multimodal environment populated with objects
events and actors
• Applications in entertainment, cognition,
communication
• Blend of virtual and real spaces… with seams
– Are the rules consistent?
Opportunities for creative design
• Environments: Affordances for exploration
– Spatial cognition, human space constancy theory
• Support for creative & logical thinking
– Problem solving, embodied cognition models
• Media-based communication & collaboration
– Metacognition, distributed cognition
• Experience (Kansei) engineering: Moving
beyond usability
– Can users shift between them?
– Can frames support rule shifts?
Fels: Design of Interactive Multimodal Graphics
Effective Interface Design for Rich
Sensory Environments
The interaction between display characteristics and
the information processing characteristics of the
user’s perceptual, motor, and cognitive processes
will largely determine interface performance
Fels: Design of Interactive Multimodal Graphics
Metacognitive Gap
• Intuitions about thoughts,goals and plans (“folk
Psychology”) are reasonably accurate
• Intuitions about how people see, hear, and
remember are very inaccurate
• Lack of awareness of the limits of intuition is
the “Metacognitive gap”
• Design-by-intuition leads to bad user
experience
Fels: Design of Interactive Multimodal Graphics
Fels: Design of Interactive Multimodal Graphics
3
Evolving interaction design models
• Guidelines are (still) inadequate
• User-centred design inadequate for
rich sensory environments
• Need for theory-rich, evidence-based
approach: design for lower-level
processes
• Must integrate with higher-order
cognitive task analysis and usercentred design practices
Fels: Design of Interactive Multimodal Graphics
Test: walkthrough, field study or experiment
Exp:“Lab sense”,
FS: Observation
Information
foraging
Interaction Design
Ideas and hypotheses
Evaluation,
Mapping
Fels: Design of Interactive Multimodal Graphics
• Convincing designers that there is something to
understand—the “metacognitive gap” of folk
Psychology
• Convincing Cognitive Scientists to answer relevant
questions—Complex data displays and multiple
tasks and the Psychophysics reductionist approach
• Integrating research and design—Finding a
common language
Fels: Design of Interactive Multimodal Graphics
Reflective HCI Practice (Schön)
Implement
Bridging the theory/practice gap
(Online) Literature:
Psychology
Kinesiology
Sociology
Anthropology
Architecture…
Fields of interest: Experimental
Psychology:
– Founded~ 100 years ago
– Areas of study
• Psychophysics—Vision, hearing, tactile
senses
• Attention—Endogenous, exogenous,
sustained
• Learning and memory
• Goal is often information processing
algorithms
• Discussed in Module 2
Fels: Design of Interactive Multimodal Graphics
4
Fields of interest: Kinesiology &
related disciplines
Useful things from Psychology
•
Perception and attention
– Visual & auditory acuity & discrimination
– Colour perception
– Salient display changes, change blindness
•
– Neuroscience
– Mechanics
– Anthropometry
Learning and memory
–
–
–
–
•
• Science of human movement
Primacy, recency
Skill acquisition
STM limits
State-dependent learning
Decision making
• Examples: Fitts’ law, GOMS/Keystroke
• Goal is often perceptually guided behaviour
• Discussed in Module 3
– Base-rate neglect
– End effects, biases
Fels: Design of Interactive Multimodal Graphics
Fels: Design of Interactive Multimodal Graphics
Fields of interest: Cognitive Science
• Cogsci Society founded 22 years ago
• Combines Experimental Psychology,
AI, Philosophy and Neurophysiology
• 3 levels of analysis
– Semantics: Intentions, Goals, and
Meanings
– Syntax: Information processing
– Implementation: Neural processing
• Goal is often Cognitive Architecture
Fels: Design of Interactive Multimodal Graphics
Other fields of interest
•
•
•
•
•
Psycholinguistics
Social cognition
Cultural Psychology
Communication theory
Embodied communication,
paralinguistics
• Anthropology
Fels: Design of Interactive Multimodal Graphics
5
Example of metacognitive gap
“Horizontal” Modularity
• The “computer metaphor” of mind
• Model Human Processor (MHP)
• Intuition: Single mental processor reads all
• Serial stages of processing
senses and performs a variety of tasks.
• Cognitive Science: Processing modules
operate in parallel.
Action
• Information flow is Bottom-up
Cognition
– Cognitive impenetrability
– “Seeing is believing”
– Restricted flow of information and control.
Perception
– Processing characteristics are counterintuitive
Fels: Design of Interactive Multimodal Graphics
Horizontal modularity restrictions
Cognitive impenetrability (Pylyshyn,
1984) refers to the inability of observers
to use semantic information (such as
what the person believes or intends to do)
to influence the operation of the input
stage.
Fels: Design of Interactive Multimodal Graphics
Fels: Design of Interactive Multimodal Graphics
Vertical modularity (Fodor)
Cognitive processing
Phoneme
perception
Auditory
localization
Voice
recognition
Fels: Design of Interactive Multimodal Graphics
6
Vertical modularity restrictions
Information encapsulation (Fodor, 1983)
refers to structural barriers within the
cognitive architecture that prevents internal
data stores from being shared between
modules in the same stage of processing.
Events are processed by many channels
• Intuitions about thinking
Cognition
– Fails at low levels
• Cognitive Architecture
Bimodal
speech
Ventral
system
Dorsal
System
…
– Multiple brain areas
– Interconnected
Sensory world
– Informationally encapsulated
– Multimodal inputs parsed from scene
Fels: Design of Interactive Multimodal Graphics
Modularity of reading
•
•
•
•
Name the colour of the text
Respond as quickly as possible
Measure response time
2 trials
Fels: Design of Interactive Multimodal Graphics
Fels: Design of Interactive Multimodal Graphics
Dog
Cat
Fish
Bird
Cow
Horse
Pig
Cow
Fish
Fels: Design of Interactive Multimodal Graphics
7
Green
Red
Orange
Red
Blue
Blue
Orange
Green
Red
Fels: Design of Interactive Multimodal Graphics
Advantages of modular processing
• Download task to module, reduce
cognitive bottlenecks
• Fast, effortless information processing
• Near-optimal information integration
between cues and sensory modalities
– Fuzzy logic cue integration
– Bayesian categorization
Fels: Design of Interactive Multimodal Graphics
Cognitive Impenetrability & Modularity
• Stroop effect
– Reading is data-driven module
– Competition for response
• Other Modularity phenomena
– Modularity of perception for action
– Modularity of visual/auditory integration
– Modularity of eye movement control
– Modularity for Models of Minds
Fels: Design of Interactive Multimodal Graphics
"Civilization advances by extending the
number of important operations which
we can perform without thinking
about them."
Alfred North Whitehead
What are the disadvantages?
Fels: Design of Interactive Multimodal Graphics
8
Disadvantage: Processing rigidity
• Stress within a module is not accessible
• Complex stimuli may be processed
differently in different modules
• Tasks may access different modules with
different performance characteristics
• Virtual environments may introduce
discrepancies that impact different modules
(and tasks) differently
Stress within a module may be
undetected
Action
Cognition
Poor cognitive access to
low-level processes
Perception
Fels: Design of Interactive Multimodal Graphics
Example: VDT stress syndrome
• Users complain of headache, vision
problems etc.
• Reports anecdotal, but reading
impairment is observed
• Also pupillary tremor and regressive
saccades
Fels: Design of Interactive Multimodal Graphics
Reflective HCI Practice (after Schön)
Test: walkthrough, field study or experiment
Exp:“Lab sense”,
FS: Observation
Information
foraging
Implement
Interaction Design
Ideas and hypotheses
Evaluation,
Mapping
Fels: Design of Interactive Multimodal Graphics
(Online) Literature:
Psychology
Kinesiology
Sociology
Anthropology
Architecture…
Fels: Design of Interactive Multimodal Graphics
9
Reflective HCI Practice: Foraging
Theory of space constancy in active
vision
Test: walkthrough, field study or experiment
Exp:“Lab sense”,
FS: Observation
Information
foraging
(Online) Literature:
Psychology
Kinesiology
Sociology
Anthropology
Architecture…
Fels: Design of Interactive Multimodal Graphics
Test: walkthrough, field study or experiment
Exp:“Lab sense”,
FS: Observation
Information
foraging
Evaluation,
Mapping
Fels: Design of Interactive Multimodal Graphics
–
–
–
–
Efference copy
Passive blur
Lateral masking
Saccadic suppression
Fels: Design of Interactive Multimodal Graphics
Reflective HCI Practice: Hypothesizing
Ideas and hypotheses
• Cognition needs the illusion of a stable world
• Eye movements should create confusing
image shifts
• Maintaining space constancy requires
(Online) Literature:
Psychology
Kinesiology
Sociology
Anthropology
Architecture…
VDT fatigue study
• Hypothesis “Sampling” raster during
saccades reduces intra-saccadic blur, and
may overload saccadic suppression
• Space constancy perspective allowed us to:
– Isolate the important factors in a complex
situation
– Find a more sensitive task and measure
• Study examines detection of movement
during saccade (w Bridgeman & Macnik)
Fels: Design of Interactive Multimodal Graphics
10
VDT fatigue study recommendations
• Suppression thresholds elevated for
flickering stimuli
• Effect is reduced >250 Hz
• Problems will be greatest
– large saccades
– high-contrast display
• Work arounds include blanking display
during saccades
Reflective HCI Practice: Testing
Test: walkthrough, field study or experiment
Exp:“Lab sense”,
FS: Observation
Information
foraging
Implement
Interaction Design
Ideas and hypotheses
Evaluation,
Mapping
Fels: Design of Interactive Multimodal Graphics
Disadvantage: Each module must
solve feature assignment problem.
• Modules can’t accept information from other
modules: Information encapsulation
• Different modules should have access to a
different set of matching cues.
• Illusory conjunctions can occur in
multimodal environments:
– Phoneme perception: The McGurk effect
– Auditory localization: The ventriloquist effect
Fels: Design of Interactive Multimodal Graphics
(Online) Literature:
Psychology
Kinesiology
Sociology
Anthropology
Architecture…
Fels: Design of Interactive Multimodal Graphics
Study: Impact of display errors on
multimodal perception
• Immersive environments typically
have display errors
– Location of events is not precise
– Timing is not precise
– Graphics can be low-fidelity
• As immersive environments add
sound and touch, what will be the
impact of these errors?
Fels: Design of Interactive Multimodal Graphics
11
Reflective HCI Practice:Foraging
Test: walkthrough, field study or experiment
Exp:“Lab sense”,
FS: Observation
Information
foraging
(Online) Literature:
Psychology
Kinesiology
Sociology
Anthropology
Architecture…
Fels: Design of Interactive Multimodal Graphics
Test: walkthrough, field study or experiment
Exp:“Lab sense”,
FS: Observation
Information
foraging
Evaluation,
Mapping
Fels: Design of Interactive Multimodal Graphics
• Individual senses adapt to display
• Sensory modalities calibrate each other:
haptics, vision, sound
– Observed actions calibrate visual space (space
constancy)
– Vision calibrates hearing for the location of a
multimodal event
– Sound calibrates vision for the time of a
multimodal event
• Result is an after-effect: remapping of
auditory (visual, haptic) space
Fels: Design of Interactive Multimodal Graphics
Reflective HCI Practice: Hypothesizing
Ideas and hypotheses
Recalibration by pairing (Epstein, 1975)
(Online) Literature:
Psychology
Kinesiology
Sociology
Anthropology
Architecture…
Impact of information encapsulation
• Multimodal environment with errors in
timing and location
• The same event might give rise to a single
multimodal construct in one task, and two
unimodal events for another.
– Vary location of visual and auditory phonemes
in a simple teleconferencing-style video display
– Vary information carried by using synthetic
speech stimuli (5 levels).
Fels: Design of Interactive Multimodal Graphics
12
Ventriloquism meets the McGurk
effect.
• Vary location of visual and auditory phonemes
in a simple teleconferencing-style video display
• Vary information carried by using synthetic
speech stimuli (5 levels).
• Subjects report sound location and syllable
heard,
• Analyses included testing a variety of
mathematical models of information integration
by fitting free parameters with STEPIT.
Fels: Design of Interactive Multimodal Graphics
Test: walkthrough, field study or experiment
Exp:“Lab sense”,
FS: Observation
Information
foraging
Interaction Design
Ideas and hypotheses
Evaluation,
Mapping
Fels: Design of Interactive Multimodal Graphics
• Sensory input from a number of channels
simultaneously
• How stimuli from multiple channels are
matched and partitioned into mental
representations
• How information from multiple senses is
integrated to give rise to trans-modal
mental events
Fels: Design of Interactive Multimodal Graphics
Reflective HCI Practice: Testing
Implement
Use of mathematical modeling tools
allow us to address
(Online) Literature:
Psychology
Kinesiology
Sociology
Anthropology
Architecture…
Results:
• Visual capture of auditory source location, resulting
in a shifting of unimodal auditory location
estimation (ventriloquism after-effect).
• No effect of location difference on phoneme
perception as measured by statistical or modeling
tests.
• No correlation between errors in the two tasks (i.e.
subjects could not selectively attend to the auditory
phoneme on trials when visual capture failed).
• Overall, modularity of phoneme perception is
supported.
Fels: Design of Interactive Multimodal Graphics
13
Disadvantage: Changes in task interact
with modules to change performance
• 2 visual systems—“ventral stream” for
recognition and “dorsal stream” for action.
• Where vs how
• Different impact of illusions
• Lesion data
Functional Neuroanatomy of
perception for action.
2 visual systems—“ventral stream” for cognition and
“dorsal stream” for motor performance.
Fels: Design of Interactive Multimodal Graphics
Fels: Design of Interactive Multimodal Graphics
2 visual systems lesion evidence
2 visual system illusions
lesion
performance
deficits
spared abilities
V1 (blindsight) detection and
identification
pointing
Ventrolateral
occipital (DF)
identification, shape
recognition, object
orientation
object manipulation
(orientation matching,
grip scaling)
Posterior
parietal (RV)
object manipulation identification, shape
(orientation matching, recognition, object
grip scaling)
orientation
Fels: Design of Interactive Multimodal Graphics
stimuli
deficits
spared abilities
Tichner circles size report
grip scaling
displacement
during
saccade
detection of
displacement,
location report
pointing
Moving or offcentre frame
induced motion,
location report
pointing
Fels: Design of Interactive Multimodal Graphics
14
Research with videoconferencing
and abstract displays
• Targeting sound: cognitive better than
motor
– Subs aware of visual and auditory
locations, but point to visual
• Targeting vision with context: Less
feedback is better
– Pointing with no visual feedback better
– Lagged cursor better than unlagged
Fels: Design of Interactive Multimodal Graphics
Interpreting pointing studies
• Pointing studies counterintuitive, but
predicted by response characteristics
of neurons in dorsal/ventral to visual
and auditory stimuli
• More in “Graphics in the Large” panel,
Thursday 3:30
Fels: Design of Interactive Multimodal Graphics
2 visual system illusions
Modularity Recap
stimuli
deficits
spared abilities
Tichner circles size report
grip scaling
displacement
during
saccade
Moving or offcentre frame
Sound with
displaced
visual
distractor
pointing
detection of
displacement,
location report
induced motion,
location report
pointing
pointing
apparent location of
sound
Fels: Design of Interactive Multimodal Graphics
• Displays can interact with active perception
to cause hidden stress: VDT study
• Displays can impact matching cues for
multimodal cue integration
– Support or frustrate recalibration by pairing
– Lead to discrepancies in number and
composition if stimuli in different modules
• Displays can cause disagreement between
motor performance and cognitive measures
• Sometimes removing information helps
Fels: Design of Interactive Multimodal Graphics
15
Extending to complex worlds
Indexical cognition (Pylyshyn)
• Previous studies in simple worlds, with a
few visual and auditory events
• Multimodal environments are complex
– Virtual worlds
– Augmented reality
– Ubiquitous computing
• How are multiple multimodal events dealt
with in the cognitive architecture?
Fels: Design of Interactive Multimodal Graphics
“FINSTs... make thoughts true”
• Perception
–
–
–
–
–
“Hotlink” tokens
Drawn to salient events
Object-centred, “sticky”
Visual routines
Finite number ~ 4
• Cognition
–
–
–
–
–
Maintain object history
Implicit memory of object associations
Sparse cognitive representation
Just-in-time delivery of information
Atom of intentionality
Fels: Design of Interactive Multimodal Graphics
Fels: Design of Interactive Multimodal Graphics
Mental representations of complex worlds
• Cognitive architecture perspective requires
that links be established between lower
level perceptual qualities and cognitive
symbols—i.e. a pointer, called a FINST.
• FINSTing allows us to interact with
perceptual objects and events without the
need for mental images per se.
• Symbolic representation + pointers makes
different predictions than intuitive picture-inthe-head
Fels: Design of Interactive Multimodal Graphics
16
Indexical cognition (Pylyshyn)
Naïve view of FINSTs in Cognitive Arch
Action (motor space)
Cognitive processing
FINSTs
Phoneme
perception
Fels: Design of Interactive Multimodal Graphics
Auditory
localization
Voice
recognition
Fels: Design of Interactive Multimodal Graphics
Another view of FINSTs
More about FINSTs
Action (motor space)
• FINSTs Link mind & perceptual world
Cognitive processing
FINSTs
Phoneme
perception
Auditory
localization
Voice
recognition
Fels: Design of Interactive Multimodal Graphics
– Visual routines: (collinear, inside,
subitizing)
– History of an object
– Object-centred, “sticky”
– Drawn to salient changes-- onsets,
luminance increments, oddballs
– Finite number ~ 4-7
– FINSTs + ANCHORs for motor behaviour
Fels: Design of Interactive Multimodal Graphics
17
More about ANCHORs
Multiple object tracking demo
• ANCHORs link mind & action
– Remembered locations for eye
movements
– Direct interaction with items off the retina
– Fast, robust motor performance by action
routines
– Affordances for action
Fels: Design of Interactive Multimodal Graphics
Another trial (Scholl)
Fels: Design of Interactive Multimodal Graphics
Fels: Design of Interactive Multimodal Graphics
Another trial (Scholl)
Fels: Design of Interactive Multimodal Graphics
18
Application
Mental representations of complex
environments
• Cognitive architecture perspective requires that
links be established between lower level perceptual
qualities and cognitive symbols—i.e. a pointer,
called a FINST.
• FINSTing allows us to interact with perceptual
objects and events without the need for mental
images per se.
• Symbolic representation + pointers makes different
predictions than intuitive picture-in-the-head
• Coping with spatial transformations in complex
data spaces
Fels: Design of Interactive Multimodal Graphics
Module advantages
• High-realism interface designs improve
performance by “downloading” information
processing operations to input modules.
• Interaction of display characteristics with
capabilities and characteristics of the
functional architecture will determine
performance.
Fels: Design of Interactive Multimodal Graphics
Fels: Design of Interactive Multimodal Graphics
Module disadvantages
• Coordination
– Distortions in location, timing, and category-relevant
information may lead to the formation of conflicting
representations in different modules.
• Processing inflexibility
– Errors and conflicts within a module can create errors and
increase cognitive load. (CRT flicker example)
• Information hiding
Cognitive impenetrability of modules makes it difficult for
operators to determine the reasons for their poor
performance.
Fels: Design of Interactive Multimodal Graphics
19
Inseparability of Mind & World
•
•
•
•
Embodied cognition-- mind/body
Situated cognition-- mind/world
Distributed cognition-- mind/mind
Ecological theories (Vygotski, Luria,
Bateson, Gibson) can be linked to
sensory phenomena and inform
interaction design
Fels: Design of Interactive Multimodal Graphics
What to expect in the next talk
• More about perceptual and attentive
bottlenecks
• Change blindness
• Three flavours of (in)attention
• Implications for visual displays
Key Points for HCI Practice
• The Metacognitive Gap: The need for a
grounded Cognitive Science approach
• Reflective design practice methods
– Integrating CogSci with interaction design
– Iterative design cycle
• Examples of extended HCI
– Cognitive architecture: Multimodal displays and
how they are understood
– Situated cognition: embodied interaction with
complex displays
Fels: Design of Interactive Multimodal Graphics
Environment/Inhabitant/Representation
• Information visualization
• Perceptual/deictic/situated Cognition
• Cognitive Architecture, perception, attention
• Display as extension of mental model
• Coercive graphics
Fels: Design of Interactive Multimodal Graphics
Fels: Design of Interactive Multimodal Graphics
20
Thirteen Ways of Looking at a Blackbird
Wallace Stevens
I
Among twenty snowy mountains,
The only moving thing
Was the eye of a blackbird.
II
I was of three minds,
Like a tree
In which there are three blackbirds.
III
The blackbird whirled in the autumn
winds.
It was a small part of the pantomime.
IV
A man and a woman
Are one.
A man and a woman and a blackbird
Are one.
V
I do not know which to prefer,
The beauty of inflections
Or the beauty of innuendoes,
The blackbird whistling
Or just after.
VI
Icicles filled the long window
With barbaric glass.
The shadow of the blackbird
Crossed it, to and for.
The mood
Traced in the shadow
An indecipherable cause.
VII
O thin men of Haddam,
Why do you imagine golden birds?
Do you not see how the blackbird
Walks around the feet
Of the women about you?
VIII
I know noble accents
And lucid, inescapable rhythms;
But I know, too,
That the blackbird is involved
In what I know.
IX
When the blackbird flew out of sight,
It marked the edge
Of one of many circles.
X
At the sight of blackbirds
Flying in a green light,
Even the bawds of euphony
Would cry out sharply.
XI
He rode over Connecticut
In a glass coach.
Once, a fear pierced him,
In that he mistook
The shadow of his equipage
For blackbirds.
XII
The river is moving.
The blackbird must be flying.
XIII
It was evening all afternoon.
It was snowing
And it was going to snow.
The blackbird sat
In the cedar-limbs.
Fels: Design of Interactive Multimodal Graphics
21