lect 0514 p

Transcription

lect 0514 p
IST 4 Information and Logic
Associative Memories
A Case Study with Remote Association Test
Yue Li
Call Phone Fence
Car Valve
Caltech MIT
Pen Blue Cheese
BabylonTablet
Suzanne Corkin
Henry Molaison or H. M.
(1926 2008)
(1926-2008)
Part of H. M.’s brain is removed for treating epilepsy in 1953.
After the surgery
surgery, he suffered from severe amnesia.
amnesia
Scientists worked with H.M. for more than 40 years.
William Scoville
(1906 – 1984)
Brenda Milner
(1918 - )
Metal – Iron
Baby – Cries
Crush – Dark
School – Grocery
Rose – Flower
Obey – Inch
Fruit – Apple
Cabbage – Pen
“Do
Do you remember what
went with Metal? Baby?
Crush?...”
First trial: Iron
Second trial: Cries, Iron, Flower
Third trial: Apple, Cries, Iron
Henry Molaison or H. M.
(1926-2008)
The Storyy of S.
“Nel mezzo del cammin di nostra vita …”
Alexander Luria
(1902 – 1977)
“
(Nel) – I was paying my membership dues when
there, in the corridor, I caught sight of the ballerina
Nel’skaya
Nel
skaya.
(Mezzo) – I myself am a violinist; what I do is to set
up an image of a man, together with Nel’skaya, who
is playing the violin.
(Del) – There’s a pack of Deli Cigarettes near them.
(Cammin) – I set up an image of a fireplace close
by.
(Di) – Then I see a hand pointing toward a door.
”
Memory champions have average
memories.
They memorize in a different way
using navigation, association and
visualization.
Joshua Foer
Good memories can be trained.
Association and Creative Thinking
Creative thinkingg connects different ideas together
g
and generates a new idea.
Prof. Mednick proposed remote association tests
Prof
in 1960s for evaluating a person’s creativity.
S
Sarnoff
ff A
A. Mednick
M d i k
Sk
Skate
C
Cream
Ice
Water
Falling
Dust
Star
Actor
Boot
Summer
Camp
Ground
Fence
Card
Post
Master
Remote Association Test
Cue #1
Cue #2
Fence
Card
Solution
P
Post
Cue #3
Master
We want to synthesize the
process for solving RATs.
RATs
Why ?
Wouldn’t it be nice if …
Caltech
Astronomy
JJet Propulsion
p
Laboratoryy
Pasadena
Why do we want to solve RAT
• Captures word associations
– Building blocks for associating large datasets, e.g.,
webpages,
b
databases,
d b
documents.
d
• Potentials for interesting applications
– Artificial brainstorming.
g
– Brain-like memory organization.
– Artificial experts.
experts
Solvingg RATs
Human: Knowledge + Strategy/Tricks
Machine: Database/Data structures + Algorithms
Captures word
associations
Search in data
structures
A Good Database
• Human Brain Cloud
– http://www.humanbraincloud.com
– A social game by Kyle Gabler.
– Word associations contributed by
crowd sourcing.
Why HBC Database ?
http://www.humanbraincloud.com/#stats
• Using humans
• Completeness
– Many words
– Many connections
Word Association Graph
p
Caltech
0.1
0.7
0.6
Astronomy
0.5
0.3
0.2
0.3
Telescope
0.2
JPL
0.3
Uni ersit
University
0.1
0.1
Sunny
Pasadena
0.6
HBC Graph
G h Demo
D
Demo #1
D(x, y): the shortest distance from x to y
D(Caltech, Telescope) = 2
Caltech
01
0.1
07
0.7
0.6
Astronomy
0.5
Tl
Telescope
Asymmetric
0.2
0.3
0.2
0.3
JPL
0.3
U
University
0.1
0.1
Sunny
Pasadena
0.6
Symmetric Distance: SD(x, y) = min{ D(x, y), D(y, x) }
SD(C l h JPL) = min
SD(Caltech,
i { 22, 1 } = 1
Caltech
01
0.1
07
0.7
0.5
Tl
Telescope
0.6
Astronomy
0.2
0.3
0.2
0.3
JPL
0.3
U
University
0.1
0.1
Sunny
Pasadena
0.6
Sphere(w, r): all the words within distance r
SSphere(Pasadena,
h (P d
1) = {JPL,
{JPL Caltech,
C lt h Sunny}
S
}
Sphere(Pasadena, 2) = {JPL, Caltech, Sunny, Telescope, Astronomy, University}
Caltech
01
0.1
07
0.7
0.5
Tl
Telescope
0.6
Astronomy
0.2
0.3
0.2
0.3
JJPL
0.3
University
0.1
0.1
Sunny
Pasadena
0.6
Sphere Intersection
S h (A
Sphere(Astronomy,
1) ∩ Sphere(JPL,
S h (JPL 1) = { Telescope
Tl
}
Caltech
01
0.1
07
0.7
0.5
Tl
Telescope
0.6
Astronomy
0.2
0.3
0.2
0.3
JJPL
0.3
University
0.1
0.1
Sunny
Pasadena
0.6
An Algorithm of the RAT Solver
• Compute the radius-3 sphere intersection of the cue words*
• Prune the sphere intersection
– take the words with minimum sum of symmetric distances
from the cue words.
words
• Shorten the candidate set with predefined metric to
yield top-K solutions
– e.g. using the multiplication of the edge probabilities
*our demo to be shown later uses radius-2 ball intersection for faster execution speed.
A Toyy Example
p
Caltech
Astronomy
?
Pasadena
Compute
p
Sphere
p
Intersection
Caltech
0.1
0.7
0.5
Telescope
0.6
Astronomy
0.2
0.3
0.2
0.3
JPL
0.3
Uni ersit
University
0.1
0.1
Sunny
Pasadena
0.6
Prune Sphere Intersection
SD(Telescope, Astronomy)
SD(S
SD(Sunny,
A
Astronomy)
)
SD(JPL, Astronomy)
SD(University, Astronomy)
+ SD(Telescope, Caltech)
+ SD(Sunny,
SD(S
C l h)
Caltech)
+ SD(JPL, Caltech)
+ SD(University, Caltech)
+
+
+
+
SD(Telescope, Pasadena) = 5
SD(S
SD(Sunny,
P d )
Pasadena)
=6
SD(JPL, Pasadena)
=3
SD(University, Pasadena) = 6
Caltech
0.1
0.6
Astronomy
0.7
0.5
Telescope
0.2
0.3
0.2
03
0.3
JPL
0.3
University
0.1
0.1
Sunny
Pasadena
0.6
Walk-through Examples with
Real RATs
Demo #2
Performance (k = 1)
In 84.9% RATs, the real solutions appear in the pruned intersection.
In 70.8% RATs, the final solution is correct.
* All RATs are collected from
www.remote-associates-test.com
Difficulty characterized by the
average solving time of human beings.
Performance (k = 3, 4, 6, 8)
In 80% RATs, the real solutions appear in the top-k solution list.
Success Rate
Average Precision (k =1, 3, 4, 6, 8)
increases slowly with k when k > 3
A better association
graph is needed
Ongoing Work for Solving RATs
• Better
B
algorithms
l
h
– Improving distance metric
– Lowering complexity
• Better databases
– Generatingg association graphs
g p with better quality
q
y
• Utilizing the semantics of and the relationships between
words
Flower
Fight
Rose
Up
Future Directions
• How to generate association graphs?
– How to determine the quality of a graph?
– Domain specific graph
• Medicine, NSA, marketing, patents, papers, people
• How to access the quality of a domain specific graph?
• How to determine the difficulty level of a RAT?
• Can we create RAT using an association graph?
Ideas and Questions
If you have
ave any
a y quest
questions,
o s, new
ew ideas,
eas, or
o want
wa t
to work with us, please drop us an email!
y @ca tec .e u
[email protected]