Online Dictionary Learning for Sparse Coding
Transcription
Online Dictionary Learning for Sparse Coding
Julien Mairal (1), Francis Bach (1), Jean Ponce (2), Guillermo Sapiro (3)
(1) INRIA–Willow project, (2) Ecole Normale Supérieure, (3) University of Minnesota
ICML, Montréal, June 2009

What this talk is about
- Learning dictionaries (basis sets) for sparse coding efficiently.
- Solving a large-scale matrix factorization problem.
- Making some large-scale image processing problems tractable.
- Proposing an algorithm which extends to NMF, sparse PCA, ...

Outline
1. The Dictionary Learning Problem
2. Online Dictionary Learning
3. Extensions

The Dictionary Learning Problem

The denoising model: the measurements are the original image corrupted by additive noise,

$$y = x_{\text{orig}} + w,$$

where $y$ is the measurements, $x_{\text{orig}}$ the original image, and $w$ the noise.

The Dictionary Learning Problem [Elad & Aharon (2006)]

Solving the denoising problem:
- Extract all overlapping 8 × 8 patches $x_i$.
- Solve a matrix factorization problem:

$$\min_{\alpha \in \mathbb{R}^{k \times n},\, D \in \mathcal{C}} \; \sum_{i=1}^{n} \underbrace{\frac{1}{2}\|x_i - D\alpha_i\|_2^2}_{\text{reconstruction}} + \underbrace{\lambda \|\alpha_i\|_1}_{\text{sparsity}},$$

  with n > 100,000.
- Average the reconstructions of the patches.

The Dictionary Learning Problem [Mairal, Bach, Ponce, Sapiro & Zisserman (2009)]
[Figure: denoising result.]

The Dictionary Learning Problem [Mairal, Sapiro & Elad (2008)]
[Figure: image completion example.]

The Dictionary Learning Problem
[Figure: what the learned dictionary D looks like.]

The Dictionary Learning Problem

$$\min_{\alpha \in \mathbb{R}^{k \times n},\, D \in \mathcal{C}} \; \sum_{i=1}^{n} \frac{1}{2}\|x_i - D\alpha_i\|_2^2 + \lambda \|\alpha_i\|_1,$$

$$\mathcal{C} \triangleq \{ D \in \mathbb{R}^{m \times k} \ \text{s.t.} \ \forall j = 1, \dots, k, \; \|d_j\|_2 \le 1 \}.$$

Classical optimization alternates between D and α. Good results, but very slow!

Online Dictionary Learning

Classical formulation of dictionary learning:

$$\min_{D \in \mathcal{C}} f_n(D) = \min_{D \in \mathcal{C}} \frac{1}{n} \sum_{i=1}^{n} l(x_i, D),$$

where

$$l(x, D) \triangleq \min_{\alpha \in \mathbb{R}^{k}} \frac{1}{2}\|x - D\alpha\|_2^2 + \lambda \|\alpha\|_1.$$

Which formulation are we interested in? The expected cost over the data distribution:

$$\min_{D \in \mathcal{C}} f(D) = \mathbb{E}_x[l(x, D)] \approx \lim_{n \to +\infty} \frac{1}{n} \sum_{i=1}^{n} l(x_i, D).$$

Online learning can:
- handle potentially infinite datasets,
- adapt to dynamic training sets,
- be dramatically faster than batch algorithms [Bottou & Bousquet (2008)].

Proposed approach (a code sketch of this loop follows the guarantees below):

1: for t = 1, ..., T do
2:   Draw x_t
3:   Sparse coding:
     $$\alpha_t \leftarrow \arg\min_{\alpha \in \mathbb{R}^{k}} \frac{1}{2}\|x_t - D_{t-1}\alpha\|_2^2 + \lambda \|\alpha\|_1$$
4:   Dictionary update:
     $$D_t \leftarrow \arg\min_{D \in \mathcal{C}} \frac{1}{t} \sum_{i=1}^{t} \frac{1}{2}\|x_i - D\alpha_i\|_2^2 + \lambda \|\alpha_i\|_1$$
5: end for

Implementation details:
- Use LARS for the sparse coding step.
- Use a block-coordinate approach for the dictionary update, with warm restart.
- Use a mini-batch.

Which guarantees do we have? Under a few reasonable assumptions:
- we build a surrogate function $\hat{f}_t$ of the expected cost $f$ verifying

$$\lim_{t \to +\infty} \hat{f}_t(D_t) - f(D_t) = 0,$$

- $D_t$ is asymptotically close to a stationary point.
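Not on the original slides: a minimal NumPy sketch of the loop above, with ISTA standing in for the LARS solver used in the talk, a batch size of 1 rather than the mini-batches the talk advocates, and the dictionary update done by block-coordinate descent on the running sufficient statistics $A_t = \sum_{i \le t} \alpha_i \alpha_i^\top$ and $B_t = \sum_{i \le t} x_i \alpha_i^\top$, so that past samples need not be stored. All function names are illustrative.

```python
import numpy as np

def soft_threshold(v, t):
    # Elementwise soft-thresholding: the prox operator of the l1 norm.
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def sparse_code_ista(x, D, lam, n_iter=100):
    # Solve min_a 0.5*||x - D a||_2^2 + lam*||a||_1 with ISTA
    # (a simple stand-in for the LARS step used in the talk).
    a = np.zeros(D.shape[1])
    L = np.linalg.norm(D, 2) ** 2  # Lipschitz constant of the gradient
    for _ in range(n_iter):
        grad = D.T @ (D @ a - x)
        a = soft_threshold(a - grad / L, lam / L)
    return a

def dictionary_update(D, A, B):
    # One pass of block-coordinate descent over the columns of D,
    # using A = sum_i a_i a_i^T and B = sum_i x_i a_i^T.
    k = D.shape[1]
    for j in range(k):
        if A[j, j] < 1e-12:
            continue  # atom never used so far; leave it unchanged
        # Coordinate step on column j, then projection onto
        # the constraint set C = {D : ||d_j||_2 <= 1}.
        u = (B[:, j] - D @ A[:, j]) / A[j, j] + D[:, j]
        D[:, j] = u / max(np.linalg.norm(u), 1.0)
    return D

def online_dictionary_learning(X, k, lam, seed=0):
    # X: (n_samples, m) array of signals, processed one at a time.
    rng = np.random.default_rng(seed)
    m = X.shape[1]
    D = rng.standard_normal((m, k))
    D /= np.linalg.norm(D, axis=0)      # unit-norm initial atoms
    A = np.zeros((k, k))                # running sum of alpha alpha^T
    B = np.zeros((m, k))                # running sum of x alpha^T
    for x in X:
        alpha = sparse_code_ista(x, D, lam)
        A += np.outer(alpha, alpha)
        B += np.outer(x, alpha)
        D = dictionary_update(D, A, B)  # warm restart from previous D
    return D
```

The column update `u = (B[:, j] - D @ A[:, j]) / A[j, j] + D[:, j]` followed by rescaling onto the unit ball is a projected block-coordinate step on the accumulated quadratic cost; warm-restarting D from the previous iterate is what keeps each update cheap.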
Online Dictionary Learning

Experimental results, batch vs online:
[Figure slides comparing batch and online training for m = 8 × 8, k = 256; m = 12 × 12 × 3, k = 512; and m = 16 × 16, k = 1024.]

Experimental results, ODL vs SGD:
[Figure slides comparing ODL and SGD for m = 8 × 8, k = 256; m = 12 × 12 × 3, k = 512; and m = 16 × 16, k = 1024.]

Inpainting a 12-Mpixel photograph:
[Four figure slides.]

Extension to NMF and sparse PCA

NMF extension:

$$\min_{\alpha \in \mathbb{R}^{k \times n},\, D \in \mathcal{C}} \; \sum_{i=1}^{n} \frac{1}{2}\|x_i - D\alpha_i\|_2^2 \quad \text{s.t.} \quad \alpha_i \ge 0, \; D \ge 0.$$

SPCA extension:

$$\min_{\alpha \in \mathbb{R}^{k \times n},\, D \in \mathcal{C}'} \; \sum_{i=1}^{n} \frac{1}{2}\|x_i - D\alpha_i\|_2^2 + \lambda \|\alpha_i\|_1,$$

$$\mathcal{C}' \triangleq \{ D \in \mathbb{R}^{m \times k} \ \text{s.t.} \ \forall j, \; \|d_j\|_2^2 + \gamma \|d_j\|_1 \le 1 \}.$$

Extension to NMF and sparse PCA
Faces: Extended Yale Database B
[Figure panels: (a) PCA, (b) NNMF, (c) DL; (d) SPCA, τ = 70%, (e) SPCA, τ = 30%, (f) SPCA, τ = 10%.]

Extension to NMF and sparse PCA
Natural Patches
[Figure panels: (a) PCA, (b) NNMF, (c) DL; (d) SPCA, τ = 70%, (e) SPCA, τ = 30%, (f) SPCA, τ = 10%.]

Conclusion

Take-home message:
- Online techniques are well suited to the dictionary learning problem.
- Our method makes some large-scale image processing tasks tractable...
- ...and extends to various matrix factorization problems.
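A closing practical note that is not part of the talk: an online dictionary-learning algorithm of this kind is available in scikit-learn as MiniBatchDictionaryLearning. A minimal usage sketch, with purely illustrative data and hyperparameter values:

```python
import numpy as np
from sklearn.decomposition import MiniBatchDictionaryLearning

rng = np.random.default_rng(0)
# Stand-in data: 10,000 flattened 8x8 patches. In practice, extract
# overlapping patches from images and center them first.
X = rng.standard_normal((10_000, 64))

dl = MiniBatchDictionaryLearning(
    n_components=256,  # k, the number of dictionary atoms
    alpha=1.0,         # lambda, the l1 penalty (illustrative value)
    batch_size=256,    # mini-batch size; the talk advocates mini-batches
)
dl.fit(X)
D = dl.components_           # learned dictionary, shape (256, 64)
codes = dl.transform(X[:5])  # sparse codes for a few samples
```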