Validation of a binaural model for speech intelligibility in noise and

Transcription

Validation of a binaural model for speech intelligibility in noise and
Validation of a binaural model for
speech intelligibility in noise and reverberation
Jelfs S., Culling J. F. & Lavandier M.
BRIR Target
BRIR Interferer 1
BRIR Interferer 2
.
.
.
BRIR Interferer N
Predictions in
anechoic*
Noise interferers and frontal target sentences in all
experiments
BMLDs
Better-ear S/N ratios
Binaural S/N ratio
predicting SRT
Better-ear listening
Peissig & Kollmeier (1997)
Hawley et al. (2004)
Culling et al. (2004)
- Subjective adjustment procedure in
German
- Interferers: one (17 azimuths between
0° and 360°), two (17 az.+105°), or three
(17 az.+105°+255°)
- Adaptative procedure in English
- One, two or three interferers at one,
two or three of a range of azimuths (-30°,
0°, 30°, 60° and 90°)
- Binaural and monaural listening
- Same as Hawley et al. (2004) with
three interferers
- Conditions with both cues (FF), no
binaural unmasking (ILD-only) or
no better-ear listening (ITD-only)
Anechoic => head-related impulse responses (HRIRs)
instead of BRIRs for the predictions
6
Predicted SRT (dB)
3
When original HRIRs not available (P&K1997,
Hal.2004, B&P1988) => HRIRs from Gardner and
Martin (1994)
Energy levels
1 interferer
2 interferers
3 interferers
0
-3
-6
* Jelfs et al. (2011), Hear. Res. (in press)
-6
-9
open symbols: monaural, r = 0.99
closed symbols: binaural, r = 0.99
r =0.98
-12
-12
-12
-9
-6
-3
0
3
6
-9
-6
Both cues (FF)
ITD only (dT)
ILD only (dL)
0
-3
-12
-9
1
1 interferer
2 interferers
3 interferers
0
Predicted SRT (dB)
* computationally efficient version of
Lavandier & Culling (2010) model
Interaural parameters
(ΦT - ΦI, ρI)
Intelligibility
Predicted SRT (dB)
BRIR: binaural room impulse response
BMLD: binaural masking level difference
Φ: interaural phase difference
ρ: interaural coherence
S/N ratio: signal-to-noise ratio
SII: speech intelligibility index
SRT: speech reception threshold
Binaural unmasking
Gammatone Filterbank
BRIRs between sources & ears
SII-weighted
integration
Model*
-3
0
-1
-2
-3
-4
-5
r =0.95
-6
-6
Observed SRT (dB)
-5
-4
-3
-2
-1
Observed SRT (dB)
Observed SRT (dB)
Bronkhorst & Plomp (1988)
- Adaptative procedure in Dutch
- One interferer at 0°, 30°, 60°, 90°, 120°, 150°, or 180°
- Conditions with both cues (FF), no binaural unmasking (dL)
or no better-ear listening (dT)
Predicted SRT (dB)
-5
FF (Both cues)
dL (ILD only)
dT (ITD only)
Main discrepancy associated with a HRIR difference at 90°
-10
-15
r =0.88
-20
-20
-15
-10
-5
Observed SRT (dB)
Predictions in
reverberation*
Noise interferers, target sentences and
real-room BRIRs in all experiments
Adaptive procedure in English
(except B&B2006)
Experiment 1
Experiment 2
Beutelmann & Brand (2006)
- One interferer at three azimuths (-25°, 0°
or 25°) and two distances (0.65 or 5 m), and
a target at (25°, 0.65m)
- Meeting room 1
- Conditions with both cues (BRIR) or no
binaural unmasking (SEIR for Spectral
Envelope Impulse Response)
- One interferer in 12 selected conditions
involving three azimuths (-25°, 0° or 25°),
5 distances (0.65, 1.25, 2.5, 5 and 10 m)
and 4 rooms (corridor, L-shaped, lecture
hall, meeting 2)
- Adaptative procedure in German
- One interferer (8 azimuths between
-140° and 180°) and a frontal target
- Two rooms (cafeteria, office)
* Lavandier et al. (2011), J. Acoust.
Soc. Am. (under review)
Experiment 3
- One, two or three interferers in 4 configurations (see below) at two distances (0.65 or 5 m)
- Meeting room 1
- Conditions with both cues (BRIR) or no
binaural unmasking (SEIR)
near (0.65 m)
or far (5 m)
0.65 m
1int.
Conclusions
2 int.
bilateral
3 int.
 model predicting speech reception thresholds in combined noise and reverberation
 accounts for the effects of binaural unmasking, head-shadow and room colouration
 for multiple interfering sources and real-room acoustics
 reproduces a range of data sets from the literature (≠ languages and procedures)
 without parameter fitting
0
1

Similar documents