Small area estimation of labour force indicators under a multinomial

Transcription

Small area estimation of labour force indicators under a multinomial
Small area estimation of labour force indicators
under a multinomial mixed model with correlated
time and area effects
M.E. López-Vizcaíno1 , M.J. Lombardía2 and D. Morales 3
1
3
Instituto Galego de Estatística
2
Universidade da Coruña
Universidad Miguel Hernández de Elche
Introduction
The impact of the crisis on the Spanish labour market has been very
intense.
Spanish unemployment rate was 27.16% in the firth quarter of 2013
(around 17 points higher than in 2008).
Unemployment in Galicia was also a major problem. The unemployment
rate was 22,35% in the firth quarter of 2013.
The availability of a hight quality set of labour force indicators at the
county level is very important for police makers.
As the Spanish Labour Force Survey (SLFS) is designed to obtain
estimates at the province level, estimating indicators at a low level of
aggregation (like counties) is a small area estimation problem.
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
2 / 47
Objective
Estimation of labour force indicators: totals of employed and unemployed
people and unemploymet rates in the counties of Galicia.
We use a multinomial logit mixed model with two random effects, one
associated with the category of employed people and the other
associated with the category of unemployed people.
We model the time dependency with AR(1)-correlated time effects.
We propose mean squared error (MSE) estimators based on analytical
and bootstrap methods.
The new models generalized the model of Molina et al. (2007) and
López-Vizcaíno et al. (2013).
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
3 / 47
Data description
Sample information:
SLFS of Galicia from the third quarter of 2009 to the fourth quarter of
2011 (10 time periods). Information available on individual level.
Sample design:
Quarterly survey that uses a two-stage stratified random sampling design to
extract samples from each Spanish province.
The Primary Sampling units are Census Sections (geographical areas with a
maximum of 500 dwellings-approximately 3000 people).
The Secondary Sampling Units are dwellings, around 8000 in Galicia.
All individuals aged 16 or more in the selected dwelling are interviewed.
Domains of interest:
Domains are the 51 counties of Galicia crossed with sex. There are
D = 102 domains Pdt partitioned in the subsets
Pd1t
Pd2t
Pdt3
employed
unemployed
inactive
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
4 / 47
Data description
Auxiliary variables
0.8
SEXAGE: Combinations of sex and age groups, with 6 values. SEX is coded
1 for men and 2 for women and AGE is categorized in 3 groups with codes 1
for 16-24, 2 for 25-54 and 3 for ≥55. The codes 1, 2, . . . , 6 are used for the
pairs of sex-age (1, 1), (1, 2), . . . , (2, 3).
2
1
2
2
2
2
2
2
2
1
5
5
5
5
1
4
3
6
1
2
3
4
4
2
5
4
5
2
2
4
2
5
5
1
5
2
5
2
2
4
4
4
3
6
3
3
3
6
6
6
7
8
9
10
5
5
2
0.05
2
5
1
4
1
3
4
3
4
3
4
3
3
3
3
6
6
6
6
6
6
6
6
6
6
3
6
3
6
5
6
7
8
9
10
1
2
3
4
5
6
3
4
6
4
2
5
1
4
1
4
3
6
1
4
3
4
3
6
1
1
1
0.2
3
1
1
1
5
1
4
4
4
5
0.6
5
0.4
Proportion of employed
5
1
1
0.10
5
Proportion of unemployed
5
1
1
2
0.15
2
TIME
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
3
TIME
June 2013
5 / 47
Data description
Auxiliary variables
STUD: This variable describes the achieved education level, with values 1-3
for the illiterate and the primary, the secondary and the higher education level
respectively.
3
2
3
3
3
3
3
3
3
3
3
2
2
2
2
2
2
2
2
2
2
3
2
0.10
2
2
Proportion of unemployed
2
0.4
Proportion of employed
0.6
2
2
2
3
3
2
2
3
3
3
3
3
3
3
0.2
0.05
1
1
1
1
2
1
1
3
4
1
1
5
6
1
1
1
1
7
8
9
10
1
1
1
3
4
1
1
1
1
1
1
1
2
TIME
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
5
6
7
8
9
10
TIME
June 2013
6 / 47
Data description
Auxiliary variables
0
SS: This variable indicates if an individual is registered or not in the social
security system.
REG: This variable indicates if an individual is registered or not as
unemployed in the administrative register of employment claimants.
●
●
●
●
●
●
●
●
●
●
●
1
●
●
● ●
●●
● ●
●
●
● ●●
●
●
●
●
●
●
−2
●
−1
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●
● ●
●
● ●● ● ●●
●
● ●●
●● ● ●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●● ● ●● ● ●● ● ● ●
●
● ●
●
●●●
●●
●
●
●●●
● ●● ● ●
● ●●
●
●
●
●
●
●
●
●
●
● ●
● ●● ● ●● ● ●
●
● ● ●
● ●● ●
●
● ●
●
●
●● ● ● ●
● ●
●
●●
●
● ●
●
●
●●
●
●
●● ●● ●
●
●● ● ●
●
●
●
● ● ● ●●
●
●●● ●●●● ●
●
●
●●●● ● ●● ● ● ● ● ● ●
●
●
● ●●
●●
●
● ●●
● ●●
●
●
●
●●
●
●
●
● ●●●●● ● ●
●
●●
●●
●
●● ● ● ●
●●●●●
●
●
●
● ●●● ●
● ●●
● ● ●● ●●
●●
● ●●
● ● ●
● ●
●●
●
● ● ● ● ●●●
●● ●
●●
●
●● ●
●●●
●●
● ●
●
● ●● ●
●●
●●
●●
● ●
● ●
●
●
●●●●
●
●●
● ●
●
●●
●
●●
● ●
●●
●
● ●●
●
● ● ●● ●
●
●●● ●
●
●
●●●
●
●
● ●
●●
● ●● ●
●● ●
●● ●● ●
● ●
●
● ●
●
● ●
●
●●
●●
●
●
● ●
●
●● ●
● ●●
● ●
●●● ●●●
● ● ●
●●
●
●
●●
●
●
●
●
●
●●●●●●● ● ● ● ● ●
●●
●
●
●
●
●
●●
●
●● ●●●
● ●●
● ●
●
●
● ● ●● ● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●●
●
●
●
● ●
● ●
● ●
● ● ●
●
● ●● ●●●
●
●
●●
●
● ●●
● ●●
● ●
●● ● ●
●
● ●●
●
● ●● ●●
● ●
●
●
● ●
●
●
●● ● ●
●●●
●
●
● ● ●●● ●
●
●
●
●
● ●
●●
●
●
●
●
●●
● ●●
●
●●● ● ●
●
●●●
●● ● ● ●
●
●
● ●● ●● ● ●●●
● ●
●
● ●
●
● ●
●●
●
● ●●● ● ● ●
● ● ● ● ●●●
●
●
● ●●●
● ●●
●
● ●● ● ●
●
● ● ●
● ● ●●●●
●●
●
● ●
●
● ●
●● ●
●● ● ●
●●
●
●●
●●
●●
● ●
● ●
●● ● ●
●● ●●
●●
● ●● ● ●●●
●
●
●
● ●●
●
● ●
●●
●
● ●●
●
●
● ●●
● ● ●●
●●
●
●
●
●
● ●
● ●●
●
●
●●●
●●●
● ● ●● ●
● ●●
●
●
●●
●●●●
●
● ●●
●
●
●
●
●
●
●
●
●
●
●
● ●● ● ●
●●●
●
●
●●
● ●
●
●● ●
●
● ●●
●
● ●
●
●
● ●● ●● ●
●●
●
● ● ● ●● ●
●
●
●● ●● ●
●
●
●
●●●
●
●
●
● ●
●
●
●
●● ●
● ●
●
●
●
● ●
●
●
●●
●● ●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
● ●
●●
●●
●
●
●
●
−2
●
● ●
●
●
●
●
●
●
●●
●
●
●
● ●●
●● ●
●
●●
●
●
● ●●
●
●
● ●●
● ● ● ●● ●●●
●
●
●● ● ● ● ● ●
● ●●
● ● ●● ● ●
●
●
●
●
●●
●●
●
●
●●
● ●
●
● ● ●●●
●
●●
●
●
●●
● ●● ●
● ●
●●●●● ● ●
●●
●
●
●
●
●● ●
● ●●
●●
●
● ●
● ●●● ●
●
●
●● ●
●●
●●
●
●
●
●
●
●
●
●
●● ●
●
●● ●
● ●●● ●●
● ●
●
● ●
●
● ●●
●
●
●●
● ●
●
●
●
●
●●
●●
●
● ●●
●
●
●
●●●●●
●
●●●
● ●
● ●● ●
●
●●
●
●
●
●●
●
●
●●
●
●
●●
●●● ●
●
● ● ● ●
●
●
●
●● ●●
●● ●
●●
●●●
●
●
●
●● ●● ●
●● ●
●● ● ● ● ● ● ●
● ●
●●●
●●
●●
●
● ● ●●
●●●●
●●● ●● ●
●●
●●
●
● ●● ●
●●
●
●
●●
●
●
●●
●
● ● ● ● ●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●●●●
● ● ●
● ●
●●●
●
●
●● ●
●
●●
●●
● ●
●●
● ●●
●● ●
● ●●
●● ●●●
●
●
●●
●● ●
●●● ●●
●
●
●●
●●●●●
●●
●●●
●
●● ●
●
●
●
●
● ●●
●●
●
●●●
●
●
●●●
●● ●
●●
●
●
●
●
●
●●
● ●●
● ●
●
●
●● ●
●
●●●
●
● ●●●
●●●
●
●
● ●
●
● ●●
●●●●
●
●
●
●
●
●
●
●
●●
● ● ●●
●●●
● ● ●
●
●
● ●
●
●●●
●
●
●●
●●
● ●
●●●● ●
●
●
●
●
●●
●●
●● ●●●
●●●●
●●
●
●
●●
●●
●
●
● ● ● ●●
●●
●● ●
●
●●
●● ● ●
●
●
●
●
●
●
●
●
●
●
●
● ●●● ●
●
●
●●●
●●
●
●
●●
●
●●
●
●
●
●● ● ●
● ●●●
● ●●
●●
●
●
●●
●
●
●
●●
●
●●
●
●● ●
●
●
●
● ●
●
●●
●
●●● ●●
●● ●●
●
●●
●
●
●
●
●
● ●
● ●●
● ●
● ●
●●
●
●
●●●
●
● ● ●●
●
● ●●
●●
●●
●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●●
●●
●●●
● ●● ● ●● ●
●
● ●
●●
●●
●● ●
●
●●
●
●
●● ● ●
●●●●● ●●
●
●●●● ● ●
● ● ● ●●●
●
●
●
●
●
●
●
●
● ●●
●
●
●
● ●
● ●● ●
● ● ●● ● ●
●● ●
●●●
●●
●
●
●
●
●
● ●
●
●● ● ●●● ●
●● ●●
●
●
●● ●
●●
●
●
●
●
●
●
●
●
●●
●
●●● ●●● ● ● ●
● ●
●
●●
●
● ● ●● ●● ●●●●
●●
●● ●
●
● ●
● ●
● ●● ●
●
●●●●
●
● ●
●
●●
●
●
●
●
●
●
●
●● ●
● ●
●
● ● ● ●●
●
●
● ●●
●
●
● ●
●
●
●●
● ●
●
●
●
● ●
●
●
●●
● ●
●
●
●
●
●●
●
●●
●
●●
●
●
●
●
●●
●
−4
0
●
●
●
−1
log(Employed/Inactive)
●●●
●
●
●
−3
●
log(Unemployed/Inactive)
●
●
●
●
0.20
0.25
●
0.30
0.35
0.40
0.45
0.50
Proportion of registered in social security system
0.04
0.06
0.08
0.10
0.12
0.14
Proportion of registered unemployed
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
7 / 47
Design-based estimators
Some notation:
k = 1, 2. The categories of the target variable:
1 for employed, and
2 for unemployed.
d = 1, . . . , D. The domains.
t = 1, . . . , T . The time periods.
Pdt . The population in domain d and period t.
Sdt . The sample in domain d and period t.
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
8 / 47
Design-based estimators
Target population parameters:
The totals of employed and unemployed people and the unemployed rate.
Ydkt =
X
ydtkj ,
Rdt =
j∈Pdt
Yd1
,
Yd1t + Yd2t
k = 1, 2.
Direct estimators:
The direct estimators of the total Ydkt , the size Ndt and the rate Rdt are
dir
Ŷdkt
=
X
j∈Sdt
dir
wdj ydtkj , N̂dt
=
X
dir
wdtj , R̂dt
=
j∈Sdt
dir
Ŷd2t
dir
dir
Ŷd1t
+ Ŷd2t
.
where the wdtj ’s are the official calibrated sampling weights which take
into account for non response.
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
9 / 47
Design-based estimators
Variance of direct estimators:
dir
dir
cd
ovπ (Ŷdk
, Ŷdk
)=
1t
2t
X
ˆ dir )(y
ˆ dir
wdtj (wdtj − 1)(ydtk1 j − Ȳ
dtk2 j − Ȳdk2 t )
dk1 t
j∈Sdt
This formula is obtained from Särndal et al. (1992), pp. 43, 185 and 391,
with the simplifications wdtj = 1/πdtj , πdtj,dtj = πdtj and πdti,dtj = πdti πdtj ,
i 6= j in the second order inclusion probabilities.
Variance of unemployment rate:
The design-based variance of R̂ddir approximated by Taylor linearisation
dir
V̂π (R̂dt
)=
dir 2
(Ŷd1t
)
dir
(Ŷd1t
+
dir 4
Ŷd2t
)
dir
V̂π (Ŷd2t
)+
2
Ŷd2t
dir
(Ŷd2t
+
dir 4
Ŷd1t
)
dir
V̂π (Ŷd1t
)−
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
dir
dir
2Ŷd1t
Ŷd2t
dir
dir 4
(Ŷd1t
+ Ŷd2t
)
June 2013
cd
ovπ
10 / 47
Design-based estimators
In the fourth quarter of 2008 the distribution of the sample sizes per domains
in the SLFS of Galicia has the quantiles
qmin = 13, q1 = 54, q2 = 97, q3 = 153, qmax = 1554.
This means that many of the direct estimators are not reliable.
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
11 / 47
The multinomial logit mixed model
We use indexes k = 1, . . . , (q − 1), d = 1, . . . , D and t = 1, . . . , T for the
categories of the target variable, the domains and the time instants.
In the main model (Model 3), we write the random effects in the vector
form
u1
=
u 2,d
=
col (u 1,d ),
1≤d≤D
col
1≤k ≤q−1
u 1,d =
(u 2,dk ),
col
(u1,dk ),
1≤k ≤q−1
u 2,dk = col (u2,dkt ),
1≤t≤T
u 2 = col (u 2,d )
1≤d≤D
u 2,dt =
col
1≤k ≤q−1
(u2,dkt ),
and we suppose that
1
u 1 and u 2 are independent,
2
u 1 ∼ N(0, V u1 ), where V u1 = diag ( diag (ϕ1k )), k = 1, . . . , q − 1.
1≤d≤D 1≤k ≤q−1
3
u 2,dk ∼ N(0, V u2,dk ), d = 1, . . . , D, k = 1, . . . , q − 1, are independent with
covariance matrix AR(1), i.e. V u2,dk = ϕ2k Ωd (φk ) and
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
12 / 47
The multinomial logit mixed model

Ωd (φk ) = Ωd,k
1

 φk
1 
 ..
=

1 − φ2k  .

 φT −2
k
φTk −1
φk
1
..
.
φTk −2
...
..
.
..
.
..
.
...
φkT −2
φTk −1

..
φTk −2
..
.








.
1
φk
φk
1
.
T ×T
It holds that V u = var(u) = diag(V u1 , V u2 ), where
V u2 = var(u 2 ) = diag ( diag (V u2,dk )).
1≤d≤D 1≤k ≤q−1
We assume that the response vectors y dt conditioned to u 1,d and u 2,dt ,
are independent with multinomial distributions
y dt |u 1,d ,u 2,dt ∼ M(νdt , p1,dt , . . . , pq−1,dt ), d = 1, . . . , D, t = 1, . . . , T .
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
13 / 47
The multinomial logit mixed model
The model
ηdkt = log
pdkt
= x dkt β k + u1,dk + u2,dkt
pdqt
where
d = 1, . . . , D, k = 1, . . . , q − 1, t=1,. . . ,T,
x dkt = col0 (xdktr ),
1≤r ≤lk
β k = col (βkr ),
1≤r ≤lk
l=
Pq−1
k =1 lk .
pdkt =
exp{ηdkt }
Pq−1
1 + `=1 exp{ηd`t }
is the probability of the category k in the domain d and the time instant t.
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
14 / 47
The multinomial logit mixed model
Other simpler models
Along this work we also consider three simpler models.
Model 2 is obtained by restricting Model 3 to φ1 = . . . = φq−1 = 0.
Model 1 is obtained by restricting Model 2 to one time period (T = 1) and
by considering only the random effect u 1 . This is the model studied by
López-Vizcaíno et al. (2013).
Model 0 is obtained by making u1,d1 = . . . = u1,dq−1 in Model 1. This is
the model studied by Molina et al. (2007).
Nevertheless, in the application to real data we apply the non-temporal
Models 1 and 0 to all the considered periods.
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
15 / 47
The multinomial logit mixed model
Fit the model
We combine the penalized quasi-likelihood (PQL) method, described by
Breslow and Clayton (1996) for estimating and predicting the β k ’s, the
u 1d ’s and the u 2d ’s, with the residual maximum likelihood (REML) method
for estimating the variance components ϕ1 and ϕ2 .
The presented method is based on a normal approximation to the joint
probability distribution of the vector (y, u)
The combined algorithm was first introduced by Schall (1991) and later
used by Saei and Chambers (2003) and Molina et al. (2007). We adapt it
to the presented multinomial logit mixed model.
Other alternative fitting algorithms producing consistent estimators of
model parameters are the method of simulated moments of Jiang (1988)
and the EM algorithm.
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
16 / 47
Model-based small area estimation
The problem is to estimate the domain total
mdt = N̂dt p dt ,
d=1,. . . ,D; t=1,. . . ,T.
N̂d is an estimated population size that can be obtained from the
dir
unit-level survey data. In the application to real data, we take N̂dt = N̂dt
We estimate mdt by means of
m̂dt = N̂dt p̂ dt
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
17 / 47
MSE estimation
Estimator 1: Analytical estimation
The employed fitting method requires two approximations:
1
The linearization of the GLMM link function.
2
The approximation of the distribution of the resulting linearized parameter
by a normal distribution.
The consequence of 1+2 is a LMM, where the G1 − G3 formulas of (Prasad and
Rao (1990)) can be applied. For the sake of brevity, we skip these formulas.
Therefore we approximate the mean square error of m̂d by means of
MSE(m̂dt ) = G1dt (σ) + G2dt (σ) + G3dt (σ),
where σ = (ϕ11 , . . . , ϕ1q−1 , ϕ21 , . . . , ϕ2q−1 , φ1 , . . . , φq−1 ) is the vector of
variance components.
The proposed analytic mean square error estimator is
mse(m̂dt ) = G1dt (σ̂) + G2dt (σ̂) + 2G3dt (σ̂).
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
18 / 47
MSE estimation
Estimator 2: Parametric bootstrap (González-Manteiga et al. (2007))
b , k = 1, . . . , q − 1.
1
Fit the model and calculate ϕ̂1k , ϕ̂2k , φ̂k and β
k
2
Generate u ∗1,d ∼ N(0, diag (ϕ̂1k )), u ∗2,dk ∼ N(0, ϕ̂2k Ωd (φ̂k )), and
1≤k ≤q−1
∗
∗
), where
, . . . , pdqt−1
y ∗dt ∼ M(νdt , pd1t
∗
pdkt
=
3
b ∗ and
Calculate ϕ̂∗1k , ϕ̂∗2k , φ̂∗k , β
k
∗
p̂dkt
=
4
∗
}
exp{ηdkt
∗
b xdkt + u ∗ + u ∗ , m∗ = N̂dt p∗ .
, ηdkt
=β
Pq−1
k
1,dk
2,dkt
dkt
dkt
∗
1 + `=1 exp{ηd`t }
∗
exp{η̂dkt
}
∗
b ∗ xdkt + u ∗ + u ∗ , m̂∗ = N̂dt p̂∗ .
, η̂dkt
=β
Pq−1
k
1,dk
2,dkt
dkt
dkt
∗
1 + `=1 exp{η̂d`t }
Repeat B times steps 2-3 and calculate the bootstrap mean square error
estimator
B
1X ∗
∗
∗ 2
msedkt
=
(m̂dkt − mdkt
) .
B
b=1
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
19 / 47
Simulation
We simulate a multinomial logit mixed model with three categories (q = 3).
For d = 1, . . . , D, k = 1, 2 and t = 1, . . . , T , generate:
the explanatory variables
1 d −D
k
t
Udkt =
+
+
,
3
D
q−1 T
xd1t
q
1/2
1/2 = µ1 + σx11 Ud1t , xd2t = µ2 + σx22 ρx Ud1 + 1 − ρ2x Ud2t ,
where µ1 = µ2 = 1, σx11 = 1, σx22 = 2 and ρx = 0.
the random effects u1,dk ∼ N(0, ϕ1k ) and u 2,dk ∼ N(0, V u2,dk ), with
ϕ11 = 1, ϕ12 = 2, ϕ21 = 0.25, ϕ22 = 0.5, φ1 = 0.5 and φ2 = 0.75.
the target variable y dt = (yd1t , yd2t )0 ∼ M(νdt , pd1t , pd2t ), where
pdkt =
exp{ηdkt }
, ηdkt = β0k + β1k xdkt + u1,dk + u2,dkt .
1 + exp{ηd1t } + exp{ηd2t }
νdt = 100, β01 = 1.3, β02 = −1, β11 = −1.6 and β12 = 1.
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
20 / 47
Simulation I
The target of the first simulation is to analyze the behaviour of the
estimators β k , ϕ1k , ϕ2k , φk and mdkt (with N̂dt = 1000).
As efficiency measures, we consider the relative empirical bias (RBIAS)
and relative mean squared error (RMSE).
Number of iterations is I = 1000.
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
21 / 47
Simulation I
RMSE and RBIAS for D = 100
T
RMSE(βb01 )
RMSE(βb02 )
RMSE(βb11 )
RMSE(βb12 )
RMSE(ϕ
b11 )
RMSE(ϕ
b12 )
RMSE(ϕ
b21 )
RMSE(ϕ
b22 )
RMSE(φb11 )
RMSE(φb12 )
4
0.33
0.34
0.34
0.34
0.17
0.25
0.39
0.39
0.94
0.79
8
0.28
0.28
0.36
0.35
0.17
0.18
0.19
0.18
0.50
0.39
12
0.22
0.23
0.31
0.32
0.15
0.18
0.13
0.12
0.30
0.24
T
RBIAS(βb01 )
RBIAS(βb02 )
RBIAS(βb11 )
RBIAS(βb12 )
RBIAS(ϕ
b11 )
RBIAS(ϕ
b12 )
RBIAS(ϕ
b21 )
RBIAS(ϕ
b22 )
RBIAS(φb11 )
RBIAS(φb12 )
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
4
-0.04
-0.05
-0.04
-0.03
0.02
0.15
-0.39
-0.39
-0.57
-0.38
8
-0.03
-0.02
-0.06
-0.05
-0.04
0.07
-0.18
-0.17
-0.50
-0.33
12
0.01
0.01
-0.06
-0.05
-0.05
-0.05
-0.11
-0.11
-0.29
-0.19
June 2013
22 / 47
Simulation I
RMSEdkt
RMSEd1t
RMSEd2t
RBIASd1t
RBIASd2t
T
1
D/2
D
1
D/2
D
1
D/2
D
1
D/2
D
4
0.09
0.11
0.13
0.12
0.09
0.09
0.004
-0.008
0.002
0.004
0.005
-0.002
8
0.09
0.11
0.14
0.12
0.10
0.08
-0.018
0.001
0.015
0.017
0.009
0.005
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
12
0.09
0.11
0.12
0.13
0.10
0.08
0.004
0.005
0.010
0.012
-0.011
-0.003
June 2013
23 / 47
Simulation II
The target of the second simulation is to study the behaviour of the two
mean square error estimators (analytic and bootstrap).
We consider D = 50 and T = 2, 4, 6.
Numbers of iterations are I = 500 and B = 500.
We present boxplots of MSE estimates for d = 25.
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
24 / 47
Simulation II
Boxplots for (d, t) = (25, 2), (25, 4), (25, 6).
0.003
0.002
0.004
0.003
●
0.002
0.003
0.002
k=1
(d,t)=(25,6)
0.004
(d,t)=(25,4)
0.004
(d,t)=(25,2)
●
●
0.000
mse
mse*
0.003
0.002
0.003
0.004
mse*
0.002
0.003
0.002
k=2
mse
0.004
mse*
0.004
mse
0.001
0.001
0.000
0.000
0.001
●
●
●
●
0.001
0.001
0.001
●
●
●
●
mse*
T=2
0.000
0.000
0.000
●
●
mse
mse
mse*
T=4
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
mse
mse*
T=6
June 2013
25 / 47
Application to real data
Unemployment rate − women − IV quarter 2008
Objective
Estimate the total of employed and
unemployed people and the
unemployment rates per sex in the
counties of Galicia.
No data (5)
<=5 (0)
5 − 10 (24)
10 − 15 (20)
>15 (4)
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
26 / 47
Application to real data
Model 0: parameter estimates for the fourth quarter of 2011.
Variable
CONSTANT
SEXAGE=1
SEXAGE=2
SEXAGE=3
SEXAGE=4
SEXAGE=5
STUD=1
SS
ϕ
Employed
Estimate
-1.85
-1.36
1.81
0.18
0.31
1.26
-0.39
2.95
Estimate
0.06
p-value
0.00
0.14
0.00
0.48
0.77
0.01
0.23
0.00
Std.Dev.
0.0055
Variable
CONSTANT
SEXAGE=1
SEXAGE=2
SEXAGE=3
SEXAGE=4
SEXAGE=5
STUD=1
REG
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
Unemployed
Estimate
-4.87
0.61
3.72
0.18
3.43
2.86
0.54
10.94
p-value
0.00
0.72
0.00
0.71
0.08
0.00
0.34
0.00
June 2013
27 / 47
Application to real data
Model 1: parameter estimates for the fourth quarter of 2011.
Variable
CONSTANT
SEXAGE=1
SEXAGE=2
SEXAGE=3
SEXAGE=4
SEXAGE=5
STUD=1
SS
ϕ11
Employed
Estimate
-1.26
-0.76
1.58
0.18
-1.36
1.41
-0.90
2.01
Estimate
0.011
p-value
0.00
0.42
0.00
0.49
0.21
0.00
0.01
0.00
Std.Dev.
0.0055
Variable
CONSTANT
SEXAGE=1
SEXAGE=2
SEXAGE=3
SEXAGE=4
SEXAGE=5
STUD=1
REG
ϕ12
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
Unemployed
Estimate
-4.90
1.46
3.72
0.42
2.34
3.46
0.58
8.99
Estimate
0.09
p-value
0.00
0.43
0.00
0.45
0.30
0.00
0.39
0.00
Std.Dev.
0.0295
June 2013
28 / 47
Application to real data
Model 2: parameter estimates.
Variable
CONSTANT
SEXAGE=1
SEXAGE=2
SEXAGE=3
SEXAGE=4
SEXAGE=5
STUD=1
SS
ϕ11
ϕ21
Employed
Estimate
-1.43
0.92
2.05
0.15
0.48
1.68
-0.82
1.49
Estimate
0.031
0.013
p-value
0.00
0.02
0.00
0.38
0.28
0.00
0.00
0.00
Std.Dev.
0.0032
0.0002
Variable
CONSTANT
SEXAGE=1
SEXAGE=2
SEXAGE=3
SEXAGE=4
SEXAGE=5
STUD=1
REG
ϕ12
ϕ22
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
Unemployed
Estimate
-3.87
1.88
2.35
-0.48
1.57
1.51
-0.49
11.84
Estimate
0.086
0.104
p-value
0.00
0.01
0.00
0.11
0.07
0.00
0.12
0.00
Std.Dev.
0.0186
0.011
June 2013
29 / 47
Application to real data
Model 3: parameter estimates.
Variable
CONSTANT
SEXAGE=1
SEXAGE=2
SEXAGE=3
SEXAGE=4
SEXAGE=5
STUD=1
SS
ϕ11
ϕ21
φ1
Employed
Estimate
-1.47
0.69
2.14
0.16
0.56
1.71
-0.78
1.50
Estimate
0.024
0.013
0.58
p-value
0.00
0.07
0.00
0.33
0.18
0.00
0.00
0.00
Std.Dev.
0.017
0.017
0.099
Variable
CONSTANT
SEXAGE=1
SEXAGE=2
SEXAGE=3
SEXAGE=4
SEXAGE=5
STUD=1
REG
ϕ12
ϕ22
φ2
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
Unemployed
Estimate
-4.40
2.26
2.98
-0.45
2.62
2.13
-0.27
12.40
Estimate
0.081
0.098
0.29
p-value
0.00
0.00
0.00
0.06
0.00
0.00
0.29
0.00
Std.Dev.
0.010
0.010
0.084
June 2013
30 / 47
Application to real data
Domain relative residuals versus predicted values
ydkt − ŷdkt
,
ŷdkt
rdkt =
d = 1, . . . , 96,
k = 1, 2,
t = 10.
●
Model 0
Model 1
Model 2
Model 3
●
●●
●●
●
●
●
●
●●●
●
●●●
●
0.2
●
●
●●● ●
●
●
●
●
●
●
−0.4
●
−0.4
−0.4
● ●
●
0.0
●
●
●
●
●
0.4
0.4
0.2
●
●
●●
Residuals
●
●
●
●
●
●
●
●
●●
●
● ●
●●●
●
●
●
●●
●●●●●●
●
● ●●
● ●●●
● ●
●●●
●●●
●●
●●
● ●●●●
●●●
●
● ●●
●● ●
●
● ●●
●
●●
●
●
● ●
●
●● ●● ●
●● ●
●
●
●●●
●
−0.2
●
●
●
●●●
●●●
● ●
●● ●● ●
●
●●●●●
● ● ●●
●● ●
● ●
●
●●
● ●●
●● ●
●
●●
●
● ●●
●
●●
●●
●●
●●
● ●
●●●
●
●●
●
● ●
●●
● ●
●
● ●
●●
●●●●
●●
●
●
● ●
●
●
●● ●
● ● ●
●
●
●
●
−0.4
●
●
●
●●
●●
●
●
● ● ●
●
●●
●●
● ●●
●
●● ●
●● ●●
● ●●
● ●
●● ●●
● ●● ●●
●
●
●● ●
●● ●●
●
●
●
●● ●
●
●●
●
●● ● ●
●
●
● ●●
0.0
●
●
●
●
●●●
●
Residuals
●●
● ●●
●
0.0
●
●●
Residuals
0.2
●
●
● ●
●
●
● ●
●●●
●
●● ●
●●
● ●
●●
●
● ●●
●
●
●●●
●
●●
●●●●●
●●
● ●●●
●●
●
●● ●●●
●
● ●
●
● ●●●
●●● ●●
●● ●
●
●●●●
● ●●●
●
● ●
●●
●
−0.2
0.0
−0.2
Residuals
0.2
●
−0.2
0.4
0.4
●
●
●
●
●
●
●
●
●
●
0
100
●
200
300
400
500
600
700
0
100
200
300
400
500
600
0
100
200
300
400
500
0 ●●
600
●
●
Predicted employed people
●
Predicted employed people
●
Predicted employed people
●
●
●
●
●
●
●●
●
●● ●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●●●
● ●
●●
●●
● ●
●
●
●
●
●
100
150
Predicted unemployed people
●
●
●
●
●
●
●
0.4
0.2
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
400
500
600
●
●●
●
150
Predicted unemployed people
200
0
●
●
●
●
●
●
● ●
● ●
●
● ●
●
● ●● ●
● ●● ●
●
● ●
●
●
●●●
●
●
●●
●●
●●
●● ●
●
●●●
●
●●
●
●
● ●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
100
Model 3
●
●
●
●
●
●
●
50
●
●●
●
●
●
0●
●
●
●●
●
●
●
●
●●
●●
●
●
●
●●
●
●
●●
●●
●● ●
●
●
50
●
●
● ●
●
0.0
0.2
●
●
●
●
●
● ●
●
●
●
●
● ●●●
●
●
●
●
0.0
●
● ●
●
●
0●●●
300
●
Residuals
●
●
●
●
●
●
●
Residuals
●● ●
200
●
●
●
●
●
●●
●
● ●
●
●
−0.2
●●
●●
●
●
−0.4
●
●
0.0
●
● ●
●
●
●
●
●
●●
●●
● ●
●
●
●
Model 2
●
●
●
●
●
0.4
0.4
●
●
●
Residuals
●
● ●●
●
●●●
●
●
● ● ●●
●
●
●●
●
−0.4
0.0
−0.2
0.2
●
●
●●
●●
●●● ● ●
−0.4
Residuals
0.2
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●● ●●
−0.2
0.4
●●
●
●
●
Model 1
●
●
−0.2
Model 0
●
●●
●
●
● ●
100
●
●
●
−0.4
●
Predicted employed people
●
●
●
50
100
150
Predicted unemployed people
200
0
●
●
●
50
100
150
200
Predicted unemployed people
●
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
31 / 47
Application to real data
Direct estimator versus model-based estimator.
Total of employed − model 1
Total of employed − model 2
80
80
●
●
●
●
● ●
0
20
40
60
0
Total of unemployed − model 1
20
40
60
10
15
Direct estimate
20
25
80
Total of unemployed − model 3
●
●
20
20
5
10
15
Direct estimate
20
25
model estimate
●
5
●
●
●
●
●
●
●
●● ●
●
● ●
● ●
●●●●
●
●●●
●●●●
●
●●
●●●
●
●
●●
●
●
●
●
●●●
●
●●
●●
●
●●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●
●
●
●
0
5
●
●
0
● ●
● ● ●●●
●● ●
●●
●
●
●●
●
●●
●●
●
●●
●
●●●
●
●
●
●
●
●●
●●
●
●●
●
●●
●●
●
●
●
●●
●
●
●
●
●
●
●
●●●
0
●
5
●● ●
●● ●
● ● ●●
●
15
model estimate
●
●
●
0
5
60
●
20
0
●
5
5
●
●● ●
40
Direct estimate
●
0
●●
●●
●●
●
● ●
●
● ● ●●● ●
●
● ●
●●●
●●●●
●
●
●●
●●
●●
●●●
●
●
●
●
●
●
●●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
20
●
15
model estimate
10
●
●●
0
Total of unemployed − model 2
10
20
15
●
60
80
●
●
40
model estimate
20
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
Direct estimate
●
model estimate
60
80
●
●
●
●●
●
●
●●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
Direct estimate
Total of unemployed − model 0
0
40
0
●
●
●●
●●
●●
15
80
●
●●
Direct estimate
●
10
60
● ●
●●
●●
●●
10
0
0
40
●
●
0
●
● ●
●
●●●
●
●
●●●
●●
●●●
●
●
●
●●
●
●●
●●
●
●
●●
●
●
●
●●
●●
●
●
●●
●●
●
20
20
●
●●
●
●
●
●●
●●
●
●
●●●
●
●●●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●●
●
model estimate
60
●
40
model estimate
60
40
20
model estimate
●
●
20
●
●
●
0
●
● ●
●
●
●●●
●
●
●
Total of employed − model 3
●
80
●
●
80
Total of employed − model 0
10
15
Direct estimate
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
20
25
●●
●
● ●
●
● ●
●
●
●
● ●
● ●
●●●●
●
●●●
●●●●
●
●●
●●●
●
●
●●
●
●
●
●
●●●
●
●
●
●●
●
●●
●
●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●
0
5
10
15
20
25
Direct estimate
June 2013
32 / 47
Application to real data
Men and women employment totals for the fourth quarter of 2011.
Employed men (thousands)− IV/2011
Employed women (thousands)− IV/2011
●
Direct
●
Model 0
Model 1
40
●
Direct
●
Model 0
Model 1
40
Model 2
Model 2
Model 3
Model 3
●
30
●
30
●
●
●
●
●
20
20
●
●
●
●
●
●
10
●
●
●
●
●●
●
●
0
●●
13
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
105
172
●
●●
13
●
●●
●
●
43
Sample size
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
●
●
●
0
63
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
38
●
●
●
●
10
●
●
●
●
●
●●●
71
107
189
Sample size
June 2013
33 / 47
Application to real data
Men and women unemployment rates for the fourth quarter of 2011.
●
Unemployment rate − men − IV/2011
Unemployment rate − women − IV/2011
●
Direct
●
Direct
●
Model 0
Model 0
Model 1
Model 1
Model 2
40
Model 2
40
Model 3
Model 3
●
●
●
●
30
30
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
20
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
10
●
●●
●
10
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
20
●
●
●
●
●
●
●
●
●●
●
●
0
●
13
●
0
●
38
63
105
172
●
13
●
●
43
Sample size
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
71
107
189
Sample size
June 2013
34 / 47
Application to real data
RMSEs of model-based estimator of unemployment rates in the fourth
quarter of 2011.
RMSE unemployment rate men − IV/2011
RMSE unemployment rate women − IV/2011
●
60
30
●
●
●
20
40
25
50
●
●
30
●
5
10
10
20
15
●
Model 0
Model 1
Model 2
Model 3
Model 0
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
Model 1
Model 2
Model 3
June 2013
35 / 47
Application to real data
Estimated totals of employed men (top) and their estimated
RMSEs (bottom) in the fourth quarter of 2011.
County
27
49
15
34
44
53
27
49
15
34
44
53
n
13
40
65
107
179
1347
13
40
65
107
179
1347
dir
410
2593
4369
4277
9971
90346
40.1
18.6
14.3
13.5
9.7
2.7
mod0
391
2317
4024
4774
9942
87308
20.0
26.9
28.1
13.3
4.2
2.2
mod1
399
2056
4229
4155
10013
84926
25.0
18.0
12.2
11.6
7.5
2.7
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
mod2
383
2077
4674
4333
10551
83662
17.7
12.6
11.3
8.8
6.8
2.4
mod3
393
2053
4644
4297
10651
84055
19.4
13.3
11.1
9.6
7.9
3.1
June 2013
36 / 47
Application to real data
Estimated totals of employed women (top) and their estimated
RMSEs (bottom) in the fourth quarter of 2011.
County
27
11
10
41
42
53
27
11
10
41
42
53
n
13
45
75
108
193
1554
13
45
75
108
193
1554
dir
292
2209
3301
3594
4616
78128
50.6
20.9
18.3
12.4
12.7
3.2
mod0
358
2332
3417
3614
5184
76854
10.9
9.7
12.2
8.5
9.3
2.8
mod1
318
2119
3226
3505
4983
75033
27.8
16.5
14.5
11.3
8.9
3.0
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
mod2
330
1974
3143
3593
4789
73316
16.1
10.8
11.1
6.1
6.0
1.8
mod3
336
2019
3179
3566
4830
74199
19.2
13.4
12.6
9.5
8.0
3.2
June 2013
37 / 47
Application to real data
Estimated men unemployment rates (top) and their estimated
RMSEs (bottom) in the fourth quarter of 2011.
County
27
49
15
34
44
53
27
49
15
34
44
53
n
13
40
65
107
179
1347
13
40
65
107
179
1347
dir
18.6
18.1
19.0
28.5
29.3
21.7
146.4
71.9
51.5
26.5
19.1
8.4
mod0
18.0
22.7
19.7
16.4
19.8
18.0
23.5
24.6
28.6
15.1
9.8
6.4
mod1
17.8
21.3
18.7
25.6
24.2
20.7
32.6
20.9
15.8
15.5
12.1
4.6
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
mod2
15.5
19.0
18.2
19.6
21.0
20.5
25.6
18.1
16.2
12.1
10.7
4.8
mod3
15.3
19.4
18.3
19.9
20.7
20.4
22.1
16.0
12.1
10.7
8.6
3.6
June 2013
38 / 47
Application to real data
Estimated women unemployment rates (top) and their estimated
RMSEs (bottom) in the fourth quarter of 2011.
County
27
11
10
41
42
53
27
11
10
41
42
53
n
13
45
75
108
193
1554
13
45
75
108
193
1554
dir
37.2
23.3
21.1
22.7
27.6
25.4
59.0
55.1
55.7
34.8
25.2
7.7
mod0
17.7
16.5
16.9
20.8
24.5
22.2
16.6
12.9
22.1
7.0
18.4
4.3
mod1
22.7
21.6
20.3
20.9
26.0
24.4
36.2
22.7
18.7
14.4
10.9
4.2
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
mod2
20.0
17.1
20.7
17.7
25.1
22.5
22.5
17.4
14.5
16.5
12.1
5.3
mod3
19.2
16.7
20.3
17.9
25.0
22.9
28.7
19.2
16.2
11.0
10.2
3.1
June 2013
39 / 47
Application to real data
Unemployment rates in some counties and all periods.
Unemploment rate − women − County 27
Unemploment rate − women − County 11
Unemploment rate − women − County 10
●
Direct
●
40
Direct
●
40
Model 0 ●
Direct
●
●
40
Model 0
Model 0
●
●
30
Model 1
Model 1
Model 2
Model 2
30
●
Model 3
Model 1
Model 2
30
Model 3
Model 3
●
●
●
●
●
●
●
20
●
20
20
●
●
●
●
●
●
●
●
●
●
●
●
10
0
●
10
●
●
●
1
2
3
4
5
6
7
8
9
10
0
1
2
3
4
Time, n= 13
5
40
40
10
1
3
4
6
7
●
Model 1
Model 2
Model 3
●
●
●
●
10
Model 0
30
●
20
●
9
Direct
●
40
●
●
●
8
Unemploment rate − women − County 53
Model
3
●
●
●
●
5
Time, n= 66
Model 2
●
30
Model 3
20
●
●
●
●
●
2
Model 1 ●
Model 2
20
9
Model 0
Model 1
●
8
Direct
●
Model 0
30
7
Unemploment rate − women − County 42
Direct
●
6
Time, n= 47
Unemploment rate − women − County 41
●
10
0
●
●
●
●
●
10
10
●
10
●
0
0
1
2
3
4
5
6
Time, n= 115
7
8
9
10
0
1
2
3
4
5
6
7
8
9
10
Time, n= 191
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
1
2
3
4
5
6
7
8
9
10
Time, n= 1510
June 2013
40 / 47
Application to real data
Measures for model comparison.
Loglikelihood
BIC
Model 2
-5269,5
10593.1
Model 3
-5037,6
10135.4
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
41 / 47
Application to real data
Model 3 estimates of men unemployment rates in Galician counties in
IV/2011 (left) and their variations between from IV/2009 to IV/2011 (right).
Unemployment rate − men − IV/2011
No data (2)
<=10 (5)
10 − 15 (25)
15 − 20 (14)
>20 (7)
Variation unemployment rate − men − IV/2009−IV/2011
No data (2)
<=1 (8)
1 − 2 (6)
2 − 3 (15)
>3 (22)
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
42 / 47
Application to real data
Model 3 estimates of women unemployment rates in Galician counties in
IV/2011 (left) and their variations between from IV/2009 to IV/2011 (right).
Unemployment rate − women − IV/2011
No data (2)
<=10 (4)
10 − 15 (21)
15 − 20 (15)
>20 (11)
Variation unemployment rate − women − IV/2009−IV/2011
No data (2)
<=1 (24)
1 − 2 (15)
2 − 3 (7)
>3 (5)
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
43 / 47
Conclusions
The model-based estimates have less mean squared error than the direct
ones.
The estimates are consistent in the sense that estimates of domain totals
of employed, unemployed and inactive people sum up to the size of the
domain.
In the simulations we have observed that the bootstrap estimators work
better than the G1 − G3 approximations.
Area-level temporal multinomial mixed models can be recommended for
estimating domain unemployment indicators.
Future research: Introduce alternative fitting algorithm and derive domain
estimates based on Empirical Best Predictors.
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
44 / 47
Acknowledgements
Acknowledgements
Supported by the Instituto Galego de Estatística, by the grants
MTM2009-09473 and MTM2008-03010 of the Spanish “Ministerio de Ciencia
e Innovación” and AAII DE2009-0030 “Grupos de referencia competitiva”
(2007/132) of the “Consellería de Educación e Ordenación Universitaria” and
by the Belgian network IAP-Network P6/03.
Thank you
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
45 / 47
References
Breslow, N. and Clayton, D. (1996). Approximate inference in generalized
linear mixed models. Journal of the American Statistical Association, 88,
9-25.
González-Manteiga, W., Lombardía, M. J., Molina, I., Morales, D. and
Santamaría, L. (2007). Estimation of the mean squared error of
predictors of small area linear parameters under a logistic mixed model.
Computational Statistics and Data Analysis, 51, 2720-2733.
Jiang, J. (1998). Consistent estimators in generalized linear mixed
models. Journal of the American Statistical Association, 93, 720-729.
López-Vizcaíno, E., Lombardía, M.J. and Morales, D. (2013).
Multinomial-based small area estimation of labour force indicators.
Statistical Modelling, 13, 153-178.
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
46 / 47
References
Molina, I., Saei, A. and Lombardía, M. J. (2007). Small area estimates of
labour force participation under multinomial logit mixed model. The
Journal of the Royal Statistical Society, series A, 170, 975-1000.
Prasad, N. G. N. and Rao, J. N. K. (1990). The estimation of the mean
squared error of small area estimators Journal of the American Statistical
Association, 85, 163-171.
Saei, A. and Chambers, R. (2003). Small area estimation under linear an
generalized linear mixed models with time and area effects. S3RI
Methodology Working Paper M03/15, Southampton Statistical Sciences
Research Institute.
Schall, R. (1991). Estimation in generalized linear models with random
effects. Biometrika, 78, 719-727.
Särndal, C. E., Swensson, B. and Wretman, J. (1992). Model Assisted
Survey Sampling, Springer.
M.E. López-Vizcaíno, M. J. Lombardía and D. Morales (Universities
EstimationofofSomewhere
labour forceand
indicators
Elsewhere)
June 2013
47 / 47

Similar documents

Die BUCHSTAVIER - Das Dosierte Leben

Die BUCHSTAVIER - Das Dosierte Leben Das Dosierte Leben Das Avant-Avantgarde-Magazin 16. Jahrgang

More information