Les articles publiés dans la série "Économie et statistiques" n'engagent que leurs auteurs. Ils ne reflètent pas forcément lesvues du STATEC et n'engagent en rien sa responsabilité.
NN° 11022 AAoût 22018
AAuteur: Christian Glocker, Austrian Institute of Economic Research –Department for Macroeconomics (WIFO)
A state space approach to forecasting short-term dynamics in Luxembourg
Abstract
This paper presents small-scale dynamic factor models in order to compute short-term forecasts for the Luxembourgish economy. Particular models are designed for each of the following variables (i) real goods exports, (ii) real private household consumption, and (iii) employment, commuters and unemployment, where the latter three variables are considered jointly within one model. The models are estimated using the Kalman filter, which in turn allows for a straightforward application of the dynamic factor models for nowcasting, backcasting and forecasting alike. To examine the real-time forecasting accuracy, a pseudo real-time analysis has been applied; the results highlight the superior forecasting performance of the small scale factor models to various alternatives, including experts' forecasts.
dynamics in Luxembourg
Abstract
This paper presents small-scale dynamic factor models in order to com-
pute short-term forecasts for the Luxembourgish economy. Particular models
are designed for each of the following variables (i) real goods exports, (ii) real
private household consumption, and (iii) employment, commuters and unem-
ployment, where the latter three variables are considered jointly within one
model. The models are estimated using the Kalman filter, which in turn allows
for a straightforward application of the dynamic factor models for nowcasting,
backcasting and forecasting alike. To examine the real-time forecasting accu-
racy, a pseudo real-time analysis has been applied; the results highlight the
superior forecasting performance of the small scale factor models to various
alternatives, including experts’ forecasts.
1 Introduction
Early assessment of the current economic conditions is of importance for economic
agents’ decision making and economic policy alike. The lack of timely information
associated with the publication of macroeconomic variables, the presence of missing
values in historical time series, and the problem surrounding mixed frequencies im-
pede the day to day monitoring of the economic activity. This applies also when
econometric models are used, which are estimated relying on traditional methods.
*Project Report written by Christian Glocker, Spring 2018. Thanks to Ferdy Adam and histeam, Lionel Fontagne, and Massimiliano Marcellino for support. Les opinions exprimees dans lapresente publication sont celles des auteurs et ne refletent pas forceement les opinions du STATECet de l’ANEC.
1
A state space approach to forecasting short-term
In this context the main challenges within model-based projections consists of
finding an appropriate statistical framework which allows to (1) analyze jointly data
of different sampling frequencies (i.e. monthly business cycle indicators and quarterly
National Account figures), (2) to deal with missing observations at various instances
of the time-series used, and (3) to take account of data revisions. The problem of
mixed frequencies could be solved in a straightforward fashion by aggregating all
monthly series to quarterly ones. This would allow standard estimation techniques,
though this approach is associated with a considerable loss of information as the
dynamics within a quarter are left un-modeled. Furthermore, this approach does
not solve the problem of how the latest available information from monthly business
cycle indicators could be used if observations are available only until the first or
second month within a quarter. Hence the method should allow for a quick update
of any forecast to incorporate new information on the highest possible frequency.
New information can also be in the form of data revisions. As this can at times be
useful, it could also lead to significant changes in the forecast trajectory rendering
the projections too sensitive to data revisions; this constitutes a key drawback of
auto-regressive models in forecasting time series as their forecasts are highly sensitive
to data revisions. Against this background, an adequate model should be able to
produce forecasts which are as far as possible invariant to data revisions.
Finally, as concerns the time span that the data series cover - sophisticated tech-
niques require a decent number of observations; series too short render the estimation
unfeasible and the results unreliable. This poses a particular challenge for variables
whose statistical data definitions is altered regularly. This problem applies to many
macroeconomic time series involving national accounts data, labour market data as
well as financial data, of which credit stock data in particular. Against this back-
ground the statistical method applied should be able to allow for the possibility of
missing observations in general.
Once having specified an appropriate statistical model which is capable of dealing
with all these aforementioned problems, the next challenge arises - forecasting. Per-
forming multi-period forecasts requires to decide whether to use a recursive model
based upon one-step-ahead forecasts, or a multi-period model that is estimated with
a loss function tailored to the forecast horizon. Although the recursive method pro-
vides more efficient parameter estimates than the direct method and does not require
different models for different forecasting horizons, it is prone to distortion if the
2
one-step-ahead model is incorrectly specified and usually requires separate forecast-
ing models for the explanatory variables. Which approach is better depends on the
properties of the forecast model and will ultimately be an empirical matter.
In this setup, one convincing approach is the VAR framework with mixed frequen-
cies. However, the calculation of the predictions from this approach usually suffers
from serious dimensionality related problems, especially when the number of time
series or their frequency increases. One possibility to account for this is by relying
on Bayesian methods. The parameter proliferation problem can also be addressed by
means of the well-known dimensionality reduction allowed for by factor models. Since
macroeconomic data are usually collinear, it is reasonable to assume that these are
multiple, indirect measurements of some low-dimensional sources that can be used
to reproduce most of the variability of a data set, although they cannot normally be
measured directly. Factor models in turn allow to calculate indices of macroeconomic
activity, which can be useful in tracking economic developments.
Against this background the dynamic factor model as proposed by Stock and Wat-
son (1992) and extended by Camacho and Perez-Quiros (2010) comprises a promis-
ing econometric approach. The basic idea is to separate a possibly large number of
observable variables into two independent and unobservable, yet estimable, compo-
nents: a common component that captures the main bulk of co-movement between
the observable variables, and an idiosyncratic component that captures any remain-
ing individual movement. The common component is assumed to be driven by a
few common factors, thereby reducing the dimension of the system. In this context
the models are converted into state space representations and estimated using the
Kalman filter. Since the estimation of a model by means of the Kalman filter is re-
cursive, the approach is able to take missing observations in the data set into account
in a relatively straightforward manner. The strategy is to skip some calculations
while others do not need to be changed so that the basic Kalman filter remains valid
and the parameters of the model can be estimated with maximum likelihood meth-
ods. This characteristic is of practical relevance when calculating forecasts, since
the future values of the time series can be considered as (yet) missing observations.
Consequently, the Kalman filter also provides the necessary calculations for forecasts.
Moreover, the estimation of a model by means of the Kalman filter occurs via two
steps which are repeated consecutively; one of these steps involves the computation
of forecasts. Hence the application of the Kalman filter to estimate models which are
3
in turn used for forecasting purposes is a natural extension of the Kalman filtering
technique. Considering the population parameters, the Kalman filter also provides
the mean square prediction error (MSFE).
In what follows this econometric methodology will be applied to key variables of
different sectors of the Luxembourgish economy. Section 2 introduces basic elements
of the dynamic factor model and discusses aspects relevant for the current application.
Section 3 establishes a dynamic factor model for real goods exports and Section 4
develops a model for private household consumption. Section 5 establishes a dynamic
factor model for key labour market variables; the analysis draws particular attention
to (i) employment, (ii) unemployment and (iii) commuters. Each section involves a
discussion on the precision of the forecasts of the dynamic factor model relative to
various alternative models as well as the forecasts of an expert panel. Moreover, the
analysis also features a discussion on the extent to which new information changes
the forecast trajectory. Finally Section 6 concludes.
2 Dynamic factor models
The basic idea of dynamic factor models (DFM) is that the information of an ob-
servable vector of time series under investigation can be explained by a vector of
unobserved components with the requirement that the vector of unobserved com-
ponents has a lower dimension than the vector of time series under investigation.
By this, dynamic factor models capture the most important co-movements of the
variables in the vector of observed time series.
Dynamic factor models are motivated by theory, which predicts that macroeco-
nomic shocks should be pervasive and affect most variables within an economic sys-
tem. They have therefore become popular among macroeconometricians (see, e.g.,
Breitung and Eickmeier 2006, for an overview). In particular, it has been demon-
strated that dynamic factor models are valuable in business cycle analysis (e.g. Forni
and Reichlin 1998; Eickmeier 2007; Ritschl, Sarferaz, and Uebele 2016), forecasting
(e.g. Stock and Watson 2002a,b) and nowcasting the state of the economy, that is,
forecasting of the very recent past, the present, or the very near future of indicators
for economic activity (see, e.g., Banbura, Giannone, Modugno, and Reichlin 2012,
and references therein).
Let yt =((xh
1,t)′, (xs
1,t)′, y1stt , yft
)′be a vector of N distinct time series available
4
at time t and It = {y1, ...,yt} be the information set including all information up
to and including time t. Then a dynamic factor model is usually specified such
that all observable variables in yt can be represented as the sum of two independent
components: a common component ft which is common to all variables in yt and
the remaining idiosyncratic component et (see for instance Bai and Ng, 2008). These
idiosyncratic disturbances arise from measurement error and from special features
that are specific to an individual series. The latent factors follow a time series process,
which is commonly taken to be a vector autoregression (VAR). In equations, the
dynamic factor model is,
• System of static equations(xht
xst
)=
(γh · ft
γs ·∑11
j=0 ft−j
)+
(uh
t
ust
)(1)(
y1stt
yft
)=
(ω
ω
)· [γqft + ut,q] +
(0
εt
)(2)
with ω := 13+ 2
3· L+ L2 + 2
3· L3 + 1
3· L4
• System of dynamic equations
(1− ϕε(L))εt = με + et, et ∼ NID(0, σ2e) (3)
φu(L) · ut,q = νqt with νq
t ∼ NID(0, σ2
q
)(4)
φf (L) · ft = νft (5)
Φu(L)
(uh
t
ust
)= νt (6)(
νft
νt
)∼ NID
(0,
[σ2f 0
0 Σν
])(7)
where xht is a vector of nh hard indicators on a monthly frequency, xs
t is a vector of
ns soft indicators on a monthly frequency, y1stt and yft typically a national account
variable involving its first release (y1stt ) and final value (yft ) with ε being the revision
term. uht and us
t are some residuals, allowed in turn to follow an autoregressive
process, and finally - the key feature of the dynamic factor model - is to assume a
5
dynamic equation for the factor ft which is done by equation (5). The idiosyncratic
disturbances are assumed to be uncorrelated with the factor innovations at all leads
and lags, that is, Eνtuit+k = 0 ∀k ∈ Z and i ∈ {s, h}.
In this set-up, y1stt and yft are usually specified as quarter-on-quarter (q-o-q)
growth rate. The term ω hence decomposes the q-o-q to a month-to-month growth
rate and relates it to the variables in xht which are specified in terms of month-
to-month growth rates. Since soft indicators typically have a high correlation with
the year-on-year growth rates of the variables in xht , y
1stt and yft , the second line in
equation (1) hence relates the soft indicators to the approximate year-on-year growth
rate of the factor ft. This model set-up has been frequently used in the literature
(REF) and has become a standard for short-term forecasting exercises.
Estimation of dynamic factor models concern foremost the common component.
The idiosyncratic component is generally considered as a residual. The common com-
ponent of the dynamic factor model may be consistently estimated in the frequency
domain by spectral analysis; see Forni, Hallin, Lippi, and Reichlin (2000, 2004). The
main benefit of the factor model, is that the common component may be consistently
estimated in the time domain, which reduces the computational complexity. The
workhorse for estimating the factor is the method of principal components (PC). In
a second step, the dynamic equation (5) can be estimated by standard ordinary-
least-squares method treating the estimated factors in ft as observed variables. This
method is easy to compute, and is consistent under quite general assumptions as long
as both the cross-section and time dimension grow large. It suffers, however, from
a large drawback: the data set must be balanced, where the start and end points of
the sample are the same across all observable variables. In practice, data are often
released at different dates. A popular approach is therefore to cast the dynamic factor
model in a state space representation and then estimate it using the Kalman filter,
which allows unbalanced data sets and offers the possibility to smooth missing val-
ues. The state space representation contains a signal equation, which links observed
variables to latent states, and a state equation, which describes how the states evolve
over time. The Kalman filter and smoother provide mean-square optimal projections
for both the signal and state variables. In what follows the dynamic factor model will
be cast into the structure of a state space model where the measurement equation
6
and the transition equation read as follows:
yt = Hst +wt, wt ∼ NID (0, R) (8)
st = Fst−1 + vt, vt ∼ NID (0, Q) (9)
with a corresponding definition of the matrizes H, F , R, Q, the vectors yt, st, wt
and vt and their relation to the equation system (1)-(7).
If all series in the model were observable at a monthly frequency and the data panel
was balanced, then the estimation of the dynamic factor model could be implemented
using standard maximum likelihood methods in conjunction with the Kalman filter.
This assumption is, however, rather unrealistic, since in our empirical application
we have to deal with mixed frequencies and with time series which are published at
different time lags; moreover the series start at different points in time.
According to Mariano and Murasawa (2003), with the subtle transformation of
replacing missing observations by random draws rt ∼ iidN(0, σ2r), the system of
equations remains valid. Importantly, if the distribution of rt does not depend on
the parameter space that characterizes the Kalman filter, then the matrices in the
state-space representation are conformable and do not have an impact on the model
estimation since the missing observations just add a constant term in the likelihood
function to be estimated.
Let yi,t be the ith element of vector yt and let Rii be its variance. Let Hi,t be the
ith row of matrix Ht, which has z columns. The measurement equation can then be
replaced by the following expressions
y∗i,t =
⎧⎨⎩yi,t if yi,t is observable
rt otherwise,(10)
H∗i,t =
⎧⎨⎩Hi if yi,t is observable
01×z otherwise,(11)
w∗i,t =
⎧⎨⎩0 if yi,t is observable
rt otherwise,(12)
7
R∗ii,t =
⎧⎨⎩0 if yi,t is observable
σ2r otherwise.
(13)
With this transformation the time-varying state-space model can be treated as having
no missing observations and the Kalman filter can be directly applied to y∗t , H
∗t , w
∗t
and R∗t . The implementation of this algorithm corresponds to expanding yt, H, wt
and R in equation (8) by means of an indicator function which takes into account if
yi,t ∈ yt is observed or not.
The estimation of the model’s parameters can be developed by maximizing the
log-likelihood of {y∗t }t=T
t=1 numerically with respect to the unknown parameters in
matrices. Let st|t−1 be the estimate of st based on information up to period t − 1.
Let Pt|t−1 be its covariance matrix. The prediction equations are:
st|t−1 = F st−1|t−1, (14)
Pt|t−1 = FPt−1|t−1F′ +Q. (15)
The predicted value of yt with information up to t − 1, denoted yt|t−1 is yt|t−1 =
H∗t st|t−1, such that the prediction error is ηt|t−1 = y∗
t − yt|t−1 = y∗t − H∗
t st|t−1 with
covariance matrix ξt|t−1 = H∗t Pt|t−1H
∗t +R∗
t . In each iteration, the log-likelihood can
therefore be computed as
logLt|t−1 = −1
2ln(2π∣∣ξt|t−1
∣∣)− 1
2η′t|t−1
(ξt|t−1
)−1ηt|t−1. (16)
The updating equations are:
st|t = st|t−1 +K∗t ηt|t−1 (17)
Pt|t = Pt|t−1 −K∗t H
∗t Pt|t−1 (18)
in which K∗t is the Kalman gain defined as K∗
t = Pt|t−1H∗′t (ξt|t−1)
−1. The initial
values: s0|0 = 0 and P0|0 = I used to start the filter are a vector of zeros and the
identity matrix, respectively.
Computing short-term forecasts in real-time from this model is straightforward.
The future values of the time series can be regarded as missing observations at the
end of the sample periods. The Kalman filter accounts for the missing data which
8
are replaced by forecasts. Particularly, the k-period ahead forecasts are
yt+k|t = H∗t st+k|t, (19)
with st+k|t = F kst|t.
To conclude, the Maximum-Likelihood estimation of the dynamic factor using
the Kalman filter can smooth over missing values, allowing an unbalanced panel with
missing values at the end or start of the panel; the so called ragged-edge-problem.
This feature is very valuable for economic forecasting, because key economic indica-
tors tend to be released at different dates. For any estimation approach, the number
of factors q is generally unknown, and needs to be either estimated or assumed. Pop-
ular estimators for the number of factors in approximate factor models can be found
in Bai and Ng (2002), Onatski (2010) and Ahn and Horenstein (2013). Throughout
the paper, the number of factors will be treated as known.
An important step in developing an appropriate model addresses the selection of
the variables. This applies not only to the specific variables used, but also on the
time dimension; that is, assume the interest lies in setting up a model in order to
better forecast a core-variable xt and to use an appropriate auxiliary variable yt−k.
The selection along the time-dimension would result in judging the forecast of xt in
relation to k ∈ Z.
The process surrounding the selection of appropriate variables and their time-
structure can hence be endless. For this it is convenient to rely on an efficient strategy
in order to judge if a variable is accepted or not. In what follows, this will involve two
steps. The first step is based on the experience of professional forecasters in order
to gather a pre-selected list of variables. The second step now involves the dynamic
factor model. Once a pre-selected list has been specified, each particular variable is
analyzed and included as an additional variable only if it increases the percentage
of the variance of the core-variable explained by the common factor(s). Within this
step, the time-dimension is considered as an additional search-direction.
2.1 Some practical issues
The model outlined in equations (1)-(7), has been frequently used when the variable
yft has some (significant) positive degree of (first-order) autocorrelation. The perfor-
mance of this model deteriorates significantly if quarterly growth rates of the data in
9
yft are used. This is due to the zero autocorrelation or even negative autocorrelation.
An approach that seems more promising in this case, is to utilize year-on-year (y-o-y)
growth rates instead. Extending the use of y-o-y growth rates also for the variables
in xht yields a factor ft that can be interpreted in terms of y-o-y growth rates. Fi-
nally, since the variables in xst have a high correlation with the y-o-y growth rates of
the variables in xht and yft , the model hence simplifies significantly to the following
structure:
• System of static equations(xht
xst
)=
(γh · ftγs · ft
)+
(uh
t
ust
)(20)(
y1stt
yft
)=
(ω
ω
)· [γqft + ut,q] +
(0
εt
)(21)
with ω = 1
• System of dynamic equations
(1− ϕε(L))εt = με + et, et ∼ NID(0, σ2e) (22)
φu(L) · ut,q = νqt with νq
t ∼ NID(0, σ2
q
)(23)
φf (L) · ft = νft (24)
Φu(L)
(uh
t
ust
)= νt (25)(
νft
νt
)∼ NID
(0,
[σ2f 0
0 Σν
])(26)
The simplification is due to the fact that the state space of the model is reduced.
In what follows, this set-up will be used, modified with minor extensions in order
to adjust the dynamic factor model to specific features of the data. The state space
representation of this model structure is presented in Section A in the Appendix.
The selection of the indicators is based on three steps. First of all, a set of
core variables is chosen utilizing the advice of professional forecasters at Statec; this
comprises in general around thirty variables for each model. This amount of variables
is too large of being used effectively in the model as computational difficulties arise.
10
Hence, as a second step the long list of variables is reduced by searching along two
dimensions: cross-sectional and temporal. Each variable has then been analyzed if
it should be added to the model; if added the analysis also evaluates an appropriate
lag. It has been decided to include an additional variable when it increases the
percentage of variance explained by the common factor. This procedure is applied
for each dynamic factor model as discussed in Sections 3 to 5. In addition, to be
included in the dynamic factor model, all of these series have been normalized to
have zero mean and unit variance.
The key element in judging the adequacy of the proposed models addresses their
forecast performance. In this context Stark and Croushore (2002) among others sug-
gest that the analysis of in-sample forecasting performance of models is questionable
since the result can be deceptively less conclusive when using a real-time data set.
This is due to the fact that the in-sample analysis misses to adequately capture the
following three elements in the analysis: (i) the recursive estimation of the parame-
ters of the model; (ii) the flow of the data in real time (this addresses the problem
that data are released at different points in time); and (iii) revision of the data.
Producing real-time data vintages is, however, not feasible for several variables
within the current application. For this reason the out-of-sample analysis carried out
here follows instead the example of Stock and Watson (2002) in which (i) holds. The
method consists of computing forecasts from successive enlargements of a partition
of the latest available data set. Using this recursive sample structure, the model is
fully estimated and h-step ahead forecasts are computed. This procedure continues
iteratively until the final iteration, which is computed from a model that uses the
complete latest available observation.
Within the iterative enlargement of the data set, the analysis takes the publication
lag of certain variables into account. This procedure is based on trying to mimic as
closely as possible the real time analysis that would have been performed in case a
truly real-time data set were available. The current analysis can hence be considered
as a pseudo real-time analysis.
In what follows this procedure is applied to all factor models developed here in
order to (i) assess the forecast trajectory of the model within periods characterized
by exceptional economic fluctuations, and (ii) to assess the precision of the forecasts
of the dynamic factor model relative to alternative models.
Finally it has to be stressed that the specification of the DFMs should not be
11
understood as a structural model; instead the models focus purely on a Granger-
causality objective.
3 Specifying a DFM for real goods exports
The following describes a dynamic factor model for real goods exports. The particular
model used is given by equations (20)-(26), where (22) is specified as follows:
εt = με + et, et ∼ NID(0, σ2e) (27)
The variables used for the estimation are given by:
xht =
⎡⎢⎢⎢⎢⎢⎢⎢⎢⎢⎣
steel productionyoy,Mt−7
steel prodroduction EUyoy,Mt−3
industrial productionyoy,Mt−2
prices of iron and steel productsyoy,Mt
PPI EAyoy,Mt
GDP EAyoy,Qt
⎤⎥⎥⎥⎥⎥⎥⎥⎥⎥⎦
(28)
xst =
[PMI US manufacturingMt−7
](29)
where yoy indicates that a variable enters in year-on-year growth rates and the su-
perscripts M or Q indicate if the time series is available on a monthly or quarterly
frequency. yft and y1stt represent the growth rate of the final real goods export series
as well as its first estimate respectively. The data are depicted in Figure 1 and fur-
ther information can be found in Table 11; data for the two measures of real goods
export are shown in Figure 2; as can be seen in the figure, the two measures for the
growth rate of real goods exports follow on average a rather similar path, however,
there are indeed several years where the difference is sizable. The model takes this
into account by means of equation (27). A detailed overview of the data used can be
found in Section B in the Appendix.
The variables were selected following the procedure described in Section 2. As can
be seen, the final list of variables contains three indicators which are directly related
to the iron&steel market (steel production in Luxembourg; steel production in the EU
and the (domestic) price of iron and steel products). In the case of Luxembourg this
12
Figure 1: Data used - DFM for real goods exports
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-3
-2
-1
0
1
2
3
1985 1990 1995 2000 2005 2010 2015 2020
-6
-4
-2
0
2
1985 1990 1995 2000 2005 2010 2015 2020
-4
0
4
8
12
1985 1990 1995 2000 2005 2010 2015 2020
-4
-2
0
2
4
6
1985 1990 1995 2000 2005 2010 2015 2020
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-3
-2
-1
0
1
2
3
1985 1990 1995 2000 2005 2010 2015 2020
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-4
-3
-2
-1
0
1
2
1985 1990 1995 2000 2005 2010 2015 2020
-6
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
13
XB_R_Q XB_FR PMIMANUS_M
PRODSTEELLU_M
PRODSTEELEU_M PRODINDLU_M P_MET_M
P_PPIEA_M
PIBEA_R_Q PRODINDHW_M
Figure 2: Data used for real goods exports
-.3
-.2
-.1
.0
.1
.2
.3
96 98 00 02 04 06 08 10 12 14 16 18 20
seems appropriate, since steel products comprise a major part of real goods exports.
Table 1 shows the estimated coefficients of the complete model jointly with com-
mon further statistical measure to evaluate the parameters’ statistical importance,
and also a measure for the weight of each variable in the model. As can be seen
there is some evidence for a non-zero mean με of the revision term εt. One has to
take into account here that the estimation of the equation of the revision term is
based on a few observations only, since the first estimate for the growth rate of real
goods exports starts only in the year 2009. Any extension of equation (27) involving
auto-regressive or moving-average elements did not contribute to improve the model
fit.
The autoregressive coefficients for the factor equation are significantly different
from zero. Their particular parameter values imply that the stochastic process for
ft is governed by complex roots. Including a third lag would results in a parameter
estimate for the third lag which is rather small and statistically not different from
zero. The extent to which the dynamic factor is governed by complex roots can also
be seen in Figure 3 where the left subplot shows the path of the latent factor over time.
Moreover, the subplot highlights in how far the factor captures the recent peaks and
troughs surrounding the global financial crisis and its aftermath. The second subplot
14
XB_FR XB_R_Q
Figure 3: Factor and Revision Term
-80
-60
-40
-20
0
20
40
1985 1990 1995 2000 2005 2010 2015 2020
S1 – 2 RMSE
Smoothed Factor
-1
0
1
2
3
1985 1990 1995 2000 2005 2010 2015 2020
RVER_0 – 2 RMSE
Smoothed Revision Error
in Figure 3 shows the estimated revision term. Revisions were sizable especially in
the year 2016; on average there seems to be no systematic autocorrelation structure,
however, considering the parameter estimate for με, the first release of the growth
rate of real goods exports seems to be systematically lower than the final value.
Turning to the factor loadings, the estimates imply a rather high value for exports,
steel production in the EU and the price measure for iron and steel products. The
factor loadings of the remaining variables are slightly lower. To the extent that factor
loadings show the contribution of a particular variable in shaping the path of the
extracted latent factor, one can interpret these loadings also as weights. Considering
the factor loading of each variable relative to the factor loading of the export variable
implies that the contribution of the price measure of iron and steel products has a
weight of around 75%, and of the variable capturing steel production in the EU of
around 70%, and so on. This information can be used to judge the contribution of
an update of a particular variable in shaping the model’s prediction for the growth
rate of real goods exports.
Finally, Figure 4 shows the overall model fit. The residuals have been checked for
normality and autocorrelation; the corresponding tests do not provide evidence for
non-normality or autocorrelation.
15
Table 1: Estimated coefficients
Coeff. St. Dev. t-Value Prob. rel. Weight
factor loadingsexports γi 0.09 0.02 5.58 0.00 1.00pmi US manufacturing γi 0.05 0.02 3.31 0.00 0.55steel production γi 0.05 0.02 2.37 0.02 0.37steel production EU γi 0.08 0.02 3.58 0.00 0.70industrial production γi 0.06 0.02 3.91 0.00 0.65prices of iron and steel products γi 0.08 0.02 3.41 0.00 0.75PPI EA γi 0.06 0.02 3.33 0.00 0.54GDP EA γi 0.07 0.02 3.46 0.00 0.57autoregressive coefficientsexports φi,1 -1.71 0.07 -22.84 0.00
φi,2 -0.82 0.07 -12.22 0.00pmi US manufacturing φi,1 0.92 0.06 15.09 0.00
φi,2 -0.01 0.06 -0.11 0.91steel production φi,1 0.41 0.05 7.56 0.00
φi,2 0.07 0.11 0.61 0.54steel production EU φi,1 1.16 0.08 15.13 0.00
φi,2 -0.30 0.07 -4.57 0.00industrial production φi,1 0.52 0.07 7.54 0.00
φi,2 0.17 0.07 2.51 0.01prices of iron and steel products φi,1 1.03 0.11 8.99 0.00
φi,2 -0.16 0.12 -1.36 0.18PPI EA φi,1 1.46 0.06 25.38 0.00
φi,2 -0.51 0.05 -9.20 0.00GDP EA φi,1 1.13 0.33 3.47 0.00
φi,2 -0.20 0.32 -0.61 0.54factor φf,1 1.82 0.05 34.44 0.00
φf,2 -0.87 0.05 -16.92 0.00revision termmean με 0.30 0.19 1.56 0.12
The stability of the estimated parameters and the overall model fit are assessed
using the evidence presented in Figures 5 and 6. The red lines in Figure 6 refer to
the band (min vs. max) of the variation of the point estimate of each factor loading
from an estimation where the sample size is extended recursively; and the boxplots
show the point estimates and a one-standard and two-standard deviation confidence
interval based on an estimation using the whole sample. As can be seen the variation
of the factor loadings from an estimation based on recursively extending the sample
size is rather small. In general it lies within the one-standard deviation interval from
the estimation using the whole sample. Against this background, the evidence points
towards rather stable parameter estimates over time. This argument is supported
additionally by the evidence presented in Figure 5; as can be seen, the temporal
variation of the overall model fit is rather small. The correlation coefficient of the
extracted factor with the final measure for real goods exports varies in a close band
ranging from 0.92 - 0.94.
16
Figure 4: Model fit
-4
-2
0
2
4
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-4
-2
0
2
4
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-4
-2
0
2
4
-6
-4
-2
0
2
1985 1990 1995 2000 2005 2010 2015 2020
-5
0
5
10
-5
0
5
10
1985 1990 1995 2000 2005 2010 2015 2020
-4
0
4
-4
-2
0
2
4
6
1985 1990 1995 2000 2005 2010 2015 2020
-4
-2
0
2
4
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-4
-2
0
2
4
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-4
-2
0
2
4
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-4
-2
0
2
4
-4
-2
0
2
1985 1990 1995 2000 2005 2010 2015 2020
Std. Residuals
Actual
Predicted
17
XB_R_Q XB_FR PMIMANUS_M
PRODSTEELLU_M PRODSTEELEU_M PRODINDLU_M
P_MET_M P_PPIEA_M PIBEA_R_Q
Figure 5: Parameter Stability - Dynamic Correlation Coefficient
2004 2006 2008 2010 2012 2014 2016 20180.85
0.9
0.95
1
Figure 6: Parameter Stability - Factor loadings
expo pmi_us steel prod steel prod EU IP price PPI EA GDP EA global IP-0.05
0
0.05
0.1
0.15Factor loadings: γ
3.1 Forecast evaluation
The forecast evaluation is done by considering standard statistical measures, as for
instance, mean-squared errors, and mean-absolute error statistics. However, before
18
Figure 7: Forecast trajectory for annual growth rates
Oct 2007 Apr 2008 Oct 2008 Apr 2009 Oct 2009 Apr 2010 Oct 2010-20
-15
-10
-5
0
5
10
15
Gro
wth
rat
e [in
%]
200820092010
turning to these measures in more detail, it is convenient to analyze the model’s
methodology in generating forecasts. Figure 7 is intended to provide some insights
for this. The figure shows predictions for the annual growth rate of real goods exports
for the years surrounding the global financial crisis (2008-2010). Each square refers
to the point in time when a forecast was made - that is, always in the middle (second
month) of a quarter. In particular, consider for instance the first black square -
this square is a forecast for the annual growth rate of real goods exports made in
November 2007. The forecast for the 2008 annual growth rate made in November
the year before is already close to the realized growth rate (black horizontal line).
This is especially due to the fact that this year’s annual growth rate turned out to
be close to the long run mean growth rate which is around 4%. For the following two
years, the realized growth rate is, however, far off the mean. The forecast trajectory
in the figure highlights the extent to which new information in form of updates of
the monthly series, triggers a change in the model’s forecast. This turns out to be
particularly striking for the forecasts of the year 2009 made in late 2008. At that time,
the indicators, as for instance the US PMI, industrial production in Luxembourg and
several other indicators declined drastically, inducing the dynamic factor model to
change the forecast. Within two quarters only, the growth forecast was reduced from
19
+4% (September 2008) down to -14% (March 2009), which turned out to be already
rather close to the observed rate of around -17%. It has to be stressed here that
the realized growth rate of the year 2009 is more than two standard deviations off
the unconditional mean. Against this background, the model produces a reliable
prediction for the annual growth rate already as early as in March of the same year.
The model’s core structure is autoregressive - this can be seen in the model’s
forecasts. The forecast trajectory for the annual growth rate for the year 2010 fol-
lows the forecast trajectory of the year 2009 to some extent; however, once valuable
information for the year 2010 is available, the model quickly adjusts its forecasts.
This explains the tremendous forecast revision from around -2% to up to +14% for
the growth forecast for the year 2010.
Table 2: Forecast evaluation for exports - annual growth rates
full sample sub-sample
MSE DFM 47.201 57.080ARMA(2,2) 58.818 79.991RW 192.297 292.702Expert 63.268
Relative performance of DFM toARMA(2,2) 0.803 0.714RW 0.245 0.195Expert — 0.902
MAE DFM 4.751 4.878ARMA(2,2) 6.004 7.797RW 11.457 13.805Expert 6.183
Relative performance of DFM toARMA(2,2) 0.791 0.626RW 0.415 0.353Expert — 0.789
Figure 7 only shows the forecast trajectory of the dynamic factor model. Table 2
in turn confronts the dynamic factor model’s forecasting performance with alternative
models: an ARMA(2,2) model and a Random-Walk (RW). In line with Figure 7, the
focus in the table is on annual growth rates. As can be seen, the mean-squared error
and the mean-absolute error of the dynamic factor model are noticeably smaller than
those of the two alternative models. The comparison of the dynamic factor model with
the ARMA(2,2) highlights the extent to which high frequency information improves
the forecasts. The dynamic factor model’s stochastic structure is basically of the
form of an ARMA(2,2) model: the latent factor is governed by an auto-regressive
process of order two, and the error terms of equation (20) and (21) are comparable to
20
a moving-average process of order two. Hence the key difference arises from the high-
frequency information that enters the dynamic factor model but not the ARMA(2,2)
model. The smaller MSE and MAE statistics in the case of the dynamic factor model
relative to the ARMA(2,2) model hence show in how far this additional information
increases the precision of the forecast.
Table 2 also offers a comparison of the dynamic factor model with the forecasts
of an expert-panel. As can be seen the MSE and MAE statistics of the factor model
are slightly smaller than the ones of the expert panel; however care has to be taken
here as the number of observations in this case is fairly small.
Table 3: Forecast evaluation for exports - quarterly growth rates
1 quarter 2 quarters 4 quarters
MSE DFM 0.0061 0.0069 0.0096ARMA(2,2) 0.0066 0.0087 0.0126RW 0.0094 0.0167 0.0314
Relative performance of DFM toARMA(2,2) 0.9187 0.7862 0.7627RW 0.6484 0.4100 0.3064
MAE DFM 0.0643 0.0705 0.0763ARMA(2,2) 0.0632 0.0754 0.0885RW 0.0777 0.1067 0.1430
Relative performance of DFM toARMA(2,2) 1.0174 0.9350 0.8618RW 0.8275 0.6601 0.5337
Equal predictive accuracy test (p-values)ARMA(2,2) 0.5562 0.0851 0.0424RW 0.0814 0.0000 0.0000
Table 7 assessed the model’s performance only by means of annual growth rates.
This leaves in general too few observations to carry out statistical tests. Table 3
carries out the same exercise using the y-o-y growth rate for each quarter starting in
the year 2004 until 2016. As can be seen, the dynamic factor model again performs
better than the two alternative models. Since now there is a sufficient number of
observations, the Diebold and Mariano (1995) test can be applied. The test results
provide evidence for a significant difference of the forecasts of the ARMA(2,2) and
RW model in relation to the dynamic factor model. The dynamic factor model’s
forecasts outperform those of the other two models especially for the two, three (not
shown in the table) and four quarter ahead predictions. As concerns the one-quarter
ahead forecast, the MSE and MAE statistics of the dynamic factor model are smaller
than those of the other two models, however, the Mariano-Diebold test does not find
21
evidence for a statistically significant difference for that.
4 Specifying a DFM for real private household
consumption
The dynamic factor model specified for real private household consumption is con-
structed in the same fashion as the one for real goods exports discussed in the previous
Section. Again, the model is based on equations (20)-(26), where equation (22) is
specified as follows:
(1− ϕε · L)εt = et, et ∼ NID(0, σ2e) (30)
and L is the lag operator. The variables used for the estimation are given by:
xht =
⎡⎢⎢⎢⎢⎢⎢⎢⎢⎢⎣
employment Dyoy,Mt
social contributionsyoy,Mt−3
unemploymentyoy,Mt−5
car registrationyoy,Mt−1
GDPyoy,Qt
house pricesyoy,Qt−2
⎤⎥⎥⎥⎥⎥⎥⎥⎥⎥⎦
(31)
xst =
⎡⎢⎣ CSI: car purchaseMt
CSI: economic situation in 12 monthMt−7
yield curveMt
⎤⎥⎦ (32)
where employment D refers to employment excluding commuters and CSI refers
to the indicators in the Consumer Sentiment Indicator set; yft and y1stt are given be
the y-o-y growth rate of the final real private household consumption series as well as
its first estimate respectively. The data are depicted in Figure 8 and further details
are provided in Section B in the Appendix; the data for the two measures of real
private household consumption are shown in Figure 9. As can be seen in the figure,
the two measures for the growth rate follow on average a rather similar path, however,
there are indeed several years where the difference is sizable. The model takes this
into account by means of equation (30). Figure 9 already gives a first visual hint
towards the possibility of a positive autocorrelation in the revision term.
22
Figure 8: Data used - DFM for real private household consumption
-2
-1
0
1
2
3
4
1985 1990 1995 2000 2005 2010 2015 2020
-2
-1
0
1
2
1985 1990 1995 2000 2005 2010 2015 2020
-4
-3
-2
-1
0
1
2
1985 1990 1995 2000 2005 2010 2015 2020
-1.2
-0.8
-0.4
0.0
0.4
0.8
1.2
1985 1990 1995 2000 2005 2010 2015 2020
-2
-1
0
1
2
3
4
1985 1990 1995 2000 2005 2010 2015 2020
-6
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-4
-2
0
2
4
6
1985 1990 1995 2000 2005 2010 2015 2020
-3
-2
-1
0
1
2
1985 1990 1995 2000 2005 2010 2015 2020
-3
-2
-1
0
1
2
3
1985 1990 1995 2000 2005 2010 2015 2020
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-3
-2
-1
0
1
2
3
1985 1990 1995 2000 2005 2010 2015 2020
23
CFIN_R_Q CFIN_FR CONJCONF12_M
CONJCONF20_M
COSOCEFF_Q EMMPNAR_M IMAVOITPART
P_IMMOLU_Q
PIB_R_Q U_M YIELD_CURVE
Figure 9: Data used for real private household consumption
-.02
.00
.02
.04
.06
.08
.10
94 96 98 00 02 04 06 08 10 12 14 16 18
The selection of the variables is based on the same methodology as already used
in the previous section. The variables capture consumption related elements along
various dimensions. There are contemporaneous wage and income related elements
taken into account by GDP and social contributions; there are in turn also ele-
ments capturing the expectation of wage and income paths which is accounted for
by employment, unemployment and the assessment of the economic situation in 12
months. The selected variables also capture elements related to wealth effects by
means of house prices. The two variables for car registration - one soft indicator and
one hard indicator - capture consumption relevant elements related to durable goods
consumption. Finally the yield curve captures uncertainty and hence allows for the
possibility of consumption being adjusted due to precautionary savings motives. In
general, the variables are, however, closely interrelated, implying that a particular
variable could in principle capture consumption related elements across several of the
aforementioned dimensions.
Table 4 lists all the parameter estimates of the dynamic factor model for real
private household consumption. As can be seen there is evidence for a high first-order
autocorrelation of the revision term εt. The coefficient is 0.85 and highly significant
- again one has to take into account here that the estimation of the equation of the
24
CFIN_R_Q CFIN_FR
Figure 10: Factor and Revision Term
-30
-20
-10
0
10
20
1985 1990 1995 2000 2005 2010 2015 2020
S1 – 2 RMSE
Smoothed Factor
-3
-2
-1
0
1
2
3
1985 1990 1995 2000 2005 2010 2015 2020
RVER_0 – 2 RMSE
Smoothed Revision Error
revision term is based on a few observations only, since the first estimate for the
growth rate of real private household consumption starts only in the year 2009.
The autoregressive coefficients for the factor equation imply that both parameters
are significantly different from zero. The two estimated parameters imply that the
stochastic process is governed by complex roots. As before, including a third lag
would result in a parameter estimate for the coefficient of the third lag which is
rather small and statistically not different from zero.
Table 4 also provides the weights of the variable, measured by means of each
variables’ factor loading relative to the one of consumption. As can be seen, the
measure for house prices has the highest weight suggesting that wealth effects might
be an important element in forecasting household consumption in Luxembourg. The
weights of the other variables are still sizable, though, noticeably lower than the one
for house prices.
Figure 10 shows the temporal trajectory of the extracted latent factor. As can
be seen, the factor looks rather different from the one of the factor model for goods
exports. The global financial crisis shows up in the factor only by a modest swing;
much larger swings can be observed in the run up to the financial crisis. This how-
ever, should not be interpreted as a misspecification of the model. In fact Mody et al.
25
Table 4: Estimated coefficients
Coeff. St. Dev. t-Value Prob. rel. Weight
factor loadingsconsumption γi 0.18 0.04 4.07 0.00 1.00employment γi 0.07 0.03 2.32 0.02 0.38car purchase (CSI) γi 0.08 0.04 2.06 0.04 0.43house prices γi 0.13 0.08 1.68 0.09 0.72yield curve γi 0.05 0.03 1.79 0.07 0.26economic situation (CSI) γi 0.07 0.05 1.47 0.14 0.38social contributions γi 0.05 0.03 1.65 0.10 0.30GDP γi 0.07 0.04 1.95 0.05 0.40unemployment γi -0.04 0.02 -2.34 0.02 0.23car registration γi 0.05 0.02 2.75 0.01 0.27autoregressive coefficientsconsumption ϕi,1 -1.86 0.03 -70.19 0.00
ϕi,2 -0.94 0.02 -38.66 0.00employment ϕi,1 0.91 0.03 31.53 0.00
ϕi,2 0.01 0.03 0.37 0.71car purchase (CSI) ϕi,1 0.94 0.18 5.24 0.00
ϕi,2 -0.05 0.18 -0.25 0.80house prices ϕi,1 0.37 0.49 0.75 0.45
ϕi,2 0.44 0.50 0.87 0.38yield curve ϕi,1 1.40 0.05 27.12 0.00
ϕi,2 -0.43 0.05 -8.50 0.00economic situation (CSI) ϕi,1 0.81 0.09 8.93 0.00
ϕi,2 0.11 0.09 1.32 0.19social contributions ϕi,1 1.06 0.39 2.70 0.01
ϕi,2 -0.14 0.37 -0.37 0.71GDP ϕi,1 0.99 0.48 2.08 0.04
ϕi,2 -0.08 0.46 -0.18 0.86unemployment ϕi,1 0.94 0.05 18.55 0.00
ϕi,2 0.02 0.05 0.32 0.75car registration ϕi,1 0.22 0.06 3.86 0.00
ϕi,2 0.11 0.06 1.99 0.05factor ϕi,f 1.57 0.13 11.73 0.00
ϕi,f -0.63 0.13 -5.03 0.00revision termAR-coefficient ϕε 0.85 0.07 11.69 0.00
(2012) document for many countries that the global financial crisis had only a com-
paratively small effect on real private household consumption. The second subplot
in the same figure shows the estimated revision term. As can be seen, the revisions
show a sizable degree of autocorrelation.
Finally, Figure 11 gives some insights to the overall model fit. The residuals
have been checked for normality and autocorrelation; the corresponding tests do not
provide evidence for non-normality or autocorrelation.
The stability of the estimated parameters and the overall model fit are assessed
using the evidence presented in Figures 12 and 13. The red lines in Figure 13 refer to
the band (min vs. max) of the variation of the point estimate of each factor loading
from an estimation where the sample size is extended recursively; the boxplots in turn
26
Figure 11: Model fit
-4
-2
0
2
4
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-4
-2
0
2
4
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-8
-4
0
4
8
-8
-4
0
4
1985 1990 1995 2000 2005 2010 2015 2020
-8
-4
0
4
8
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-4
-2
0
2
4
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-4
0
4
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-4
-2
0
2
4
-4
-2
0
2
1985 1990 1995 2000 2005 2010 2015 2020
-4
0
4
8
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
Std. ResidualsActualPredicted
-4
-2
0
2
4
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-5.0
-2.5
0.0
2.5
5.0
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015 2020
-4
0
4
8
-4
0
4
8
1985 1990 1995 2000 2005 2010 2015 2020
27
CFIN_R_Q CFIN_FR EMMPNAR_M CONJCONF20_M
P_IMMOLU_Q YIELD_CURVE CONJCONF12_M COSOCEFF_Q
PIB_R_Q U_M IMAVOITPART
Figure 12: Parameter Stability - Dynamic Correlation Coefficient
2004 2006 2008 2010 2012 2014 2016 20180.8
0.82
0.84
0.86
0.88
0.9
0.92
0.94
0.96
0.98
1
Figure 13: Parameter Stability - Factor loadings
cons empl car purch (CSI) hou pri y curve econ sit (CSI) soc contr GDP unempl car regist
-0.1
0
0.1
0.2
0.3
0.4Factor loadings: γ
show the point estimates and a one-standard and two-standard deviation confidence
interval based on an estimation using the whole sample. As can be seen, the variation
28
of the factor loadings from an estimation based on various sub-samples is rather small.
In general it lies within the one-standard deviation interval from the estimation using
the whole sample. Against this background, the evidence points towards rather
stable parameter estimates over time. This argument is supported additionally by
the evidence presented in Figure 12; as can be seen, the temporal variation of the
overall model fit is rather small. The correlation coefficient of the extracted factor
with the final measure for real private household consumption varies in a close band
ranging from 0.86 - 0.88.
Figure 14: Forecast trajectory for annual growth rates
Oct 2007 Apr 2008 Oct 2008 Apr 2009 Oct 2009 Apr 2010 Oct 20100.5
1
1.5
2
2.5
3
Gro
wth
rat
e [in
%]
200820092010
4.1 Forecast evaluation
The forecast evaluation of the dynamic factor model for real private household con-
sumption is carried out in the same way as for the export model. Before discussing
some particular forecast related statistical measures, it is convenient to look on how
the dynamic factor model adjusts its forecasts for consumption as a reaction to new
information. For this, Figure 14 gives some insights. The figure replicates what was
done for exports in Figure 7, however, now for consumption growth rates. The figure
plots the projections for the annual consumption growth rate as well as the final pub-
29
Table 5: Forecast evaluation for consumption - annual growth rates
full sample sub-sample
MSE DFM 0.837 0.355ARMA(2,2) 2.661 1.054RW 1.493 0.579Expert — 0.669
Relative performance of DFM toARMA(2,2) 0.315 0.337RW 0.561 0.613Expert — 0.531
MAE DFM 0.649 0.453ARMA(2,2) 1.277 0.853RW 0.939 0.663Expert 0.599
Relative performance of DFM toARMA(2,2) 0.509 0.531RW 0.692 0.682Expert — 0.756
lished values (horizontal lines). The forecast trajectory for the 2008 growth rate does
not show any strong adjustment - this is in line with the indicators in the model. As
concerns the 2009 growth rate, the picture of the indicators used in the model turns
fairly negative in September/October 2008; the model takes this into account by ad-
justing the annual growth forecast for the year 2009 - the growth forecast of 2008 is
left rather unaffected by the steady deterioration of the indicators. From November
2008 onwards, the growth forecast for the annual growth rate of consumption in the
year 2009 was already fairly close to the realized rate. The same applies more or less
also for the forecast for the year 2010. All in all, Figure 14 offers an optimistic picture
of the performance of the dynamic factor model in producing forecasts. However, yet
this comprises only a rough assessment; in what follows several statistical concepts
will be used again for a more thorough evaluation, following broadly Section 3.1.
Table 5 confronts the dynamic factor model’s forecasting performance with alter-
native models: an ARMA(2,2) model and a Random-Walk (RW). As can be seen in
the table, the mean-squared error and the mean-absolute error of the dynamic factor
model are noticeably smaller than those of the two alternative models. The same
applies also once the forecast of an expert-panel is taken into account - however care
has to be taken here as the number of observations in this case is fairly small. Again,
the Mariano-Diebold test has not been applied here since the number of observations
available to implement this test is fairly small, rendering questionable the test results.
Table 6 carries out the same exercise using the y-o-y growth rate for each quarter
30
Table 6: Forecast evaluation for consumption - quarterly growth rates
1 quarter 2 quarters 4 quarters
MSE DFM 0.4827 0.6836 0.9262ARMA(2,2) 0.8817 1.0051 1.0338RW 1.3715 1.7970 2.2774
Relative performance of DFM toARMA(2,2) 0.5475 0.6802 0.8960RW 0.3520 0.3804 0.4067
MAE DFM 0.5379 0.6528 0.7185ARMA(2,2) 0.7054 0.7334 0.7526RW 0.8932 1.0013 1.1391
Relative performance of DFM toARMA(2,2) 0.7625 0.8902 0.9547RW 0.6022 0.6520 0.6308
Equal predictive accuracy test (p-values)ARMA(2,2) 0.0069 0.0990 0.1642RW 0.0000 0.0001 0.0000
starting in the year 2004 until 2016. As can be seen, the dynamic factor model
again performs better than the two alternative models. Since now there is a sufficient
number of observations, the Diebold and Mariano (1995) test can be applied. The test
results provide evidence for a significant difference of the forecasts of the ARMA(2,2)
and RW model in relation to the dynamic factor model. The dynamic factor model’s
forecasts outperform those of the other two models especially for the one and two
quarter ahead predictions.
5 Specifying a DFM for the labour market
This section develops a dynamic factor model (DFM) for the Luxembourgish labour
market. Key variables in this respect are (i) employment and (ii) unemployment.
Yet there is one additional variable that plays an important role in Luxembourg. A
large part of the employees commute every day. As depicted in Figure 15, the share
of commuters has risen steadily over the last decades. From the year 2009 onwards,
this share stabilized and is currently at around 40 % of total employment. The
flattening out, however, will pose a particular challenge for the temporal stability
of the estimated parameters; more on that below. The high share of commuters
extends the domestic workforce far beyond domestic borders and hence demonstrates
the attractiveness of the Luxembourgish labour market among foreigners.
The dynamic factor model specified here for the labour market focuses in partic-
31
Figure 15: Share of commuters in total employment
.08
.12
.16
.20
.24
.28
.32
.36
.40
.44
1980 1985 1990 1995 2000 2005 2010 2015
ular on the following three variables: employment, cross border workers and unem-
ployment. Since these three variables are highly interrelated, only one model will be
developed, involving all of the previously mentioned three key variables. The employ-
ment variable used here includes commuters; hence it is distinct to the one used in
Section 4 where commuters were excluded; Other indicators used will be selected in
line with the procedure applied in the previous two sections. The final factor model
will be based on the equations system (20)-(26); though there are a few noteworthy
differences to the models specified in the previous sections. In contrast to consump-
tion and exports, the key three variables in this section are characterized by two
distinct characteristics: (i) they are available at a monthly frequency and (ii) they
are not revised. The former feature is tacitly solved by the Kalman filter, and hence
needs no further consideration; the latter one renders obsolete equation (21) and in
turn also equation (22). Hence the model is characterized only by the variables in xht
32
Figure 16: Data used
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015
-2
-1
0
1
2
3
1985 1990 1995 2000 2005 2010 2015
-3
-2
-1
0
1
2
3
1985 1990 1995 2000 2005 2010 2015
-3
-2
-1
0
1
2
3
1985 1990 1995 2000 2005 2010 2015
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015
-3
-2
-1
0
1
2
3
1985 1990 1995 2000 2005 2010 2015
-4
-3
-2
-1
0
1
2
1985 1990 1995 2000 2005 2010 2015
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015
33
EMP_M U_M FRIN_M
CONJIND_SA_M
CONJEMP_M EAMPRODIND PIB_R_Q
PIBEA_R_QNC2LUX OENSFLUX_M
Figure 17: Model Fit
-8
-4
0
4
8
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015
-5.0
-2.5
0.0
2.5
5.0
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015
-4
0
4
-2
-1
0
1
2
3
1985 1990 1995 2000 2005 2010 2015
-4
-2
0
2
4
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015
-4
-2
0
2
4
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015
-4
-2
0
2
4
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015
-4
-2
0
2
4
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015
Std. ResidualsActualPredicted
-4
-2
0
2
4
-4
-2
0
2
1985 1990 1995 2000 2005 2010 2015
-4
-2
0
2
4
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015
-4
-2
0
2
4
-4
-2
0
2
4
1985 1990 1995 2000 2005 2010 2015
34
EMP_M U_M FRIN_M CONJIND_SA_M
CONJEMP_M EAMPRODIND_M PIB_R_Q PIBEA_R_Q
NC2LUX_G UFLUX_M_G
Figure 18: Factors
-12
-8
-4
0
4
8
1985 1990 1995 2000 2005 2010 2015
Smoothed Factor_1st
-60
-40
-20
0
20
40
60
1985 1990 1995 2000 2005 2010 2015
Smoothed Factor_2nd
and xst , which are specified as follows:
xht =
⎡⎢⎢⎢⎢⎢⎢⎢⎢⎢⎢⎢⎢⎢⎢⎣
employmentyoy,Mt
unemploymentyoy,Mt
commutersyoy,Mt
vacanciesyoy,Mt
industrial productionyoy,Mt
GDPyoy,Qt
EA GDPyoy,Qt
unemployment gapyoy,Mt
⎤⎥⎥⎥⎥⎥⎥⎥⎥⎥⎥⎥⎥⎥⎥⎦
(33)
xst =
[ESI (industry)MtESI (employment)Mt
](34)
where as before yoy indicates that the particular variable enters in terms of year-
on-year growth rates. The two soft indicators are taken from the set of European
Sentiment Indicators (ESI) and used in levels. M and Q indicate if the corresponding
variable is observed at a monthly (M) or quarterly (Q) frequency. The unemploy-
ment gap-variable is constructed as a weighted average of unemployment in Belgium,
35
Table 7: Estimated coefficients
Coeff Std t-Val Prob Coeff Std t-Val Prob rel. Weight
factor loadingsemployment γi,1 0.13 0.02 5.36 0.00 γi,2 0.05 0.01 4.11 0.00 1.00unempl. γi,1 0.02 0.02 0.59 0.63 γi,2 -0.05 0.01 -4.17 0.00 0.86commuters γi,1 0.05 0.01 5.44 0.00 γi,2 0.04 0.00 4.21 0.00 0.64esi (industry) γi,1 0.03 0.03 0.30 0.76 γi,2 0.03 0.01 2.61 0.01 0.22esi (empl.) γi,1 -0.04 0.05 -0.04 0.86 γi,2 0.05 0.02 3.18 0.00 0.75indust. prod. γi,1 0.32 0.05 6.94 0.00 γi,2 0.01 0.01 0.80 0.43 0.29GDP γi,1 0.25 0.06 4.02 0.00 γi,2 0.03 0.01 2.25 0.02 0.41EA GDP γi,1 0.29 0.05 5.78 0.00 γi,2 0.03 0.01 3.58 0.00 0.54unempl. gap γi,1 0.26 0.06 5.06 0.00 γi,2 -0.04 0.02 -3.61 0.00 0.69vacancies γi,1 0.29 0.06 5.21 0.00 γi,2 -0.01 0.01 -0.50 0.62 0.23
autoregressive coefficients
employment ϕi,1 0.89 0.05 19.34 0.00 ϕi,2 0.03 0.05 0.72 0.47unempl. ϕi,1 0.85 0.06 14.64 0.00 ϕi,2 0.09 0.06 1.55 0.12commuters ϕi,1 0.94 0.08 11.54 0.00 ϕi,2 0.06 0.08 0.68 0.50esi (industry) ϕi,1 0.83 0.06 14.54 0.00 ϕi,2 0.10 0.06 1.73 0.08esi (empl.) ϕi,1 0.82 0.08 9.99 0.00 ϕi,2 0.05 0.08 0.66 0.51indust. prod. ϕi,1 0.37 0.08 4.69 0.00 ϕi,2 0.17 0.09 1.95 0.05GDP ϕi,1 -0.58 0.12 -4.98 0.00 ϕi,2 -0.75 0.07 -10.99 0.00EA GDP ϕi,1 0.18 0.22 0.82 0.41 ϕi,2 0.53 0.29 1.81 0.07unempl. gap ϕi,1 -0.41 0.21 -1.95 0.05 ϕi,2 -0.20 0.16 -1.25 0.21vacancies ϕi,1 0.31 0.07 4.43 0.00 ϕi,2 0.17 0.08 2.19 0.031st factor ϕf1,1 0.55 0.11 5.09 0.00 ϕf1,2 0.36 0.11 3.40 0.00
2nd factor ϕf2,1 1.87 0.05 40.02 0.00 ϕf2,2 -0.89 0.05 -19.15 0.00
France and Germany relative to unemployment in Luxembourg. This variable acts
as an attractor variable: relatively low unemployment in Luxembourg increases the
country’s attractiveness among foreigners as a work destination. Hence an increase
in the gap should ultimately trigger and increase in commuters and employment;
concerning unemployment there is no specific effect to be expected.
The remaining variables are rather standard - the two GDP series capture domes-
tic and foreign aggregate demand conditions; they act as contemporaneous indicators.
The remaining variables - vacancies, industrial production and the two soft indicators
from the ESI - are in turn purely leading indicators.
As before, all the data in xst and xh
t are standardized before being used in the
model. The standardized data are shown in Figure 16. The Maximum likelihood
estimates of the factor loadings are depicted in Table 7 and Figure 17 provides a
graphical overview of the model fit. As concerns the particular specification of the
dynamic factor model, each lag polynomial is again specified with two lags; however,
in contrast to the previous model for goods exports and consumption, the model
here considers two factors. Hence ft is two-dimensional rendering equation (24) to a
36
multivariate auto-regressive equation. This in turn implies that νft and σ2
f are now a
2× 1 vector and a 2× 2 matrix respectively. Identification of the two factors is based
on the following two assumptions: (i) φf (L) is a diagonal matrix polynomial, and (ii)
σ2f = I2 (I2 refers to the identity matrix of order two). These two assumptions imply
that equation (24) features two independent auto-regressive processes of order two.
Figure 19: Parameter Stability - Dynamic Correlation Coefficient
2008 2010 2012 2014 2016 20180
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1Correlation with 1st Factor
2008 2010 2012 2014 2016 20180
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1Correlation with 2nd Factor
EmploymentCommutersUnemployment
As concerns the estimated coefficients outlined in Table 7 the auto-regressive
coefficients for the factor equation imply that the parameters are both significantly
different from zero. The two estimated parameters for each factor imply that the
stochastic process is governed by complex roots. Including a third lag would results
in a parameter estimate for the third lag which is rather small and statistically not
different from zero.
The two factors are shown in Figure 18 and Figure 19 displays the correlation of
the two factors with the three variables of interest (employment, unemployment and
commuters). As can be seen, the second factor has a high correlation with employ-
ment and unemployment, however, a comparatively low correlation with commuters.
37
The first factor has a negligibly low correlation with employment and unemployment
while a decent correlation with commuters. Considering the two factors in a linear
regression of each of the three key variables of interest implies that, the two fac-
tors explain 73%, 79% and 62% of the variation in employment, unemployment and
commuters respectively.
The Maximum likelihood estimates of the factor loadings reflect the degree to
which the variation in each observed variable can be explained by the the two latent
series. Each series has at least one significant factor loading and the corresponding
sign indicates that, with the exception of unemployment, all series are procyclical.
The unemployment-gap variable depicts a slightly different pattern as concerns its
relation with the factors. It has a positive factor loading with the first factor and a
negative one with the second one. This accounts for the circumstance that employ-
ment in Luxembourg and the amount of commuters to Luxembourg can both increase
or decrease once unemployment is high in the neighbour countries. For instance, high
unemployment in neighbour countries might induce workers in these countries to start
working in Luxembourg. This might in turn increase Luxembourgish employment,
and also raises the volume of commuters. On the other hand, high unemployment in
neighbour countries might in turn also be a signal of an economic downswing in these
countries. Since the Luxembourgish economy is highly interrelated with neighbour-
ing economies, a slack abroad is quickly imported worsening domestic labour market
conditions. In this respect the estimates of the dynamic factor model allow for both
of these situations to occur.
The selection of the variables has been applied to the whole sample. However,
structural changes in the underlying economic dynamics could influence the choice
of indicators over time or lead to time-variation in the estimated parameters. For
instance, it might be possible that the variable selection could motivate a different
set of indicators once applied to a time span ending before the global financial crisis.
Against this background, the model is analyzed concerning its stability over time.
This is assessed based on the factor loadings γh and γs; using a measure for the
model fit as done by means of the correlation coefficients within a time-span starting
in January 2008 and ending in December 2016. In order to introduce time-variation
in the parameters, a rolling regression approach is applied: the model is estimated
based on a sample starting in 01:1985 until 01:2008 and then the sample-window is
recursively extend adding an additional month until 12:2016. For each step the model
38
Figure 20: Parameter Stability - Factor loadings
empl unempl commu esi ind esi empl IP GDP EA GDP unempl gap vacan
-0.2
-0.1
0
0.1
0.2
0.3
0.4
0.5Factor loadings: γ
1
empl unempl commu esi ind esi empl IP GDP EA GDP unempl gap vacan-0.15
-0.1
-0.05
0
0.05
0.1
Factor loadings: γ2
is fully estimated. Figure 19 displays the time variation by means of the dynamic
correlation coefficient and Figure 20 shows boxplots for the factor loadings to assess
the overall change in the model fit over time.
The figures highlight that time variation is present. On the other, however, it
is fairly small for most parameters. The variation of the estimated parameters for
the factor loadings (red lines) is smaller than the estimate of the in-sample standard
deviation (black boxplots) depicted in Table 7. All the time-variation of the param-
eters is still within the one-standard-deviation confidence interval of the full-sample
estimates. Hence, the time-variation is of negligible statistical importance. A similar
reasoning applies when considering the temporal trajectory of the model fit - the
correlation coefficient between the model and the final value for the three variables
of interest shows a limited degree of variation over time and there appears to be no
systematic pattern of time variation in the model fit.
39
5.1 Forecast evaluation
Figure 21: Forecast trajectory for annual growth rates
Oct 2007 Apr 2008 Oct 2008 Apr 2009 Oct 2009 Apr 2010 Oct 20100
2
4
6
Gro
wth
rate
[in
%]
Employment
Oct 2007 Apr 2008 Oct 2008 Apr 2009 Oct 2009 Apr 2010 Oct 20100
2
4
6
8
10
Gro
wth
rate
[in
%]
Commuters
Oct 2007 Apr 2008 Oct 2008 Apr 2009 Oct 2009 Apr 2010 Oct 2010-10
0
10
20
30
40
50
Gro
wth
rate
[in
%]
Unemployment
200820092010
The forecast evaluation of the dynamic factor model for the three labour market
variables is carried out in the same way as for the previous models. Before discussing
some particular forecast related statistical measures, it is convenient to look on how
40
the dynamic factor model adjusts its forecasts as a reaction to new information. For
this, Figure 21 shows the forecast trajectory for the annual growth rates for the
years surrounding the global financial crisis (2008-2010) within the pseudo real-time
analysis. The figure plots the annual forecast for each year made at different points in
time for all of the three variables of interest. Considering for instance the employment
projections first, the first black square is the forecast of the growth rate of employment
in the year 2008 made in November 2007. The next square corresponds to a forecast
for employment growth rate for the year 2008 made in March 2008. In each subplot,
the solid lines refer to the observed annual growth rate for the three years. As can
be seen, the model projections converge to the observed values rather quickly. As of
the third quarter, the model’s projections are in most cases already indistinguishable
from the observed value of the same year. This also applies to the forecast trajectory
for commuters and unemployment. All in all, Figure 21 offers an optimistic picture
of the performance of the dynamic factor model in producing forecasts. However, yet
this comprises only a rough assessment; in what follows several statistical concepts
will be used again for a more thorough evaluation, following broadly the previous
Sections.
Table 8: Forecast evaluation - annual growth rates
Employment Commuters UnemploymentMSE DFM 0.0000 0.0001 0.0056
ARMA(2,2) 0.0001 0.0003 0.0079RW 0.0156 0.0014 0.0140Expert 0.0001 0.0002 0.0042
Relative performance of DFM toARMA(2,2) 0.5854 0.5689 0.7137RW 0.0031 0.1052 0.4021Expert 0.6145 0.7332 1.3510
MAE DFM 0.0054 0.0081 0.0559ARMA(2,2) 0.0068 0.0116 0.0716RW 0.0992 0.0310 0.1004Expert 0.0057 0.0106 0.0582
Relative performance of DFM toARMA(2,2) 0.8034 0.6983 0.7807RW 0.0548 0.2619 0.5570Expert 0.9598 0.7648 0.9604
In order to better judge the precision of the dynamic factor model’s forecasts,
several alternative model forecasts are utilized. The forecasts of each model are
evaluated by means of mean-squared error and mean-absolute error statistics. The
41
alternative models comprise (i) a random walk model, (ii) and ARMA(2,2) model and
(iii) the forecast of an expert -panel (again only as concerns the annual forecasts).
Tables 9 and 8 provide the results.
Table 9: Forecast evaluation - monthly frequency
Forecast horizon: 1 month 3 months 6 months 12 months
Employment MSE DFM 0.0438 0.0929 0.2670 0.8106ARMA(2,2) 0.0492 0.1151 0.3256 0.8303RW 0.0574 0.1673 0.4970 1.3654
Relative performance of DFM toARMA(2,2) 0.8912 0.8073 0.8201 0.9762RW 0.7643 0.5553 0.5372 0.5937
MAE DFM 0.1435 0.2170 0.3580 0.6854ARMA(2,2) 0.1592 0.2493 0.4136 0.6848RW 0.1768 0.3033 0.5043 0.8647
Relative performance of DFM toARMA(2,2) 0.9010 0.8705 0.8656 1.0009RW 0.8117 0.7155 0.7099 0.7927
Equal predictive accuracy test (p-values)ARMA(2,2) 0.7379 0.0160 0.0120 0.2340RW 0.0110 0.0000 0.0000 0.0000
Commuters MSE DFM 0.0074 0.0177 0.0529 0.1710ARMA(2.2) 0.0079 0.0202 0.0634 0.2082RW 0.0093 0.0297 0.0912 0.2654
Relative performance of DFM toARMA(2,2) 0.9328 0.8774 0.8355 0.8211RW 0.7882 0.5958 0.5806 0.6443
MAE DFM 0.0606 0.0938 0.1564 0.2984ARMA(2,2) 0.0668 0.1071 0.1834 0.3340RW 0.0731 0.1291 0.2199 0.3796
Relative performance of DFM toARMA(2,2) 0.9071 0.8758 0.8529 0.8933RW 0.8288 0.7267 0.7114 0.7860
Equal predictive accuracy test (p-values)ARMA(2,2) 0.3380 0.0726 0.0265 0.0010RW 0.0026 0.0000 0.0000 0.0000
Unemployment MSE DFM 0.0297 0.1007 0.2661 0.6635ARMA(2,2) 0.0342 0.1063 0.2865 0.6701RW 0.0398 0.1547 0.4552 1.2071
Relative performance of DFM toARMA(2,2) 0.8669 0.9479 0.9290 0.9901RW 0.7460 0.6512 0.5846 0.5496
MAE DFM 0.1248 0.2257 0.3660 0.6369ARMA(2,2) 0.1350 0.2449 0.4159 0.6412RW 0.1414 0.2871 0.4940 0.8271
Relative performance of DFM toARMA(2,2) 0.9248 0.9217 0.8799 0.9933RW 0.8829 0.7864 0.7408 0.7700
Equal predictive accuracy test (p-values)ARMA(2,2) 0.0743 0.0394 0.0603 0.1788RW 0.0090 0.0000 0.0000 0.0000
Table 8 compares the dynamic factor model’s forecasting performance with al-
42
ternative models: an ARMA(2,2) model and a Random-Walk (RW). As can be seen
in the table, the mean-squared error and the mean-absolute error of the dynamic
factor model are noticeably smaller than those of the alternative models. The same
applies also to the forecast of the expert-panel; however care has to be taken here
as the number of observations in this case is fairly small. Note that in the case
of unemployment the expert-panel’s forecast has a noticeably smaller mean-squared
error than the dynamic factor model; the contrary applies to the mean absolute er-
ror, implying that outliers from exceptional periods play a role (the years 2008 and
2009 can be identified as the two most important ones for these outliers). Again,
the Diebold and Mariano (1995) test has not been applied here since the number of
observations available to implement this test is fairly small, rendering questionable
the test results.
In order to assess the statistical evidence concerning the difference of the forecasts
of the different models - dynamic factors model, random-walk, and ARMA(2,2) - the
forecast evaluation has been run additionally on a quarterly basis. This generates
enough observation rendering feasible the Diebold and Mariano (1995) test. Table
9 shows the results for the three key variables - employment, commuters and un-
employment - for various horizons. As can be seen there is statistically significant
evidence that the dynamic factor model performs better in forecasting employment,
commuters and unemployment at the horizon three months and six months than the
alternative models. The evidence for the one month and twelve months horizon is
mixed.
5.2 An extension: MF-FAVAR model
The dynamic factor model specified for the labour market variables considers three
variables modeled jointly within the same model. An extension that is capable of
generating slightly more accurate forecasts is considered here by means of a Mixed
Frequency-Factor Augmented Vector Autoregressive (MF-FAVAR) model. In order
to motivate the model and show its performance, it is only applied to employment.
In principle it could also be extended involving yet further variables, however, care
has to be taken here as one might be quickly confronted with problems related to the
over- dimensionality of the model. The model reads as follows:
43
MF-FAVAR model:
• System of static equations(xht
xst
)=
(γh · ftγs · ft
)+
(uh
t
ust
)(35)
• System of dynamic equations
φu(L) · ut,q = νqt with νq
t ∼ NID(0, σ2
q
)(36)
φf (L) ·[
ft
empt
]= νf
t (37)
Φu(L)
(uh
t
ust
)= νt (38)(
νft
νt
)∼ NID
(0,
[σ2f 0
0 Σν
])(39)
where empt refers to the employment series used before. The vectors of observed
variables are defined as follows:
xht =
⎡⎢⎢⎢⎢⎢⎢⎢⎢⎢⎢⎢⎣
unemploymentyoy,Mt
commutersyoy,Mt
vacanciesyoy,Mt
industrial productionyoy,Mt
GDPyoy,Qt
EA GDPyoy,Qt
unemployment gapyoy,Mt
⎤⎥⎥⎥⎥⎥⎥⎥⎥⎥⎥⎥⎦
(40)
xst =
[ESI (industry)MtESI (employment)Mt
](41)
The model is different to the extent that the latent factor ft is extracted excluding
employment. The relation between the factor and employment is than re-established
by means of equation (37) - this is the FAVAR equation as it relates the latent factor
ft directly to the observed variable, in this case, employment. Again, a lag order of
44
two has been used for all lag-polynomials and φ(L), σ2f and νf
t are now 2×2 matrices
and a vector respectively. The model is identified assuming that the (1,1) element
in the variance-covariance matrix σ2f is unity. The remaining elements of σ2
f are
estimated. The (2,1) - and due to symmetry also the (1,2) element - of σ2f captures
the contemporaneous relation between the latent factor ft and employment where
the lag-polynomial φ(L) establishes the dynamic interaction of these two variables.
Table 10: MF-FAVAR model: Forecast evaluation - monthly frequency
Forecast horizon: 1 month 3 months 6 months 12 months
Employment MSE MF-FAVAR 0.0401 0.0850 0.2371 0.8020DFM 0.0438 0.0929 0.2670 0.8106
Relative performance of DFM toMF-FAVAR 1.092 1.093 1.127 1.010
Equal predictive accuracy test (p-values)MF-FAVAR 0.7571 0.1201 0.1101 0.8502
The model is estimated making use of the same estimation procedure as before.
For brevity, only the statistics for the forecast evaluation are shown and are depicted
in Table 10. The mean-squared error statistics of the MF-FAVAR are smaller than
those of the dynamic factor model for any horizon considered. The statistical evidence
for a better forecast precision is limited - the p-values for the three months and six
months horizon are slightly above the 10% significance level.
All in all this is to show in how far small variations of the specification of the
dynamic factor model motivated in Section 2 can quickly alter the model to another
popular group of models commonly used. Even though the forecast precision of the
MF-FAVAR is slightly better, clear preference is given to the dynamic factor model
of Section 5 here for at least two reasons: (i) the dynamic factor model is more
parsimonious than the MF-FAVAR which allows for a significantly faster estimation
of the model; and (ii) the variable selection process might turn out more challenging
in case of the MF-FAVAR model since one has to decide not only which variables to
include in xst and xh
t , but also as to which variables to include in the FAVAR equation.
The latter adds an additional search dimension within the variable selection process
rendering the specification of the model more tedious.
45
6 Conclusion and outlook
The dynamic factor models developed and presented in the previous sections are based
on a small set of indicators. The selection of them takes into account specificities of
the Luxembourgish economy. The models showed a decent forecasting performance
on a pseudo real-time basis.
The three dynamic factor models are all independent to each other; merging
them into one large-scale model might quickly render the estimation intractable due
to problems related to the dimensionality of the model. However, there is another
possibility to merge them. The idea is not to merge the models, yet, only their indi-
vidual forecasts. Starting for instance with the model for goods exports, it produces
forecasts not only for goods exports, but also for the other variables, among oth-
ers, industrial production and Euro-zone GDP growth. As these two variables show
up also in the dynamic factor model for the labour market, the forecasts from the
dynamic factor model for exports could then be used in the dynamic factor model
for the labour market to produce conditional forecasts of the remaining variables.
In other words, the forecasts from the dynamic factor model for exports enter the
dynamic factor model for the labour market as if they were observed. The forecasts
of certain variables of the dynamic factor model for the labour market can then be
used in the dynamic factor model for consumption. This applies in particular to GDP
growth, and partly to employment. By this one can construct a consistent forecast
involving all of the three models, where in turn each dynamic factor model is used
individually, however, they are merged by making use of the concept of conditional
forecasts. Continuing in this fashion and constructing similar models for yet other
important macroeconomic variables would allow for a consistent projection of yet
other variables despite using independent models in each case.
46
References
[1] Banbura, Marta, Domenico Giannone, Michele Modugno and Lucrezia Reichlin
(2013) Now-Casting and the Real-Time Data Flow, Handbook of Economic Fore-
casting.
[2] Bai, Jushan and Serena Ng (2002) Determining the Number of Factors in Ap-
proximate Factor Models. Econometrica, 70, pages 191-221.
[3] Bai, Jushan and Serena Ng (2008) Forecasting economic time series using targeted
predictors, Journal of Econometrics, Elsevier, vol. 146(2), pages 304-317.
[4] Breitung, Joerg and Sandra Eickmeier (2006) Dynamic factor models, AStA Ad-
vances in Statistical Analysis, Springer;German Statistical Society, vol. 90(1),
pages 27-42.
[5] Camacho, Maximo and Gabriel Perez-Quiros (2010) Introducing the euro-sting:
Short-term indicator of euro area growth, Journal of Applied Econometrics, 25(4),
pages 663-694.
[6] Diebold, Francis X. and Roberto S. Mariano (1995) Comparing predictive accu-
racy, Journal of Business and Economic Statistics 13(3), pages 253-263.
[7] Eickmeier, Sandra (2007) Business cycle transmission from the US to Germany–
A structural factor approach, European Economic Review, Elsevier, vol. 51(3),
pages 521-551.
[8] Forni, Mario and Lucrezia Reichlin (1998) Let’s Get Real: A Factor Analyti-
cal Approach to Disaggregated Business Cycle Dynamics, Review of Economic
Studies, Oxford University Press, vol. 65(3), pages 453-473.
[9] Forni, Mario, Marc Hallin, Marco Lippi and Lucrezia Reichlin (2000) The Gen-
eralized Dynamic-Factor Model: Identification And Estimation, The Review of
Economics and Statistics, MIT Press, vol. 82(4), pages 540-554.
[10] Forni, Mario, Marc Hallin, Marco Lippi and Lucrezia Reichlin (2004) Do financial
variables help forecasting inflation and real activity in the euro area?, Journal of
Monetary Economics, Elsevier, vol. 50(6), pages 1243-1255.
47
[11] Mariano, R. S., and Y. Murasawa (2003) A new coincident index of business
cycles based on monthly and quarterly series, Journal of Applied Econometrics,
18(4), pages 427-443.
[12] Mody, Ashoka, Franziska Ohnsorge and Damiano Sandri (2012) Precautionary
Savings in the Great Recession, IMF Economic Review, 60, issue 1, pages 114-138.
[13] Onatski, Alexei (2010) Determining the Number of Factors from Empirical Dis-
tribution of Eigenvalues, The Review of Economics and Statistics, 92, issue 4,
pages 1004-1016.
[14] Ritschl, Albrecht, Samad Sarferaz and Martin Uebele (2016) The U.S. Business
Cycle, 1867-2006: A Dynamic Factor Approach, The Review of Economics and
Statistics, MIT Press, vol. 98(1), pages 159-172.
[15] Ahn, Seung and Alex Horenstein (2013) Eigenvalue Ratio Test for the Number
of Factors, Econometrica, 81, issue 3, p. 1203-1227.
[16] Stark, T., and D. Croushore (2002) Forecasting with a real-time data set for
macroeconomists, Journal of Macroeconomics, 24(4), 507-531.
[17] Stock, James H. and Mark W. Watson (1992) A probability model of the coinci-
dent economic indicators, in Leading Economic Indicators: New Approaches and
Forecasting Records, Cambridge University Press, pages 12-63.
[18] Stock, James H. and Mark Watson (2002a) Forecasting Using Principal Com-
ponents From a Large Number of Predictors, Journal of the American Statistical
Association, 97, issue , pages 1167-1179.
[19] Stock, James H. and Mark Watson (2002b) Macroeconomic Forecasting Using
Diffusion Indexes, Journal of Business & Economic Statistics Vol. 20, Iss. 2, 2002.
48
A State-space representation of the dynamic fac-
tor model
This section provides details on how the dynamic factor model spelled out in equa-
tions (20)-(26) can be cast into a state-space representation; the following considers
in particular a version of the dynamic factor model with one factor and the revision
term εt is specified as a white noise process. To illustrate how the corresponding
matrices of the transition and measurement equation look like, the following defini-
tions are used: 0(i,j) is a matrix of zeros of dimension i × j, Ir is an identity matrix
of dimension r; xst is a ns dimensional vector of soft-indicators, xh
t is a nh dimen-
sional vector of hard-indicators. For simplicity, it is assumed here that the variables
are all observed at a monthly frequency and that there are no missing observations.
Relaxing this assumption would require to extend yt, wt, R and H by means of an
indicator function as discussed in Section 2.
Transition equation: This equation relates the observed variables to the factor
and can be expressed as:
yt = Hst +wt, wt ∼ NID (0, R) (42)
We use the following defintion of the vectors yt, st wt and variance co-variance matrix
R:
yt =[yft , y
1stt , (xs
t )′, (xh
t )′]′
(43)
wt = 0(nX ,1) (44)
R = 0(nX ,nX) (45)
with n = nh + ns, nX = n+ 2 and
st = [ft, ft−1, ut,q, ut−1,q, εt, ...
ust,1, u
st−1,1, ..., u
st,ns
, ust−1,ns
, ...
uht,1, u
ht−1,1, ..., u
ht,nh
, uht−1,nh
]′(46)
Given these definitions, the matrix H will be the following:
49
H =
⎛⎜⎜⎜⎜⎝
ft, ft−1︷ ︸︸ ︷η11 0(1,2)
ut, ut−1︷ ︸︸ ︷η12
εt︷︸︸︷0
ust︷ ︸︸ ︷
0(1,2·ns)
uht︷ ︸︸ ︷
0(1,2·nh)
η11 0 η12 1 0(1,2·ns) 0(1,2·nh)
η31 0 0(ns,1) 0(ns,1) η32 0(ns,2·nh)
η41 0(nh,1) 0(nh,1) 0(nh,1) 0(nh,2·ns) η42
⎞⎟⎟⎟⎟⎠ (47)
with ust =(ust,1, u
st−1,1, us
t,2, ust−1,2, ..., u
st,ns
, ust−1,ns
); uh
t =(uht,1, u
ht−1,1, uh
t,2, uht−1,2, ..., u
ht,ns
, ust−1,nh
),
and
η12 =(
1 0)
(48)
η32 = Ins ⊗(
1 0)
(49)
η42 = Inh⊗(
1 0)
(50)
and η11 = γq, η31 = γs, and η41 = γh.
State equation: Using the previous definitions of the vectors, the state equation
can be expressed as:
st = Fst−1 + vt, vt ∼ NID (0, Q) (51)
where Q is a matrix whose off-diagonal elements are all zero and its diagonal is given
by:
diag(Q) =[σ2f , 0, σ
2q , 0, σ
2ε , diag (Σν)
′ ⊗(
1 0)]′
(52)
where diag (Σν) =(σ2νs,1, ..., σ
2νs,ns
, σ2νh,1
, ..., σ2νh,nh
)′; and the error term vt is given by:
vt =[νft , 0, ν
qt , 0, εt, [ν
st,1, 0], ..., [ν
st,ns
, 0], [νht,1, 0], ..., [ν
ht,nh
, 0]]′
(53)
The matrix F becomes:
50
F =
⎛⎜⎜⎜⎜⎜⎜⎜⎜⎜⎜⎜⎜⎜⎜⎜⎜⎜⎜⎜⎜⎝
ft−1, , ft−2︷ ︸︸ ︷φf,1 φf,2
ut−1, ut−2︷ ︸︸ ︷0(1,2)
εt−1︷︸︸︷0
ust−1,1, u
st−2,1︷ ︸︸ ︷
0(1,2)
[· · · ]︷︸︸︷· · ·
ust−1,ns
, ust−2,ns︷ ︸︸ ︷
0(1,2)
uht−1,1, u
ht−2,1︷ ︸︸ ︷
0(1,2)
[· · · ]︷︸︸︷· · ·
uht−1,nh
, uht−2,nh︷ ︸︸ ︷
0(1,2)
1 0 0(1,2) 0 0(1,2) · · · 0(1,2) 0(1,2) · · · 0(1,2)
0(2,1) 0(2,1) fγq 0 0(2,2) · · · 0(2,2) 0(2,2) · · · 0(2,2)
0 0 0(1,5) 0 0(1,2) · · · 0(1,2) 0(1,2) · · · 0(1,2)
0(2,1) 0(2,1) 0(2,2) 0 f s1 0(2,·) 0(2,2) 0(2,2) · · · 0(2,2)
......
......
. . . . . . . . ....
. . ....
0(2,1) 0(2,1) 0(2,2) 0 0(2,2) · · · f sns
0(2,2) · · · 0(2,2)
0(2,1) 0(2,1) 0(2,2) 0 0(2,2) · · · 0(2,2) fh1 0(2,·) 0(2,2)
......
......
.... . .
.... . . . . .
...
0(2,1) 0(2,1) 0(2,2) 0 0(2,2) · · · 0(2,2) 0(2,2) · · · fhnh
⎞⎟⎟⎟⎟⎟⎟⎟⎟⎟⎟⎟⎟⎟⎟⎟⎟⎟⎟⎟⎟⎠
(54)
fγq =
(φq,1 φq,2
1 0
)(55)
f s1 =
(φs1,1 φs
1,2
1 0
)(56)
f sns
=
(φsns,1 φs
ns,2
1 0
)(57)
fh1 =
(φh1,1 φh
1,2
1 0
)(58)
fhnh
=
(φhnh,1
φhnh,2
1 0
)(59)
51
B The data
Table 11: Data used
Variable Description Ctry Unit DFM for Frequency
CFIN R Q House hold final consumption (chainedlinked volumes 2010)
LU Mio Euro Consumption Quarterly
CONJCONF12 M Consumer Confidence, General Eco-nomic Situation over Next 12 Months,Balance
LU Index Consumption Monthly
CONJCONF20 M Consumer Confidence, Intention toBuy a Car within the Next 12 Months
LU Index Consumption Monthly
CONJEMP M Average employment expectations LU Index Labour market MonthlyCONJIND SA M Business Surveys, Industry Confidence
Indicator, Balance, SALU Percent Labour market Monthly
COSOCEFF Q Social contributions receivable - ofwhich employers actual social contri-butions
LU Mio Euro Consumption Quarterly
EAMPRODIND M Industrial production 2010 =100 LU Index Labour market MonthlyEMMPNAR M National employment LU 1000 Persons Consumption Monthly
EMP M Total employment; interior concept LU Persons Labour market MonthlyFRIN M Cross border workers in Luxembourg LU Persons Labour market Monthly
IMAVOITPART Luxembourg, Domestic Trade, VehicleRegistrations (ACEA), New, Passen-ger Cars
LU Number of cars Consumption Monthly
OENSFLUX M New job offers (flows) LU Persons Labour market MonthlyPIBEA R Q Gross domestic Product Luxembourg
(chained linked volumes 2010)EA Mio Euro Labour mar-
ket, ExportQuarterly
PIB R Q Gross domestic Product Luxembourg(chained linked volumes 2010)
LU Mio Euro Consumption,Labour market
Quarterly
P IMMOLU Q House Price Index LU Index Consumption QuarterlyP MET M Price index of iron and steel products,
tubes and othersLU 2010=100 Exports Monthly
PMIMANUS M United States, Business Surveys, ISM,Manufacturing, Purchasing Managers’Index
US Rolling index Exports Monthly
P PPIEA M Industrial Produces Price Index EA 2010=100 Exports MonthlyPRODINDLU M Working day adjusted production in-
dex total economyLU 2010=100 Exports Monthly
PRODSTEELEU M EU, Metal Production, Crude Steel EU metric tonnes Exports MonthlyPRODSTEELLU M Luxembourg, Metal Production,
Crude SteelLU metric tonnes Exports Monthly
TICTEUR M Interbank Rates, EURIBOR, 3 Month,Fixing
EA Percent Consumption Monthly
TILTEUR M Government Benchmarks, Eurostat,Government Bond, 10 Year, Yield
EA Percent Consumption Monthly
U BE M Unemployment Belgium ILO defini-tion
BE Persons Labour market Monthly
U DE M Unemployment Germany ILO defini-tion
DE Persons Labour market Monthly
U FR M Unemployment France ILO definition FR Persons Labour market MonthlyU LU M Unemployment Luxembourg ILO defi-
nitionLU Persons Consumption Monthly
U M Number of registered unemployed. LU Persons Consumption,Labour Market
Monthly
XB R Q Exports of goods (chained linked vol-umes 2010)
LU Mio Euro Exports Quarterly
52