Our focus in this post is on Granger causality tests; rather, on a popular panel version of the test proposed in Dumitrescu and Hurlin (2012) (DH). Below, we summarize Granger causality testing in the univariate case, follow the discussion on the panel version of the test, and close with our findings from a large Monte Carlo simulation replicating and extending the work of DH to cases which were not covered in the original article. In particular, our focus is on studying the impact on size and power when the regression lag order is misspecified relative to the lag order characterizing the true data generating process (DGP).
Granger Causality Tests
The idea behind Granger causality is simple. Given two temporal events, xt and yt, we say xt Granger causes yt, if past information in xt uniquely contributes to future information in yt. In other words, information in {xt−1,xt−2,…} has predictive power for yt, and knowing both {xt−1,xt−2,…} and {yt−1,yt−2,…} together, yields better forecasts of yt than knowing {yt−1,yt−2,…} alone.In the context of classical, non-panel data, testing whether xt Granger causes yt reduces to parameter significance on the lagged values of xt in the regression: yt=c+γ1yt−1+γ2yt−2+⋯+γpyt−p+β1xt−1+β2xt−2+⋯+βpxt−p+ϵt where ϵt satisfies the classical assumptions of being independent and identically distributed, the roots of the characteristic equation 1−γ1r−γ2r2−…−γprp=0 lie outside the unit circle, namely, yt is stationary, xt is stationary itself, and, p≥1. In other words, we have the following null and alternative hypothesis setup: H0:∀k≥1,βk=0;xt does not Granger cause yt.HA:∃k≥1,βk≠0;xt does Granger cause yt. Although the traditional Granger causality test is only valid for stationary series, we diverge briefly to caution on cases where xt and yt may be non-stationary. In particular, whenever at least one variable in the regression above is not stationary, the traditional approach is no longer valid. In such cases one must resort to the approach of Toda and Yamamoto (1995). In this regard, we also emphasize that unlike non-stationary but non-cointegrated variables, which may or may not exhibit Granger causality, all cointegrated variables necessarily Granger cause each other in at least one direction, and possibly both. Since our friend Dave Giles has exceptional posts on the subjects here, here, and here, we will not delve further and urge interested readers to refer to the material in these posts.
Dumitrescu-Hurlin Test: Panel Granger Causality Test
Recall that time series panel data associates a cross-section i=1,…,N for each time observation t=1,…T. In this regard, a natural extension of the Granger causality regression (1) to cross-sectional information, would assume the form: yi,t=ci+γi,1yi,t−1+γi,2yi,t−2+⋯+γi,pyi,t−p+βi,1xi,t−1+βi,2xi,t−2+⋯+βi,pxi,t−p+ϵi,t where now, we require the roots of the characteristic equations 1−γi,1ri−γi,2r2i−…−γi,prpi=0 to be outside the unit circle for all i=1,…,N, in addition to requiring stationarity from xi,t for all i. Moreover, we assume ϵi,t are independent and normally distributed across both i and t; namely, E(ϵi,t)=0, E(ϵ2i,t)=σ2i, and E(ϵi,tϵj,s)=0 for all i≠j and s≠t. In other words, we exclude the possibility of cross-sectional dependence and serial correlation across t. While restrictive, relaxing these assumptions is still in theoretical development so we restrict ourselves to the aforementioned specification.At this point, it is instructive to reflect on what the presence and absence of Granger causality in panel data actually means. In this regard, while the absence of Granger causality is as simple as requiring non-causality across all cross-sections simultaneously, namely: H0:∀k≥1 and ∀i,βi,k=0;xi,t does not Granger cause yi,t, ∀i the alternative hypothesis, namely the presence of Granger causality, is more involved. In particular, are we to assume presence of Granger causality implies causality across all cross sections simultaneously, namely, HA1:∀k≥1, and ∀i,βi,k≠0;xi,t does Granger cause yi,t, ∀i or, are we to hypothesize the presence of Granger causality as causality that is present for some proportion of the cross-sectional structure; in other words: HA2:∀k≥1 and ∀i=1,…,N1,βi,k=0;xi,t does not Granger cause yi,t, ∀i≤N1∀i=N1+1,…,N, ∃k≥1,βi,k≠0;xi,t Granger cause yi,t for i>N1. where 0≤N1/N<1. Since HA1 is evidently restrictive, we focus here on HA2. In particular, the theory for a panel Granger causality test in which H0 is contrasted with HA2 is the foundation of the popular work of Dumitrescu and Hurlin (2012). In fact, the approach taken follows closely the work of Im, Pesaran, and Shin (2003) for panel unit root tests in heterogenous panels. In particular, estimation proceeds in three steps:
- For each i and t=1,…,T, estimate the regression in (2) using standard OLS.
- For each i, using the estimates in Step 1, conduct a Wald test for the hypothesis βi,k=0 for all k=1,…,p, and save this value as Wi,T.
- Using the N statistics Wi,T from Step 2, form the aggregate panel version of the statistic as: WN,T=1NN∑i=1Wi,T
Given the test statistic (3), DH demonstrate its limiting distribution when T⟶∞ followed by N⟶∞, denoted as T,N⟶∞; in addition to the case where N⟶∞ with T fixed. The results are summarized below: ZN,T=√N2K(WN,T−K)d⟶T,N→∞N(0,1)˜ZN=√N(T−3K−5)2K(T−2K−3)((T−3K−3T−3K−1)WN,T−K)d⟶N→∞N(0,1) provided T>5+3K as a necessary condition for the validity of results. The latter ensures that the OLS regression in Step 1 above is valid, by preventing situations in which there are more parameters than observations.
In either case, the results follow from classical statistical concepts and central limit theorems (CLT). In particular, in the case where T,N⟶∞, observe that Wi,Td⟶T→∞χ2(k) for every i. Accordingly, one is left with N independent and identically distributed random variables, each with mean K and variance 2K. Thus, the classical Lindberg-Levy CLT applies, and the first limiting result follows. For the second case, DH demonstrate that when T is fixed, Wi,T represent N independent random variables but each has mean K(T−3K−1)T−3K−3 and variance 2K(T−3K−1)2(T−2K−3)(T−3K−3)2(T−3K−5), and so they are not identically distributed. In this case, one can invoke the Lyapunov CLT, and the second result follows. Of course, it follows readily that as T⟶∞, both limiting results coincide. We refer interested readers to the original DH article for details.
EViews has allowed estimation of the Dumitrescu-Hurlin test as a built in procedure since EViews 8. Dumitrescu and Hurlin have also made available a set of Matlab routines to perform their test and a companion website. In recent months, a Stata ado file allowing estimation of the test has also been made available. It should be noted that due to slight calculation errors in the original Matlab and Stata code, EViews results did not always match those given by Matlab and Stata. In recent months those mistakes have been fixed by the respective authors, and now both Matlab and Stata match the results produced in EViews.
In EViews, the test is virtually instant. Proceeding from an EViews workfile with a panel structure, open two variables, say xt and yt as a group, proceed to View/Granger Causality, select Dumitrescu Hurlin, specify the number of lags to use, namely, set p, and hit OK.
Dumitrescu-Hurlin Test: Monte Carlo Study
We close our post with findings from our extensive Monte Carlo study of the Dumitrescu and Hurlin (2012) panel Granger causality test. Although the authors conducted a simulation study of their own, we were disappointed that more emphasis was not placed on the impact of incorrectly specifying the lag order p in the Granger causality regression (2). In this regard, we wrote an EViews program to study both size and power under the following configurations:- Monte Carlo replications: 5000
- Sample sizes considered: T=11,20,50,100,250
- Cross-sections considered: N=1,5,10,25,50
- Regression lags considered: p=1,…,7
- Hypothesis configurations (includes H0): N1/N=0,25,50,75,1
- Statistics Used: ZN,T and ˜ZN
- First, both size and power drastically improve with increased sample size T, for all possible configurations. This effect is evidently more pronounced using the asymptotic statistic ZN,T since ˜ZN a priori accounts for the finiteness of T.
- Second, for each lag selection p and cross-section specification N (with the exception of N=1), size improves as N decreases, whereas power improves as N increases. On the other hand, the improvement in power due to increasing N can be drastically more pronounced and varied relative to the decrease in size from the same effect. This effect is much less pronounced for size, and much more pronounced for power when considering the ˜ZN statistic.
- Lastly, the sensitivity of the test to misspecification of the regression lag length p can be severe! In fact, our results show that size distortion is smallest with p=1, regardless of what the true underlying DGP is. While particularly evident in the case of the ZN,T statistic, the effect is somewhat less pronounced for the ˜ZN version of the test. In contrast, the test can be grossly underpowered whenever the regression lag p deviates from the lag structure characterizing the true DGP. In particular, if k is the number of lags in the true DGP, and p is the number of regression lags selected, the test is severely underpowered for all p<k and improves as p approaches k, although if p>k, the effect is not nearly as severe, and virtually unnoticeable.
If you would like to conduct your own simulations, you can find the entire code (mostly commented), here.
Thank you a lot for such a nice post.
ReplyDeleteInquisitive to know if any add-on package is coming on Panel VAR and Global VAR...
Which particular estimators are you interested in (references?)
DeleteThank you for your reply. Here's the reference;
DeletePanel VAR:
Love and Zicchino (2006) Financial development and dynamic investment behavior: Evidence from panel VAR. Quarterly Review of Economics and Finance 46: 190–210.
Global VAR:
Pesaran, Schuermann and Weiner (2004)
Dees, di Mauro Pesaran and Smith (2007, DdPS)
Looking forward to some add-on packages...
Thank you.
Hi,
ReplyDeleteThanks for this nice post. Some questions:
(1) Is it applicable to both unbalanced and balanced data?
(2) If data consists of both I(0) and I(1) variables, can we apply this test?
Thank you
SKD
For those interested in this article and the Dumitrescu-Hurlin test but do not have access to EViews/like to use R: The R package 'plm' has a fully-fledged implementation of the panel Granger (non-)causality test since version 1.6-6.
ReplyDeleteWhy are there no startup values when generating the Monte Carlo simulation DGP (data generating process)? Usually one have at least 200 or more startup values before starting to analyze the statistical size or power. Isn't that necessary (or is that only necessary when generating a unit root process, or generating an AR(1), and not necessary when generating a nrnd series)? Enlighten me please.
ReplyDeletePls is it applicable to variables that are I(0) and I(1)
ReplyDelete