Integration and Cointegration of Variables (ICV)

Abstract

In this report, we present an overview of notions of stationarity (integration) and cointegration of variables. Although they occupy an important place in statistics and econometrics, their concepts have, for many researchers, been a little vague. I allow myself to further substantiate this by proposing only the essentials from the theoretical and empirical point of view on the integration and cointegration of variables.

Keys Words: ARDL, Box-Jenkins, Engle & Granger (1987), Johansen (1988)

Introduction

A temporal variable, a process of short or long memory requires, before being analyzed, a study making it possible to raise the great characteristics of a quantity from the statistical point of view. This is among others, its tendency; seasonality; its stationarity, its law of probability or its density, etc. One of the most interesting and in most cases very interesting is the question of the stationarity of the chronicle. Several processes are at our disposal for its implementation. In addition, the interest of cointegrated variables in the case of a linear combination has become viral in econometric studies. For fear of presenting an illusory model with no-standard coefficients, the literature hurries to propose a lot of tools to come around this major obstacle in econometrics. In the following sections, we present the stationarity of the variable in a more concise manner and discuss the notion of cointegration later.

Abdi-Basid ADAN * Integration and Cointegration of Variables (ICV) * abdi-basid@outlook.com P a g e 4 | 8 I.

Integration of Variables (IV) One of the fundamental concepts in statistics and econometrics is actually that related to the integration of variables. What is really the integration of variables? It turns out that in statistics, the idea of no-stationarity is rather commonly used compared to that of integration of variables. However, the meaning of the latter is versatile from one discipline to another. In statistics, a temporal process fluctuating independently of time is said to be stationary. For illustration (chronogram and correlogram), we can witness a concentration of the chronic around its mean value (mathematical expectation) or a rapid decrease of the autocorrelation function. In terms of probability, the joint distribution (law or density) is identical for the first k and k + 1 variables. In this sense, the process is stationary in the strict sense (also called first-order or strong). To say more, whatever the moment t considered, the variation of the chronic in t and t + 1 is not influenced by the temporal reference. From this point of view, its properties, also known as "stationarity conditions", namely mean, variance and covariance, are all convergent and independent of time. In view of the complexity of estimating a law of probabilistic nature for a given distribution, it has been proposed to remedy this, a second sense of stationarity, called second-order stationarity (weak sense or covariance). Only the conditions of expectancy and the variance of the variable are required and indispensable to judge and affirm stationarity against a variable. Knowing the third condition also includes the second when k is zero (cov (xt, xt-k)). This ensures a pioneering role especially in the forecast of a time series. In principle, the confidence interval of the estimated values depends on its validity. It is mentioned in the literature the existence of a multitude of forms of non-stationarity of a temporal sequence among which the stationarity with tendency (TS: Trend Stationnarity, Nelson et Plosser (1982)), which graphically, one witnesses a growth evolution of the series during of time: as a result, stationarity conditions are not met. To circumvent this adversity and later obtain the series devoid of trend component, we have to choose one of three techniques including straight ordinary least squares, simple moving average or by the filter Hodrick-Prescott. Here we obtain a white noise Hamilton (1994), by definition

Abdi-Basid ADAN * Integration and Cointegration of Variables (ICV) * abdi-basid@outlook.com P a g e 5 | 8

stationary but of second order. In the statistical vocabulary, one also evokes the independent white noise, because of the independence between the variables, which implies the nullity of the value of the covariance or simply the decorrelation between the variables (the reciprocal being already false). Sometimes of a Gaussian nature, it is both independent and follows a normal law. Under these conditions, the white noise is strictly stationary (or stationary in the strict sense). In contrast, a stochastic process is not necessarily stationary. The second form of stationarity, for its part, is of stationarity by difference (DS: Difference Stationnnarity, Nelson et Plosser (1982)). It is due, in particular, to the influence of the series by its own historical values. In evidence, the differentiated in a good order differ from zero allows to stationise the series. In practice, two families of stationarity test are announced. It is, on the one hand, the test having the null hypothesis of stationarity (KPSS Test) and, on the other hand, the tests having the null hypothesis of non-stationarity (Dickey-Fuller or Phillips-Perron test). Even more, a variable can be stationary with constant and trend; neither one nor the other or with constant only, according to the significativities of these parameters. The nuance which it seems appropriate to specify is that of telling oneself how a series can be stationary with a tendency, whereas it is the tendancy component that must be deprived of it. Quite logical, in fact, it would mean the same thing and not seen as two different or contradictory propositions. In a word, variable integration consists of depriving the time series of any trend or historical disturbance in order to identify the stable series for use in various analyzes. Finally, it is with the support of this primordial and imperious step that we can afford to begin the estimation of the following models precisely models of Box-Jenkins (ARIMA, SARIMA, VAR, ARMAX, ... etc) or Engle (ARCH, GARCH, ... etc).

Abdi-Basid ADAN * Integration and Cointegration of Variables (ICV) * abdi-basid@outlook.com P a g e 6 | 8

II. Cointégration des Variables (CV) Cointegration is indispensable in two-dimensional analysis both in statistics and econometrics. We are aware of the imminent error that can lead us to the simultaneous consideration of two or more variables without first having used a study by variable. In a way, "better first evaluate each variable before the considered helpful". Introduced in economics by Engle and Newbold (1974), then developed by Granger and Engle (1987) and continued in 1991 by Johansen, the notion of cointegration is indeed, the integration of the linear combination between two variables taken together to which tests in the literature allows us to assert in case of presence of cointegrating vector. This is the Engle and Granger tests (1987); Johansen (1988); Johansen and Juselius (1990), ... etc. It is so common to end up with the problem of fallacious regression in the modeling of the variables, because of a linear regression on non-stationary variables, which in principle testifies to an explanatory mechanism R² and a t of Student very appreciable when in reality, there is no relationship between them. In this perspective, the distribution of the estimated parameters no longer follows a Student's law but a Wiener process (Brownian motion), which occurs when the variance of at least one of the variables diverges, due to its dependence on the temporal dimension and well identifiable with recursion procedure. Indeed, good prospects of the residual analysis accompany for the final validity of the model. When the residue is not stationary, it is assimilated to the case of presence of autocorrelation between the model residues. The order of integration of the residual model is not necessarily below that of the variables of the model. In obviousness, a residual component of the stationary model with an order different from 0 is considered as a model where the autocorrelation of the residues is imminent. In other words, autocorrelation and stationarity are indirectly involved through the last condition of covariance (in the weak sense of stationarity). The goal of cointegration is to be able to determine a stationary residue while working on two nonstationary variables in level. The idea proposed by Engle is in the sense that in the short term the variables diverge, but there exists in the long term a stability, a balance between them. A common evolution of variables. Now knowing the possibility of no-zero order cointegration, in the long

Abdi-Basid ADAN * Integration and Cointegration of Variables (ICV) * abdi-basid@outlook.com P a g e 7 | 8

term the desired perspective exists according to well-defined conditions upstream and downstream. Among them, the same order of integration of the variables, verifying the possibility of cointegrating the variables, in other words an order less than or equal to the common order of integration of the variables. In addition, downstream, a negative sign and significant equilibrium restoring force coefficient (or speed of adjustment) is required while checking the stationarity of residues obtained. Gran-ger's (1983) theorem highlights the cointegration relationship and the error-correction model. Short-term relationship estimation with OLS is only possible when the variables are differentiated. In other words, by integrating in the model, the delayed variables as explanatory. On the other hand the long term relation are es-timées in level by OLS. No method is perfect, in this sense the disadvantage of the Engle-Granger approach (1987) is the non-distinction of cointegration relation. In principle, it has only one cointegration relation, whereas it is of number k-1 relationship with k the number of variables. Johansen (1988) provides a solution to this problem with his multivariate maximum likelihood approach. For the validity of the VECM model, a cointegration rank of less than the number of variables and non-zero is required, which is evidenced by the maximization of the likelihood log. Otherwise, a VAR model (p) is estimated in place of a VECM. On the other hand, as in the self-gressive Vector model, the specification of the model according to the absence or presence of a constant and trend is necessary. We can determine them with their significativities once estimated. The one-step method of BANERJEE et al. or MCE at Handry makes it easier to interpret the long-term relationship. In addition, to estimate long-term relationships in the case of small samples, the two-step model could lead to estimation bias according to Banerjee, Dolado, Hendry and Smith (1986). The ARDL model "AutoRegressive Distributed Lag / ARDL" and the cointegration test at the terminals of Pesaran et al. (2001) take a new approach to overcome the case of cointegration of variables at different orders of sta-tionality. On the other hand, when the variable integration order is greater than 1, the application of the ARDL model poses a problem.

Abdi-Basid ADAN * Integration and Cointegration of Variables (ICV) * abdi-basid@outlook.com P a g e 8 | 8

Conclusion

Stationarity is more than ever a prerequisite for studying variables before introducing them into larger studies. It involves denying the temporal process of trend and historical disturbances with appropriate procedures to conduct statistical and econometric analyzes. In the same way, cointegration proves even more complex, because one wants to be interested in working on level variables, while avoiding that the regression is misleading. Several approaches are pressing, one more efficient than the other. In both cases, these notions can not be taken with negligence, because a scientific production based on the temporal process inevitably depends on it.

Bibliography : 1. Atoumane Diagne (2015) : Econometric modeling of electricity consumption in Senegal from 1999 to 2015 2. Hélène Hamisultane: Error-correction model and applications 3. Jonas Kibala Kuma (2018) : ARDL Modeling, Cointegration Testing and Toda-Yamamoto Approach: Elements of Software Theory and Practice 4. Lonzo Lubu Gastonfils (2015) : Application of the Box-Jenkins Prediction Method

Translate