pmc logo imageJournal ListSearchpmc logo image
Logo of hippokratiaJournal URL: redirect3.cgi?&&auth=0zIPOsTpgEYQjwbClIwyD542ofyCO1-ZpBv0IykNC&reftype=publisher&artid=2532959&article-id=2532959&iid=171599&issue-id=171599&jid=620&journal-id=620&FROM=Article|Banner&TO=Publisher|Other|N%2FA&rendering-type=normal&&http://www.hippokratia.gr/
Hippokratia. 2008 Jan–Mar; 12(1): 59–64.
PMCID: PMC2532959
Testing the effect of specific socioeconomic factors on the ischemic mortality rate. The case of Greece
AM Mouza
Technological Educational Institute of Serres, Department of Business Administration, Terma Magnisias, 621 24, Serres, Greece
Mouza A M, Technological Educational Institute of Serres, Department of Business Administration, Terma Magnisias, 621 24, Serres, Greece, tel: +323210 49178, fax number: +323210 49178 Email: mouza/at/teiser.gr
Abstract
In this paper we present a model to evaluate the effect of certain majors socioeconomic factors (such as alcohol and fat consumption, cigarettes smoking, unemployment rate as a proxy for uncertainty which results frustration, number of passenger cars as a proxy for physical exercise and per capita GDP as a proxy for nutrition quality), to the ischemic mortality rate. Since the existing research works on this field, suffer from the proper model testing, we analytically present all the tests necessary to justify the reliability of the result obtained. For this purpose, after specifying and estimating the model, we applied the specification error test, the linearity, multicollinearity and heteroscedasticity tests, the autocorrelation and stability tests and the ARCH effect test.

Finally, we present the aggregate efect of the above socioeconomic factors. In brief, we found that an increase of cigarettes smoked, of fat and alcohol consumption and the number of passenger cars will result to a relevant increase regarding mortality. The latter one is also affected by the changes in unemployment rate. On the other hand, an increase of personal disposable income may negatively affect mortality, by almost the same portion.

Keywords: model estimation, model testing, ischemic mortality rate, socioeconomic factors
 
There is a fundamental need to explain changes in morbidity and mortality as they are related to changes in socioeconomic factors, given that at national level, the socioeconomic status of the population is usually the most powerful health predictor. With this in mind, the main aim of this paper is to present a detail methodology as a guideline for the purpose of specification, estimation and testing a model, which enables us to determine the effect of certain specific social economic factors, on the ischemic mortality rate. In this context, in the model presented below we do not consider any medicine oriented or individual health status effects, or any medical treatment effects, but only certain major socioeconomic factors, to explain ischemic mortality rate.

Given that till now, there is no any detailed quantitative study for the case of Greece on this particular field, we decided to present this model which gives a clear picture regarding the extend that each of the main factors considered, influences the above mortality rate. The methodology of model testing, integrates this attempt to correctly assess the significance of specific etiological factors on the population health status, as far as the ishaemic disease is concerned.

On theoretical grounds, the choice of the particular factors considered here is based upon relevant research works discussing the effect of fat and alcohol consumption, cigarette smoking, quality of nutrition, physical exercise and frustration, on various heart diseases15.

Model specification

The model specification is based on similar works612, where the great majority of risk factors biochemical, behavioral stress or physical and health care utilization, are directly or indirectly related to economic growth factors and socioeconomic status. There is an indirect effect of many factors of this kind on the mortality changes. Which factors are to be considered in model specification, depends on their importance as well as the available data. In our case, this combination produced the following structural form, which is presented after estimation by OLS (Ordinary Least Squares) from annual data obtained from various sources as indicated in table 1.

Table 1.Table 1.
Main characteristics of the variables used

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is hippokratia-12-59-e001.jpg

  • where MORTALITYi = Ischemic mortality rate at year i.
  • ALCOCHOLi = Per capita average annual consumption of alcohol at year i.
  • CARSi-2 = Total number of passenger cars (national level), two years earlier.
  • CIGARSi-3 = Per capita average annual consumption of cigarettes, three years earlier.
  • ΔUNRi = UNRi – UNRi-1 (UNRi = unemployment rate at year i).
  • FATSi-3 = Per capita fat and land animal consumption, three years earlier.
  • PerCaGDPi-2 = Per capita Gross Domestic Product (in real terms), with two years time lag.

It is recalled that in (1) the numbers in brackets are standard errors of the corresponding estimated coefficients, equation M2 the adjusted coefficient of determination, d is the Durbin-Watson statistic, PC is the Amemiya criterion and BIC is the Bayesian Information Criterion, known also as Schwarz Bayesian Criterion. Note also that p values refer to the lowest level of significance (α), for the corresponding coefficient.

equation M3
is the error sum of squares, where ûi are the residuals from (1). The OLS estimate of the disturbance's variance, is denoted by s2.

Although the d statistic reveals the possible existence of a slight negative first order autocorrelation, which will be discussed later, we may conclude from (1), since all coefficients seem to be significant at an acceptable α, that at national (aggregate) level, the influence of the major socio-economic factors considered here on mortality rate, can be summarized in what follows.

  • We observe a positive effect of the quantity of alcohol consumed during the current year.
  • There is a cumulative effect of the cigarettes smoked and the fat consumed during the last three year.
  • Also there is a cumulative effect of the increase in the number of passenger cars, during the last two years, since the more intensive use of private cars, restrains physical exercise and increases stress.
  • The current change in unemployment rate seems to have a positive effect too, through increasing uncertainty in many individuals, who in turn feel more frustrated.
  • The increase of per capita GDP in the last two years seems to have a negative effect, mainly due to the improvements in nutrition quality, the better exercise and health care too.

More details regarding the model variables and further calculations to facilitate the computation of individual elasticities, are presented in table 1.

Model testing

The majority of the empirical works on this field suffer from the lack of further tests1316, which establish the grounds to conclude that the estimates we obtained have the desired statistical properties and the specified model is not going to lead us towards misleading inference. With this in mind, we suggest the following tests.

Test for specification error

It has been pointed out, that the number of explanatory variables selected, heavily depends on available data. This implies that we may exclude some other variables, which play an important role in explaining the variation of the dependent variable. In this case we have a specification error, resulting to biased end inconsistent estimates. We have tested this kind of specification error, with the following two ways.

1. By computing the recursive residuals, denoted by ûj0(j = m+1,.....T), with mean ū0.It is recalled, that m denotes the number of coefficients in (1), and T the number of observations actually used in the estimation process. From these residuals we calculate the statistic ψ from

A mathematical equation, expression, or formula that is to be displayed as a block (callout) within the narrative flow. The name of referred object is hippokratia-12-60-e001.jpg
which follows the Student's t distribution, with T-(m+1) degrees of freedom, under the null hypothesis of the correct model specification. It is clear that a large value of |ψ|, indicates the existence of specification error17,18. In our case we found that ū0 = -08199 wiht standard error = 0.684 and ψ = -0.1186, indicating that no specification error exists.

2. By applying the RESET (Regression Specification Error Test) of Ramsey19. For simplicity, denote the estimated values of the dependent variable ŷi. Then, consider ŷi as the dependent variable in new (auxiliary) regressions, with explanatory variables ŷi2, or (ŷi2i3), or (ŷi2i3i4) and all the independent variables in (1) plus a constant term. From the ESS* of these regressions we consider the largest F* which is computed from

A mathematical equation, expression, or formula that is to be displayed as a block (callout) within the narrative flow. The name of referred object is hippokratia-12-60-e002.jpg
and follows the F distribution with [k, T-(m+k-1)] degrees of freedom, where m denotes the number of coefficients of the regression with the largest F*. In our case the largest F* corresponds to the auxiliary regression with ŷi2i3 and ŷi4 as explanatory variables. Given that F* is much smaller than the tables F, for α=0.05 and 0.01, we reach to the same conclusion, that no specification error exists.

Linearity test

It is recalled that linearity is one of the basic assumptions, in order to apply OLS for model estimation. This test presupposes the estimation of the regression

A mathematical equation, expression, or formula that is to be displayed as a block (callout) within the narrative flow. The name of referred object is hippokratia-12-61-e001.jpg
to obtain R2 = 0.0165753 and LM = TXR2, where LM is the Lagrange multiplier, which follows the X2 distribution with k-1 (k=3) degrees of freedom. Given that LM is much less than the tables X2 for any standard significance level, we may conclude, that no problem regarding linearity exists.

Multicollinearity test

Given that some of the explanatory variables have a very small total variation (0.12E-2 and 0.86F-2) the use of condition number (which is the largest condition index) will produce misleading results20. In such cases it is preferable to employ the Variance Inflation Factor (VIF), for testing multicollinearity. VIF has to be calculated for each explanatory variable i (i = 1, 2,...., m-1) taken as dependent and the remaining as independent, in order to get the corresponding Ri2. For variable i, the VIFi is computed from

equation M7

The results obtained are presented in table 2.

Table 2.Table 2.
Estimated Variance Inflation Factors (VIFi) and the corresponding R2.

Since none of the VIF values exceeds 10, we conclude that there is no any severe collinearity problem, regarding the columns of data matrix X of the explanatory variables, which implies that X has full column rank.

Heteroscedasticity test

To apply this test, we computed the Spearmans's rank correlation coefficient rs, considering each explanatory variable and the (absolute values of) residuals. The obtained coefficients, together with the computed t* values (given that the standard error is 0.2357), are presented in table 3.

Table 3.Table 3.
The computed rank correlation coefficients rs and the corresponding t* values

Since the last t* statistic is greater than the tables t, for (T-m) degrees of freedom at α=0.05, we see that there is a problem of heteroscedastisity of the form:

A mathematical equation, expression, or formula that is to be displayed as a block (callout) within the narrative flow. The name of referred object is hippokratia-12-61-e003.jpg

In such a case we have to transform the initial model, dividing throughout by PerCaGDPi and to re-estimate it. The results are presented bellow.

A mathematical equation, expression, or formula that is to be displayed as a block (callout) within the narrative flow. The name of referred object is hippokratia-12-61-e004.jpg

where

  • YSTARi =MORTALITYi/PerCaGDPi
  • ONEovGDPi = 1/ PerCaGDPi
  • ALCovGDPi = ALCOHOLi/PerCaGDPi
  • CARovGDPi = CARSi-2/PerCaGDPi
  • CIGARovGDPi = CIGARSi-3/ PerCaGDPi
  • ΔUNRovGDPi = ΔUNRi/ PerCaGDPi
  • FATovGDPi = FATSi-3/ PerCaGDPi

It should be noted that we avoided applying the general test of White21, to test for heteroscedasticity, since there are too many explanatory variables in the model.

Before going back to the original form of the model seeing in (1), by multiplying (3) by PerCaGDPi, it is advisable to perform one more test, regarding autocorrelation.

Autocorrelation and stability test

Given the value of d statistic in (3), one may suspect that there is a problem of first order (negative) autocorrelation. Applying ordinary Durbin-Watson test, we found dL<d<dU, so that the test was inconclusive. However, according to other tests (Brausch-Godfrey), it seems that the problem of autocorrelation is present. At this point it is useful to recall that we may have some indications for autocorrelation, although the real problem is an ARCH (AutoRegressive Conditional Heteroscedasticity) effect. To be sure that we have to face the problem as being a pure autocorrelation problem, we should trace any ARCH effect, by performing the proper test. With this in mind, and considering that the order h (maximum lag) is 3, we get the following estimates.

A mathematical equation, expression, or formula that is to be displayed as a block (callout) within the narrative flow. The name of referred object is hippokratia-12-62-e001.jpg

We further compute LM = (T-h)xR2, which is much smaller than the tables X2, for h =3 degrees of freedom, so that we may conclude that the error terms do not follow an ARCH scheme, and thus we are facing a pure autocorrelation problem. The application of Hildreth-Lu iterative procedure to get the minimum |d-2|, produces the following results.

A mathematical equation, expression, or formula that is to be displayed as a block (callout) within the narrative flow. The name of referred object is hippokratia-12-62-e002.jpg

To test the coefficients stability, we consider the recursive residuals and compute the CUSUM (cumulative sums) and CUSUMSQ (cumulative sums of squared residuals) statistics, presented in 1 and 2. From these graphs one may conclude that no problem of coefficients stability exists.

To transform model (4) to its original form, which is model (1), we multiply (4) throughout by PerCaGDPi to obtain

A mathematical equation, expression, or formula that is to be displayed as a block (callout) within the narrative flow. The name of referred object is hippokratia-12-62-e003.jpg

With the new estimates we get 3, which refers to the observed and estimated values of the dependent variable.

Discussion

In this paper a relevant model has been specified, estimated and properly tested, in order to investigate the quantitative effect of specific socioeconomic factors on the ischemic mortality rate. We presented step by step all the tests required in order to obtain correct and unbiased results which reflect the real world in the best possible way. We also discussed the effect of each factor on the dependent variable in a rather general manner. Further we may obtain some more detailed results, if we combine the ratios of means of the explanatory variables over the mean value of the dependent variable presented in table 1, together with the estimated coefficients in (5), in order to get estimates of the elasticities. Thus we may determine the extend that each explanatory variable affects mortality, which is the dependent variable in our model. These elasticities are presented in table 4, bellow.

Table 4.Table 4.
Estimated elasticities

It is noted, that all positive elasticities are less than one and only the (absolute) value of the last one is slightly greater than unity. According to the last column of table 4, we can say that a 10% increase of cigarettes smoked (i.e. 100 cigarettes), will result to a 1.77% increase in mortality. This harmful effect of smoking will be realized with a delay of three years. Also an increase of fat consumption by 10%, will gradually increase mortality by 1%, in the following three years An increase in alcohol consumption by 10% (i.e. 100 kgr) will increase mortality rate by 1.2% without any delay, i.e. in the same time period. A similar interpretation holds for the other socioeconomic factors considered, which constitute the set of the explanatory variables. In other words, a 1% increase of the number of passenger cars, will result to a gradual increase in mortality by 0.48%, which is to be realized two years later. Changes in unemployment rate by 1%, will affect mortality by 0.1%, at the same time period, through increasing frustration. On the other hand, an increase of personal disposable income by 1%, may contribute towards reducing mortality, by almost the same portion. This effect is due to the expected improvements in nutrition quality, the better exercise and health care too.

Finally it is worthy to mention that we haven't met in the literature an equivalent quantitative analysis on this particular field.

Endnotes

  • From the mathematical point of view, a model is a mathematical expression to describe a real phenomenon. In this context, the flight of an air plain can be simulated by a single differential equation. On the other hand, to model a national economy one may need more than a hundred of difference equations. A model is used for simulation purposes, and mainly to investigate the effect of some predetermined factors on the characteristic we are interested in.
  • The right specification is the first step in model building. This implies that we must know in advance the factors (usually called explanatory variables), which actually affect the evolution of the characteristic under consideration (usually called dependent variable).
  • A model should comply with some specific tests, in order to be efficient and reliable. All these tests are described in details and properly applied in this paper, given that there is a lack of such an analysis in the relevant literature.

Footnotes
For terminology explanation see endnotes in page 63
References
1.
Hansagi, H; Romelsjö, A; Garhardsson de Verdier, M, et al. Alcohol consumption and stroke mortality, 20-year follow-up of 15,077 men and women. Stroke. 1995;26:1768–1773. [PubMed]
2.
Shaper, AG; Wannamethee, SG. Alcohol intake and mortality in middle aged men with diagnosed coronary heart disease. Heart. 2000;83:394–399. [PubMed]
3.
Mukamal, KJ; Ascherio, A; Mittleman, MA, et al. Alcohol and risk for ischemic stroke in men: the role of drinking patterns and usual beverage. Ann Intern Med. 2005;142:11–19. [PubMed]
4.
Wannamethee, SG; Shaper, AG. Patterns of alcohol intake and risk of stroke in middle-aged British men. Stroke. 1997;27:1033–1039. [PubMed]
5.
Johansson, SE; Sundquist, K; Ovist, J; Sundquist, J. Smokeless tobacco and coronary heart disease: a 12-year follow-up study. Eur J Cardiovasc Prev Rehabil. 2005;12:387–392. [PubMed]
6.
Brenner, MH. The economic change model of pathology. Health projections in Europe. Methods and applications. World Health Organization . 1986
7.
Brenner, MH. Economic, alcohol consumption and heart disease mortality in nine industrialized countries. Soc Sci Med. 1987;25:119–132. [PubMed]
8.
Kauhanen, J; Kaplan, G; Colberg, D; Salonen, J. Beer Bingeing and Mortality: Results from Kuopio Ischaemic Heart Disease Risk Factor Study, a Prospective Population Based Study. BMJ. 1997;315:846–851. [PubMed]
9.
Lozano, R; Murray, CJ; Lopez, AD; Satoh, T. Miscoding and misclassification of ischaemic heart disease mortality. World Health Organization. 2001 Sep;
10.
Murray, C; Lopez, A. The Global Burden of Disease: A Comprehensive Assessment of Mortality and Disability from Diseases, Injuries and Risk Factors in 1990 and Projected to 2020. Vol: 1. Harvard University Press; 1996.
11.
Peto, R; Lopez, A; Boreham, J; Thun, M; Heath, C. Mortality from Tobacco in Developed Countries: Indirect Estimation from National Vital Statistics. Lancet. 1992;339:1268–1278. [PubMed]
12.
Zatonski, W. Alcohol and Health: What is good for the French may not be for the Russians. J Epidemiol Community Health. 1998;52:766–767. [PubMed]
13.
Critchley, JA; Unal, B. Is smokeless tobacco a risk factor for coronary heart desease? A systematic review if epidemiological studies. Eur J Cardiovasc Prev Rehabil. 2004;11:101–112. [PubMed]
14.
Asplund, K; Nasic, S; Janlert, U; Stegmayr, B. Smokeless tobacco as a possible risk factor for stroke in men: a nested case-control study. Stroke. 2003;34:1754–1759. [PubMed]
15.
Huhtasaari, F; Lundberg, V; Eliassin, M, et al. Smokeless tobacco as a possible risk factor for myocardial infraction: a population-based study in middle-aged men. J Am Coll Cardiol. 1999;34:1784–1790. [PubMed]
16.
Milosević, M; Mustajbegović, J; Abdivić, S, et al. Relationship of regional alcohol drinking habit and cardiovascular risk factors in Croatia. Acta Med Croatica. 2007;61:275–279. [PubMed]
17.
Harvey, AC. The Econometric Analysis of Time Series. Oxford: Philip Allan Publishers; 1981.
18.
Harvey, AC. The Econometric Analysis of Time Series. 2nd edition. Cambridge: Massachusetts: The MIT Press; 1990.
19.
Ramsey, JB. Testing for Specification Errors in Classical Linear Least Squares Regression Analysis. J R Stat Soc. 1969;31:350–371.
20.
Lazaridis, A. A Note Regarding the Condition Number: The Case of Spurious and Latent Multicollinearity. Qual Quant. 2007;41:123–135.
21.
White, H. A Heteroscedasticity Consistent Covariance Matrix Estimator and a Direct Test of Heteroscedasticity. Econometrica. 1980;48:817–838.