A New Test for ARCH Effects and Its Finite-Sample Performance Author(s): Yongmiao Hong and Ramsey D. Shehadeh Source: Journal of Business & Economic Statistics, Vol. 17, No. 1 (Jan., 1999), pp. 91-108 Published by: American Statistical Association Stable URL: http://www.jstor.org/stable/1392241 . Accessed: 22/11/2013 14:15 Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at . http://www.jstor.org/page/info/about/policies/terms.jsp . JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. . American Statistical Association is collaborating with JSTOR to digitize, preserve and extend access to Journal of Business &Economic Statistics. http://www.jstor.org This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions A New Test for ARCH Effects and Its Performance Finite-Sample Yongmiao HONGand Ramsey D. SHEHADEH Departmentof Economics,CornellUniversity,Ithaca,NY 14853-7601 ([email protected]) ([email protected]) We proposea test for autoregressiveconditionalheteroscedasticity based on a weightedsum of the squaredsampleautocorrelations of squaredresidualsfrom a regression,typicallywith greater weight given to lower-orderlags. The tests of Engle, Box and Pierce, and Ljungand Box are equivalentto the test with equalweighting.Ourtest does not requireformulationof an alternative and permitschoice of the lag numbervia data-drivenmethods.Simulationstudiesshow that the new test performsreasonablywell in finitesamplesespeciallywith greaterweighton lower-order lags. We applythe test in two empiricalexamples. KEY WORDS: Cross-validation; Efficiency;Frequencydomain;Monte Carlo;Spectraldensity; Weighting. Since Engle (1982) introducedthe autoregressivecon- also asymptoticallydistributedas chi-squaredrandomvariditionalheteroscedasticity(ARCH)model, there has been ables with q df. Grangerand Terdisvirta (1993, pp. 93-94) considerableinterest in estimationof and testing for dy- showed that they are asymptoticallyequivalentto Engle's namic conditionalheteroscedasticityof the regressiondis- (1982) LM test. turbance.The ARCHmodel andits variousgeneralizations Detection of ARCH effects has attracted significant [e.g., Bollerslev's (1986) generalized ARCH (GARCH)] recent attention from researchers.Contributionsinclude have provedquite useful in modelingthe disturbancebe- those of Bera and Higgins (1992), Brock, Dechert, and haviorof regressionmodelsof economicandfinancialtime Scheinkman(BDS, 1987),Gregory(1989),Lee (1991),Lee series. See Bera and Higgins (1993), Bollerslev,Chou,and and King (1993), and Robinson(1991a).In this article,we Kroner(1992),andBollerslev,Engle,andNelson (1994)for proposea new test for the presenceof ARCHin the residuals from a possibly nonlinearregressionmodel. The test recent surveysof this literature. From the perspectiveof econometricinference,neglect- is basedon an extensionof Hong's (1996) spectraldensity ing ARCH effects may lead to arbitrarilylarge loss in approachand results for testing serial correlationof unasymptoticefficiency(Engle 1982) and cause overrejection known form in conditionalmean.Specifically,the null hyof standardtests for serial correlationin conditionalmean pothesisof no ARCHimplies that the normalizedspectral (Taylor1984; Milhoj 1985; Diebold 1987; Domowitz and densityof the squaredresidualsfrom a regressionmodelis Hakkio 1987). In the contextof autoregressivemovingav- the uniformdensity on the frequencyinterval[-7r,7r].Alerage(ARMA)modeling,Weiss (1984) pointedout thatig- ternatively,if ARCHexists, the normalizedspectraldensity noringthe ARCHeffect will resultin overparameterizationof the squaredresidualsis nonuniformin general.Therefore, we can construct a test for ARCH by comparing of an ARMA model. a kernel-basedspectraldensity estimatorof the squared In practice,the most populartest for ARCH is Engle's residuals to the uniformdensity via an appropriatediver(1982) Lagrangemultiplier(LM)test for ARCH(q)undera measure. When a quadraticnorm is used, the new gence two-sidedalternativeformulation.When the null hypothetest turns out to be a properlystandardizedversionof the sis of no ARCHis true,this statisticis asymptoticallydissum of squaresof all n - 1 sample autocorrelatributedas a chi-squaredrandomvariablewith q degrees weighted tions of the squaredresiduals,with weighting depending of freedom.It is simple to calculateand is asymptotically on the kernel function.Most commonlyused kernelstyplocally most powerfulif the truealternativeis ARCH(q),a characteristicit shareswith the likelihoodratio and Wald ically give greaterweight to lower-orderlags. The exceptests (Engle 1982) undera two-sided alternativeformula- tion is the truncatedkernel, which gives equal weight to tion. [In the ARCHcontext,it is also possible to construct each lag. Interestingly,when using the truncatedkernel (i.e., unia one-sidedtest. Withthe one-sidedalternative,the LM test form weighting),our approachdeliversa generalizedBox has a standardchi-squareddistribution.On the otherhand, and Pierce (1970)type of test, whichalso is asymptotically the likelihood ratio and Wald tests may have an asympto totic mixture of chi-squareddistributions.See Bera, Ra, equivalent a generalizedversion of Engle's (1982) LM and Sarkar(1998).] An alternativeapproachis to subject test for ARCH. (See Theorem3 following.)Because ecothe squaredresidualsto such standardtests for serialcorre- nomic agentsnormallydiscountpast information,the older lationas the portmanteau tests of Box andPierce(1970)and and Box which are based on the sum of the (1978), Ljung ? 1999 American Statistical Association firstq-squaredsampleautocorrelations of the squaredresidJournal of Business & Economic Statistics uals. These tests, as shown by McLeodand Li (1983), are January 1999, Vol. 17, No. 1 91 This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions 92 Journalof Business & EconomicStatistics,January1999 the information,the less effect it has on currentvolatility. Therefore,when a relativelylarge q is used, it seems that uniformweightingtests are likely not fully efficient;a better test should put greaterweight on recent information. Indeed,it can be shownthat,when a large q is used, many nonuniformkernels deliver more powerfultests than the truncatedkernel.Therefore,our approachmight be more powerfulthanthe Box-Pierce-LjungandLM tests in many practicalsituations.In fact, a linearlydecliningweighting schemewith a relativelylong lag oftenhasbeenusedin testing and estimatingARCHmodels (e.g., Engle 1982, 1983; EngleandKraft1983;Engle,Lilien,andRobins1987;Bera and Higgins 1993).Amongotherthings,such a schemeincreasespower,althoughit has been criticizedby Bollerslev (1986)andothersas an adhoc approach.In comparison,our kernel-basedweighting scheme arises endogenouslyfrom the frequency-domainapproach.For persistentARCH efinferencesuggeststhatit is better fects, frequency-domain to employ a long lag. Therefore,it is desirableto allow growthof the numberof lags with the samplesize for such alternatives. In addition to its relative efficiency gain, our test is also relativelyconvenientto implement.It is based on a weighted sum of squaredautocorrelationsof the squared residualsand has a null, asymptoticallyone-sidednormal distribution.Furthermore,we do not requireformulation of the alternativemodel. Ourasymptotictheoryallows the lag numberto be chosen via such data-drivenmethodsas cross-validation.Such methodsmay reveal some information about the true alternative.With a data-drivenq, one can obtain a nonparametricspectraldensity estimate for the squaredresiduals.This estimate usually enjoys some optimalitypropertiesand containsinformationon autocorrelationsin squaredresiduals-that is, volatilityclustering. This maybe appealingwhenno priorinformationaboutthe true alternativeis available,as is common in practice.In applicationsof the existing ARCHtests, one usuallymust calculatestatisticsfor several,possiblymany,valuesof q. It is commonthatsome statisticsare significantand some insignificant.Consequently,drawingconclusionsfrom sucha sequencecan be a delicatebusinessbecausethese statistics are not independent. In Section 1, we introducea class of new ARCHtests. In Section2, we discusschoice of the lag numberfor our test via data-dependentmethods,particularlycross-validation. In Section3, we discussthe relationshipsbetweenour tests andsome importantexistingtests. In Section4, we conduct a simulationstudy to comparefinite-sampleperformances of the new tests andthose of Engle (1982), Box andPierce (1970), Ljung and Box (1978), Lee and King (1993), and BDS. We also presenttwo empiricalexamplesin Section 5. In Section6, we concludethe article.All proofs are collected in the Appendix. 1. A NEW TEST FOR ARCH Considerthe data-generating process (DGP) Yt = g(Xt, bo) + Et, t = 1,..., n, (1) with ARCHerror Et = tth/2, (2) where Yt is the dependentvariable;Xt is a d x 1 vector containingexogenous variablesand lagged dependent variables;bo is an 1 x 1 unknown true parametervector in IRl;g(Xt,b) is a given, possibly nonlinear,function such that, for each b,g(, b) is measurablewith respect to Il-1, the informationset available at period t - 1; and g(Xt, -) is twice differentiablewith respect to b in an open neighborhoodN(bo) of bo almost surely, with limn• nr1 t=1 ESUPbEN(bo) flVbg(Xt, b)l4 < 0c and limn-+ n-' •-•n ESUPbEN(bo) V2g(Xt, b)l2 < where1I-IIis the Euclideannormin I1. [Forlinearmodelsi.e., g(Xt, b) = XIb--these dominanceconditionsreduceto limn-+on- 1jE EjjXt114< 00.] The functionht is a positive,time-varying,and measurable functionwith respectto tl-. We assumethat?t is iid with E(?t) = 0, E(t2) = 1, and E(?t8)< 00. In particular, we do not requirethat•t be N(O,1), which may be too restrictivefor manyhigh-frequencyfinancialdata.In addition, we assumethat?t andX, aremutuallyindependentfor t > s. By definition,Etis seriallyuncorrelatedwith E(Et) = 0. = ht, maychange But, its conditionalvariance,E(et Ctl) overtime.Forexample,if ht follows an ARCH(qo) process, we have h = to -+ i=1 a2-i, whereao > 0 and ai 0 to ensurepositivityof ht. If ht follows a GARCH(po,qo) E? Po2 qo process, h = ao ++ process, ht -i + jht-j, where +j=l i=1 o a0o > 0. The nullhypothesisof no ARCHeffectis Ho:ht = a2 almost surelyfor some 0 < a < 00 andall t = 1, 2, ..This is equivalentto conditionalhomoscedasticity.Define ut = E 0o - E(E2). Then,underHo,ut = 1t2 t Et/g2 ao0_ 1,, where whrt g2 a white-noiseprocess.The white-noisehypothesisimplies a uniformnormalizedspectraldensityor distributionfunction.WhenARCHexists,the spectraldensityor distribution function will not be uniformin general.Therefore,a test for Ho can be basedon the shapeof the spectraldensityor distributionfunction.Researchershaveemployedthe shape of the spectraldistributionfunctionto test varioushypotheses on serial dependence(e.g., white noise). These include Anderson(1993), Bartlett(1955), Durbin(1967), Durlauf (1991), and Grenanderand Rosenblatt(1953, 1957). See Priestley(1981) and Anderson(1993) for surveys.Extending these works to the ARCHcontext shouldbe possible, thoughwe have not seen such resultsin the literature. Relatively few tests are based on the spectraldensity function.Althoughthe periodogramis not consistentfor the spectraldensity,one can use Parzen's(1957) smoothedkernelspectraldensityestimatorto test the white-noisehypothesis, as did Hong (1996). This approachinvolves the choiceof a smoothingparameter,a sometimesdelicatebusiness. But, maximalpoweris often attainableif this parameter is chosenvia an appropriate data-drivenmethod.In this article,we extendHong's (1996) spectraldensityapproach to the ARCHcontext.Ourasymptotictheoryhere permits a data-dependent smoothingparameter. This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions Hong and Shehadeh:A New Test forARCHEffects 93 Let f(w) be the normalizedspectraldensityfunctionof where the secondequalityfollows by Parseval'sidentity, ut. Consequently, f(w) = fo(w) - 1/21r for all frequencies w e [-7r, wr]underHo. In contrast,f(w) fo(w) in general when ARCH exists. It follows that a test for Ho can be based on the L2 norm L2(f; fo) n-1 Cn(k) = E (1 - j/n)k2(j/q) j=1 and n-2 = 2 7r(f(w) - fo(w))2 dw(3) Dn(k) = > (1 - j/n)(1 - (j + 1)/n)k4(j/q). j=1 where f is a consistent spectraldensity estimatorfor f. As will be seen, tests based on (3) can be rathersimpleto compute;no integrationis required. We now constructan estimatorfor f. Let b be an n1/2consistentestimatorfor bo.For example,b can be the nonlinearleast squaresestimator b = argmin n' b (Y - g(Xt, b))2. t=1 We note that Cn(k) and Dn(k) are approximatelythe mean andvarianceof n En-. k2(j/q)132(j).The factors(1-j/n) and (1 - j/n)(1 - (j + 1)/n) canbe viewed as finite-sample correctionsandare asymptoticallynegligible.Althoughwe motivateour test using (3), Q(q) involvesneithernumerical mointegrationnor estimationof f. This frequency-domain tivationleads to an appropriatechoice of q via data-driven methods,as will be seen. Our first result shows that Q(q) is asymptoticallystandardnormalunderHo. Such an estimatordoes not take into accountthe possible Theorem 1. Let q --+ o, q/n -+ 0. Then under the stated ARCHeffects but consistentlyestimatesbo underboth the conditionsand H0, Q(q) -+ N(O,1) in distribution. null and alternative.The estimatedresidualof (1) is Et = The test is one-sidedbecausein generalQ(q) divergesto = Yt - g(Xt, b). Put Ut = t2 1, where t = ?t/8n and r t=> 2. Then the sampleautocorrelationfunctionof positive infinitywhen ARCH exists. Consequently,appron-1 {ut} is (j) = i(j)/r(O),j = O,+1,..., +(n - 1), where priateupper-tailedcriticalvalues of N(O, 1) must be used. For example,the criticalvalue at the 5% significancelevel i(j) = n-1 Z=iil+ utut-IjI. A kernelestimatorfor f is is 1.645. given by For largeq, q-Dn (k) -+ D(k) = fo7 k4(z) dz. Thus,one n-1 can replaceDn(k) by qD(k) withoutaffectingthe asympf(w) = (2r)-1 totic distributionof Q(q). Under some additionalcondik(j/q)1(j)cos(jw) j=1-n tions on k and/or q, q-1Cn(k) = C(k) + o(q-1/2), where fo k2(z) dz; we also can replaceCn(k) by qC(k). for w E [-ir, wr].Here, the bandwidthq - q(n) -+ oc and C(k) _a more Thus, compact,asymptoticallyequivalenttest statisq/n -+ 0. The kernel function k: R -- [-1, 1] is symmetric, tic is continuousat 0, and all but a finite numberof points,with "n-1 = k(0) = 1 and f_0Vk2(z) dz < 00. This implies that, for j (2qD(k))1/2 n qC(k) smallrelativeto n, the weightgivento p(j) is close to unity, Q*(q) k2(j/q)2(j) j=1 the maximumweight.The squareintegrabilityof k implies 00. as that k(z) 0 Thus, eventually, less weight is jzj -+ Because Q(q) is basedon the whole-sampleautocorrelagiven to p(j) as j increases. tion function 3(j) of the squared residual fit, it can detect Examples of k include the Bartlett, Daniell, general the whole class of lineardependenciesof ht-that is, auto(QS), and truncatedker- correlationin the squaredresiduals.Hence, it may be powTukey,Parzen,quadratic-spectral nels. (See, e.g., Priestley 1981, p. 441.) Of these, the erful against ARCH and GARCH alternatives,especially Bartlett,generalTukey,Parzen,and truncatedkernels are stronglypersistentones when a long lag is used. The test of compact support--k(z) = 0 for z > 1. For these kernels, q is the lag truncation number because lags of order j > q receive zero weight. In contrast, the Daniell and QS kernels are of unbounded support. Here, q is not a truncation point but determines the degree of smoothing for f(w). When the kernel is of unbounded support, all n- 1 sample autocorrelations of the squared residual are used. Following Hong (1996), we define our test statistic Q(q) = (2(f; -( 1n-1- fo) - Cn(k)) /(2Dn(k))1/2 cannot, however, detect all deviations from conditional homoscedasticity. For example, it has no power if ht follows a tent map, a typical nonlinearity from chaos theory. Of course, like the LM test, the test has power against many types of nonlinearity. When the regression model (1) is misspecified, the Q(q) test may falsely reject the null hypothesis Ho. This is not peculiar to Q(q). The same is true of all existing ARCH tests; all, explicitly or implicitly, assume correct specification of the regression model. In the present context, Equations (1) and (2) with existence of a consistent estimator b for bo (cf. Assumptions A.1-A.5 in the Appendix) imply correct specification of regression model (1). Such an assumption makes sense because one is usually interested in ARCH only when the regression model is correctly specified. This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions 94 Journalof Business & EconomicStatistics,January1999 The Q(q) test is based on the squareddeviationof f (w) because r = 00. Consequently,the conditionq/q - 1 = from fo(w), weighted equally for all frequenciesw over op(q-((3/2)v-1)) is ratherweak.FortheDaniellkernel,any [-7r,ir]. This is appropriatewhen no prior informationis v > 3/2 is allowed because 7 = 1. In this case, the rate available.One can directpower towardspecific directions condition q/q - 1 = op(q-((3/2)v-1)) is also easyto satisfy. of interestby giving differentweightsto differentfrequen- For suchmethodsas Andrews's(1991)parametric"plug-in" cies. For example,if one is interestedin detectingstrongly bandwidth,q/q - 1 vanishesin probabilityat the parametric persistentARCHeffects,greaterweightcanbe givento fre- rate n-1/2 with q oc n1/5 for most kernels,but for such quenciesnear0. Whennonuniformweightingfor frequen- methodsas Newey andWest's(1994)nonparametric "plugcies is used, an L2-norm-based test will involve numerical in" bandwidth,q/q - 1 vanishesat a rate slower than the integration.(We emphasizethat weightingfor frequencies parametricrate with q oc n1/3 for the Bartlettkernel. In is differentfrom weightingfor lags.) general, our conditionon q allows for a variety of dataAlternatively,tests can be based on other appropriate dependentmethods. One appropriatedata-drivenprocedureis the crossglobal divergencemeasures,such as the supremumover validationproposedby BeltraoandBloomfield(1987).This [0,7r]of the absolutevalue of the deviationf(w) - fo(w): delivers an automaticq by minimizinga cross-validated frequency-domainlikelihood function. Robinson (1991b) (q) =()1/2 2 sup showed that, asymptotically,such chosen q minimizes a wE[o,Gr]If(w)-fo(w) weightedintegratedmean squarederrorof f with suitable n-1 weights dependingon the true spectraldensity f(w). This 71/2 sup k(j/q)1(j)xV/cos(jw) global procedureis more appropriatein the presentconwE[,0-r]j=1 text than the narrow-bandmethodsused in estimatingan covariancematrix(e.g., Andrews autocorrelation-consistent This test is computationallymore complicatedthan Q(q). 1991; Newey andWest 1994) becauseQ(q) is basedon all Following reasoninganalogousto, but more complicated frequenciesover [-w, 7r]ratherthanon frequency0 only. than,thatof Anderson(1993) and Durlauf(1991), one can The Beltrdo-Bloomfieldprocedurecan be describedas null asymptoticdistribu- follows. Define expectthatQ(q)has a nonstandard tion thatis the supremumof a Gaussianprocesswith mean n-1-2 0 and covariancedependingon both w and k. For analytic n, t exp(-iAt)) cE(-oo, oo). I(A) E simplicity,we focus on Q(q) here, but we studythe power t=o of Q(q) in our simulations. This is the periodogramof it = E^2 /-2 1. Note thatI is pe2. CROSS-VALIDATION riodic in A with periodicity27. Put the Fourierfrequencies for j = 0, 1,..., n- 1. Then definethe crossAn importantpracticalissue with Q(q) or Q*(q) is the Aj = 27j/n estimatorat the Fourier validated spectral-density-function choice of q, which may have significantimpacton power as in finite samples.Clearly,the optimalchoice of q depends frequencies on the unknownalternative.It is, therefore,desirableto fJ(AJ;q) =j(q)-1 W(qAl)I(Aj -A•), choose q via data-drivenmethods.Althoughwe need not ION(n,j) computef to computeQ(q), our approachsuggeststhat it is appropriateto choose q using an optimalitycriterionfor whereaj (q) = lgN(n,j) W(qA1),W(A)= (27)-1 f_c k(z) the estimationof f. Before we discuss some data-driven exp(iAz)dz is the Fouriertransformof a kernelfunctionk, methodsin detail,we extendTheorem1, which assumesa and N(n,j) = {0,+n,...} U {2j, 2j n,...} is the set of nonstochasticq, to allow for a data-dependentbandwidth indicesj for which I(A• - Al) = I(Aj). Note thatwhen W > 7, the q. For this purpose,we furtherrestrictthe class of kernels is of compactsupportsuch that W(A) = 0 for AlA k such that lk(zi) - k(z2)1 _ All1 - z21 for any z1,z2 and effective summationis only over 1 for Il1< n/2q. The cross-validatedq• solves some 0 < A < co, and Ik(z)l ? alzl- for all z E R and for some 7 > •. The Lipschitz condition on k rules out the truncatedkernel but allows for most commonlyused nonuniformkernels. [n/2-1] qc= argmin qE[an,bn] Theorem2. Suppose the data-dependentbandwidth where v > (27 satisfies O/q - 1 = op(q-((3/2)v-1)), )/(2r - 1) and q is a nonstochastic bandwidth such that q -+ oo, q"/n -+ 0. Then, under the stated conditions and -+ N(0, 1) in distribuHo, Q(q) - Q(q) = op(1) and Q(Q) tion. 5 {ln(Aj;q) +I(Aj)/fj(Aj;q)}, (4) j=l where [n/2 - 1] is the integer part of n/2 - 1 and the interval[an,b,] is predetermined.The objectivefunction In j (A•j;q) + I(Aj)/l (Aj; q) is the well-knownWhittle approximationfor the negativelog-likelihoodfunction of {it}. Therefore, t• approximately maximizes the log- likelihoodof {St }. The parameter0 canbe real-valued,but aninteger-valued on which in turn The range of admissible0 depends v, • is moreconvenienthere.Thenthe probcan lem be solved k. the is the on or The smaller is (4) v, using grid search. By letting b, -+ 00 larger range depends r of admissibleratesfor q. Forkernelswith boundedsupport as n -+ 00, 0 always will tend to infinity under the altersuchas the BartlettandParzenkernels,anyv > 1 is allowed natives with nonuniformf(w). Under the null hypothesis This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions Hong and Shehadeh:A New Test forARCHEffects 95 of no ARCH,q will convergeto the lower bound a, as the use of the truncatedkernel-that is, uniformweighting. n -+ 00 because the spectrum is flat. Consequently, we also Becausemanynonuniformkernelsdeliverbetterpowerthan let an -+ oo to satisfythe conditionsof Theorem2; namely, the truncatedkernelwhenq is large,we expectthatourtests q, -+ o0. As long as an diverges at a slow rate and bn di- may have greaterpowerthanthese tests for large q. verges at a fast rate, the theoreticallyoptimalqc will fall Suppose that k is the truncated kernel, k(z) = 1 for Izl < in this interval,and q^ will convergeto it (cf. Robinson 1 and 0 for IzI> 1;Q(q) becomes 1991b).For example,with the Daniell kernel,q = Cn1/5 = (BP(q) - q)/(2q)1/2 QTRUN(q) for the alternativesfor which f"(w) exists, where C dependson f(w). Thenqc will convergeto q, underthe alter- given q3/2/n -+ 0, where native as n -+ oo if an/n1/5 -+ 0 and bn/n1/5 -+ 00o. We q note thatthe choices of an and bn are to some degreearbiBP(q) = n PZ2(j) trary,butthesechoicesareof secondaryimportancebecause j=1 qc will convergeto qc for large n. This is similarin spirit to Newey and West's (1994) method.Oursimulationstud- is a Box and Pierce (1970) type of statisticfor the squared ies, like those of BeltrdoandBloomfield(1987) andRobin- residuals.The conditionq3/2/n -+ 0 is strongerthanq/n -+ son (1991b),reveal thatqc is variablein finite samples.It 0. This ensures q-1Cn(k) = 1 + o(q-1/2) and q-1Dn(k) = gives better power,however,than simple "rule-of-thumb" 2 + o(1) for the truncatedkernel.The test BP(q), as shown choices of q, suggestinggains from its use. The procedure by McLeodand Li (1983), is asymptoticallyx2 underHo. can be implementedefficientlyusing the fast Fouriertrans- Intuitively,when q is large, we can transformBP(q) into an N(0, 1) by first subtractingthe mean q and dividingby form (FFT). We now summarizethe proceduresto implementthe the standarddeviation (2q)1/2. Therefore QTRUN(q)can be viewed as a generalizedBP(q) test for largeq. In practice,a test: modifiedbut asymptoticallyequivalentstatistic,originally 1. Obtainany nl/2-consistent estimatorb and save the proposedby Ljungand Box (1978), = residuals Et Yt g(Xt, b). q functionp(j) of 2. Constructthe sampleautocorrelation Et= a S LB(q) = n2 E 1. 3. Choose a kernel k. For example,we can choose the 2(j)/(n - j), j=1 Daniell kernel k(z) = sin(rz)/rz, z e (-oo, 00oo).Its Fourier often is used for testing ARCH. The weights n/(n - j) transform W(A) = 7-11[I1AI< 7r]. are introducedto improvesize performanceand do not af4. Choose qc by the Beltrdo-Bloomfield(1987) cross- fect asymptoticpower. These weights are fundamentally validationprocedure. differentfrom those of our Q(q) test, which affects the 5. Computethe statistic Q(qc) and compare it to the asymptoticpower.The test LB(q) is asymptoticallyequivupper-tailedcriticalvalueC, of N(0, 1) at significancelevel alent to BP(q). It follows that QTRUN(q) is also asymptotrq.If Q(qc) > C,, then one rejects Ho at level qj. ically equivalentto a generalizedversion of LB(q) under Step 4 is not always necessary.If some q is preferreda Ho; namely, = Op(l). priori,one can omit this step. A GAUSS programimpleQTRUN(q) - (LB(q) - q)/(2q)1/2 from us. is available these steps menting We now relate a generalizedversion of Engle's (1982) Althoughthe cross-validatedq, is asymptoticallyoptimal for the estimationof f(w) in terms of an integrated LM test to QTRUN(q). Put U = (Ui,...,un)' and Z = Then the (1,2_ ,...,t2_q)'. weightedmean squarederrorcriterion,it may not be opti- (Z',... ,Z)', where Zj mal for power of Q(q). For hypothesistesting, a sensible LM test criterionis to choose q so as to maximizepower and/or LM(q) = nU'Z(Z'Z)-1Z'/UU/U' minimize size distortion.It can be expectedthat the optimal q chosen this way will generally be different from ic, although they may be of the same order of magnitude. This is a theoretically interesting but technically complicated issue that deserves further investigation; we defer it to future work. Nevertheless, our simulation and application suggest that • often delivers maximal or reasonably good power for Q(q) in finite samples. 3. RELATIONSHIPTO SOME EXISTINGTESTS We now discuss the relationship of our test to some important existing tests for ARCH. We will show that, when q is large, a generalized version of Engle's (1982) LM test, as well as those of Box and Pierce (1970) and Ljung and Box (1978), can be viewed as a special case of our approachwith - nR2 , where R2 is the uncentered squared multicorrelation coefficient from the ARCH regression 4 O+ at 1 + +2'2 + '.. + Qq +tq, t= 1,...,n, with the initial values g2 = 0 for t = -q + 1,...,0. This test is asymptotically x2 under Ho and is asymptotically locally most powerful if the true alternative is ARCH(q) with q fixed (cf. Engle 1982, 1984). Lee (1991) showed that a modified LM(q) test for GARCH(p, q) is the same as the LM(q) test for ARCH(q). Granger and Teriisvirta (1993) showed that the LM(q) test is asymptotically equivalent to BP(q) and LB(q) for fixed q. This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions Journalof Business & EconomicStatistics,January1999 96 Forlargeq it is naturalto considerthe generalizedversion kT(Z) = 1[ zI _ 1] becauseEFF(kB : kT) > 51/2. As done of Engle'stest, by Hong (1996),withina suitableclass of kernelfunctions, the Daniellkernelmaximizesthe powerof Q(q) underHan. = (LM(q) - q)/(2q)1/2 The relativeefficiencyover the LM test does not contraQREG(q) the well-knownresultthat the LM test is locally most dict = (nR2 - q)/(2q)/2 powerfulwhen the true alternativeis ARCH(q)with fixed The following theorem shows that QTRUN(q) is asymptotically equivalent to QREG(q). Theorem 3. Let q - oo, q5/n O0.Then, under and the stated conditions Ho, QTRUN(q) - QREG(q) in -? N(0, 1) distribution. op(l), QREG(q) Like the LM test, QREG(q) can be viewed as a test for the hypothesisthatall q coefficientsof an ARCH(q)model arejointly equalto 0, where q grows with the samplesize n. Intuitively,if the data are homoscedastic,then the variance cannotbe predictedby any past squaredresiduals.If a large variancefor it can be predictedby large values of the past squaredresiduals,however,thenARCHeffects are present.Any stationaryinvertibleprocessin Et with continwell by a truncatedARCH uous f(w) can be approximated model of sufficiently high order. Therefore, QREG(q) even- q; here we consider a different regime; namely, q - 00o. Many earlier applications(e.g., Engle 1982, 1983; Engle and Kraft1983;Engle et al. 1987;Bera andHiggins 1993) used linearlydecliningweighting schemes with relatively long lags. Such weightingschemes also serve to discount past information.But, as pointedout by Bollerslev(1986), these weightingschemesare somewhatad hoc. In comparison, ourweightingdependson the kernelfunctionk, which arisesendogenouslyfrom the kernelspectralestimation. Recently,Lee and King (1993) proposeda locally most meanpowerfulscored-based(LBS)test for ARCHthatexploits the one-sidednatureof the ARCHalternative.Their is test statisticfor ARCH,whichis robustto nonnormality, given by ((n-- LBS(q) q) (2 / _- 1) Eq1 q) 2-i -1Zt-i) t- _ tuallywill capturea wide rangeof alternativesas morelags [Et(^21 2_ \1)2] Et Eq 2 ) of g2 are includedwhenn increases.This test is convenient - (Et Eql 2-i)2}1/2 to implement,butit cannotbe expectedto be fully efficient, (5) as we show next. The preceding discussions show that the tests of UnderHo, LBS(q)is asymptoticallyone-sidedN(O,1). ObQTRUN(q), QREG(q), BP(q), LB(q), and LM(q) put unialso puts uniformweight on all q sample form,or roughlyuniform,weightson all q sampleautocor- viously, LBS(q) of the squaredresiduals.Consequently,it autocorrelations relations.Intuitively,this mightnot be the optimalweightin detectingthe ARCH alternabe not efficient fully may ing scheme.For most stationaryARCHprocesses,the au- tives whose autocorrelation p(j) decays to 0 as the lag j tocorrelationdecays to 0 as the lag increases.Intuitively,a increases. we Therefore, expect that Q(q) may be competbetter test should put greaterweight on lower-orderlags. in with some itive cases, as illustratedin our folLBS(q) We thus expect that tests based on nonuniformweightsimulations and empiricalapplications.Comparing ing may deliver better power than BP(q), LB(q), LM(q), lowing and fo(w) at frequency0, Hong (1997) constructeda f(w) Pitman's relative and efficiency QREG(q). Using QTRUN(q), test thatexploitsthe one-sidednatureof the ARCHalternacriterion,it can be shownthatthis is indeedthe case. Contive and uses a flexible weightingscheme.AlthoughQ(q) siderthe followingclass of local ARCHalternatives: does not exploitthis, it is asymptoticallymoreefficientthan Hong's (1997) test becauseQ(q) can detecta class of local an alternativesof O(ql/4/nl1/2). Hong's (1997) test only can Han: ht 2 1+?(ql/4/nl/2)Eaj detect a class of local alternativesof O(ql/2/nl1/2). The j=1 weighting schemes of Q(q) and Hong's (1997) test also where aj 2 0 and -jl, aj < o00, which implies aj -+ differ. 0 as j -+ 00. Without loss of generality, we assume 1 a•j < for all n to ensure positivity of ht. Following reasoning analogous to (but more tedious than) that of Hong (1996), it can be shown that, if q2/n -+ 0 and q -+ co, Q(q) -+ N(p, 1) in distribution under Han with Note noncentrality parameter p = (Z7= aJ)2/{D(k)}1/2. that, whenever ARCH exists-that is, at least some aj > 0 always has nontrivial power because p > 0. exist-Q(q) With q = CnY, where 0 < y < 1 and 0 < C < o00, the relative efficiency of the Q(q) test using kernel k2 with respect to using kernel kl under Han is given by (q1/4 /n1/2) -'=1 EFF(k2 : kl)= (D(k2)/D(kl))1/(2-Y ) Therefore, the Bartlett kernel kBg(z) = (1 - zl)1[zl _ 1] is about 120% more efficient than the truncated kernel 4. MONTE CARLO EVIDENCE We now study the finite-sample performances of our tests in comparison to a variety of existing ARCH tests. Consider the following DGP: Yt = X'bo + t, t = tt/2 where Et - NID(0, 1) and Xt = (1,mt)' with mt = and vt Amt-1 + -vt NID(0, o-). This model was first used by Engle, Hendry, and Trumble (1985). We consider four processes for ht--(1) ht = w, (2) ht = w + a12_l, i 2t- /i2, and (4) ht = (3) ht = w + a(E 1 1/i2)-1 w + aEt-1 + pht_1. Under (1), ARCH is not present. This permits us to examine the sizes. Alternative (2) is an ARCH(1) process often examined in existing simulation studies (e.g., Engle et al. 1985; Diebold and Pauly 1989; Luukkonen, Saikkonen, This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions Hong and Shehadeh:A New TestforARCHEffects 97 and Terasvirta1988;Bollerslevand Wooldridge1992;Lee andKing 1993).Alternative(3) is an ARCH(8)processwith weights decliningat a geometricrate as the lag increases. Finally, alternative(4) is a GARCH(1, 1) process. The GARCHmodel has been the workhorsein the literature. We set b, = (1, 1)' and w = 1. For the exogenousvari- 25 S.........BDS1 20 able mt, we set A = .8 and ao2= 4. As done by Engle et al. (1985), mt is generatedfor each experimentand then held fixed from iterationto iteration.For alternatives(2) and (3), we considertwo values of a, .3 and .95. For alternative(4), we choose threecombinationsof (a, r0)-(.3, .2), (.5, .2), and (.3, .65). We considersamplesizes of n = 2" for m = 6, 7, 8, and 9. These samplesizes are chosen for conveniencebecausewe employthe Cooley-TukeyFFT algorithmin our cross-validationprocedure.(Ourprocedure is applicablewhenn is not a powerof 2, thoughat some sacrifice of computationalefficiency.This sacrificeis costly in simulation.)We set Ec = 0 for t < 0 and ho = 1. To reduce the possible effects of these initialconditions,we generate n + 100 observationsand then discardthe first 100. The iterationnumberis 10,000 for (1), and 1,000 for (2)-(4). A GAUSS-386random-number generatoris used. We choose the Daniell kernel for our tests, QDAN (q) and QDAN(q), and also considertheir cross-validatedversions, Qcv = QDAN (c) and Qcv = QDAN(qc). We only study the powers of Qcv and QDAN(q) becausetheir null limit distributionsare unknown.We compareour tests to Engle's LM(q), Box and Pierce's BP(q), Ljung and Box's LB(q), Lee and King's LBS(q), the truncatedkernel-basedtest QTRUN(q), the generalized LM test QREG(q), and BDS, (q) for q = 1,..., 20. We include BDS.y(q) because it has been popularin the literaturepartly for its capacity to detect ARCH alternatives.This test involves choice of a distance parametery E (0, 1) in terms of the data spread.To study the impactof y on size andpower,we considertwo values, 7 = 1/4 and 1/2. (We use the BDS code of D. Dechert.) For economy, we reportresults for n = 128 in detail and briefly mention results for other sample sizes. Figure 1 shows the sizes at the 5% level. The QDAN(q) test performswell for all q and overrejectsonly slightly. For large q, BP(q), QTRUN(q), LM(q), and QREG(q) all tend to underreject,most seriouslyfor LM(q). The LB(q) test underrejectsfor small q andperformswell for mediumand large q. The LBS(q) test has reasonable size for small q, underrejects for medium q, and overrejects for large q. The BDS,(q) test significantly overrejects for the two choices of 7. Finally, the cross-validated Qcv gives a rejection rate of 7.79%, illustrating some overrejection at the 5% level. (To facilitate comparison, the result for Qcv is reported in each figure as a horizontal line, the same value for each q. Since 4• is chosen endogenously it may vary for each iteration of the simulation.) Comparison of QDAN(q) and Qcv indicates that this overrejection arises from the additional noise of cross-validation. (Here, the mean of j0 is 3.8 with a standarddeviation of 3.7; the empirical distribution of q0 is left-skewed.) As will be seen, the gain in power from using 0• may compensate for this moderate overrejection. At the 10% and 1% levels, the size of LBS(q) exhibits a U-shaped "BDSI 1at5% 4 2the 3 Level. 8 5 7 6 The simulated 11is 12 9 10 model yLB=13 114 + 15mt 16+ LBS 2 3 4 5 20 19 t, 18where LM 0 1 17 6 7 I 8 I 9 I I I 10 11 12 13 14 15 16 17 18 19 20 I I I Lag(q) Figure 1. EmpiricalSize: RejectionRates of Tests Underthe Null at the 5% Level. The simulated model is Yt = 1 + mt + Et, where Et - , 1), mt + = = + vt, t ht = .8, and vt N(O, - ,thtllP th112,ht N(O, i), mt == Amt-l vt, A == .8, N(O, 1- - N(_1, t Imt I and"vt 4). Calculationsare based on a sample size of 128 and 10,000 replications. TheresultforQcv (forwhich;Tcis chosen viacross-validation and, therefore,may change foreach iteration)is includedforcomparisonas a horizontalline. curve, performingbest for small q. At the 10%level, the finite-samplecorrectionagainimprovesthe sizes of LB(q) versusBP(q),while it inducessome overrejectionfor LB(q) in the 1%level. Cross-validationfor Qcv deliversa rejection rate of 11.6%at the 10%level, and a strongrejection rate of 4.2% at the 1%level. The LM(q) test underrejects at bothlevels. The sizes of the tests improve,to varyingdegrees, in largersamples.For n = 512 andthe 5% level, the Q tests have sizes between5%and6%;the othertests have sizes between 3% and 5%. The cross-validatedQcv test still exhibitssome overrejection,7%at the 5%level. At the 10%level, the sizes of all of the tests are similarandrange between7%and 11%.At the 1%level, LBS(q),LB(q),and BP(q) performwell, close to 1%.The Q tests show some overrejectionbut never exceed 2.5%;Qcv remains3.5%. The LM(q) test exhibitssome underrejection. As suggestedby asymptotictheory,the sizes of Qcv improve as the samplesize n increases,but very slowly. In a finitesample,one can use bootstrapto obtainmoreaccurate sizes for Qcvy. Givena sampleof size n, we firstobtainthe ordinary least squares (OLS) residuals ut = Yt- X/b, where b is the OLS estimator for b0. We then draw a sample of residuals {'24} of size n, with replacement, from the empirical distribution function of {it}. Next, we obtain a sample i4. On each sample {Yt*} of size n, where Y1* = Xb {1Y*}, we perform the regression described previously and submit the regression residuals to each test. We repeat this for 1,000 iterations and obtain 1,000 bootstrap test statistics, which are used to form the bootstrap empirical distribution function for each test. We then compare the actual test statistics to the bootstrap empirical distribution functions to find the bootstrap p values. The bootstrap sizes of Qcv at the 10%, 5%, and 1% levels are 12.2%, 5.3%, and 1.5%, respectively. These are significantly better than the This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions 98 Journalof Business & EconomicStatistics,January1999 sizes using asymptoticcriticalvalues,especiallyat the 1% level. Figure 2 shows powers against ARCH(1) using the 5% empirical critical values. The empirical critical values are obtained from the iterations under (1). First, LBS(1) is slightly more powerfulthan LM(1), LB(1), and QDAN(1), suggesting some gain from exploitingthe onesided nature of the alternative.The tests LM(1), LB(1), QDAN(1), QTRUN (1), and QREG (1) have the same power, slightlybetter than QDAN(1) and BDS1/4(1). Next, the powers of all the tests except QDAN(q) decreaserapidlyas q increases,most dramaticallyfor LBS(q) and BDS1/4(q). 50 40 BDS 30 LBS . .. QCV 20 BDS BP andQMI LB f7 LM and QI 10 - 80 I 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Lag(q) (a) 120 ------------------------- -.... "100 SBP andQ "80 E? -- --" .-- BDS . -. - - LMandQ , 40 L BDSBDS (a) 120 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 100 -- - . QD- ----80 IBP andQ.. rates are based on the size-correctedempiricalcriticalvalues for the 5% level. The resultforQcv (forwhich?, is chosen via cross-validation and, therefore,maychange foreach iteration)is includedforcomparison as a horizontalline. (a) a = .3; (b) a = .95. SBDS ._.. -.._ BDS 20 0 .. -------- I I LBS The tests LM(q), LB(q), BP(q),QTRUN(q), QREG(q), and For > q 1, QDAN(q)is the most QDAN(q) perform similarly. -.- I I Figure 3. EmpiricalPower: Rejection Rates of Tests Under the ARCH(8)Alternativeat the 5% Level. The simulatedmodel is yt = 1 LB LMandQ 40 (b) I I I I I I I I I 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Lag(q) (b) powerfultest. Its power loss as q increasesis smallerthan the losses of the other tests because QDAN (q) discounts The of is higher-orderlags. power BDS1/4(q) comparable to that of LBS(q) except for small q: BDS1/2(q) has low power for small q but performs better than BDS 1/4(q) for q > 8. (We report y = 1/4 becausethis choice appearsto maximizethe powerof BDS for n 128 in most cases. We know of no theoreticalbasis for choosing For large critica.) ,forthe as in Figure2(b), the power of QDAN increases slightly (q) Figure 2. EmpiricalPower: Rejection Rates of Tests Under the ARCH(1)Alternativeat the 5% Level. The simulated model is yt = 1 + mt + Et, whereEt - 5th112,ht = 1 + ar2t1, Jt N N(O,1), mt = Amt_1 + vt, A = .8, and v t N(O, 4). Calculations are based on a sample size of 128 and 1,000 replications.Rejectionrates are based on the size-correctedempiricalcriticalvalues for the 5% level. The result forQcv (forwhichqc is chosen via cross-validationand, therefore,may and then decreases as q increases. This is not surprising; change foreach iteration)is includedforcomparisonas a horizontalline. (a) a = .3; (b) a = .95. the spectraldensityof an ARCH(1)process includescon- This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions 99 Hongand Shehadeh:A New Test forARCHEffects tributions from q > 1 that are substantial for large a. Finally, we note that the cross-validation delivers reasonable power for Qcv, although a little lower than those of LM(1) and LB(1), which use the correct information that the true ARCH order qo = 1. The power of Qcv is low, 31% and 80%, respectively. (We do not include Qcv in the figures.) Next, Figure 3 reports powers against ARCH(8). In Figure 3(a), QDAN (q) has the highest power for all q, including q = 8. The tests QDAN(8) and BDS1/4(8) perform similarly. The Qcv test has roughly the same or better power than QDAN(8); CV only has power 27%. The power of QDAN(q) increases slightly and then decreases as q increases. Although LBS(8), LM(8), BP(8), and LB(8) are asymptotically locally most powerful against ARCH(8), none of them achieves its maximum power at q = 8. The powers of LBS(q), LM(q), LB(q), QTRUN(q), 80 LBS 70 -..V 60 50 BPand 30 20 10 1 2 4 3 5 6 8 7 9 10 11 12 13 14 15 16 17 18 19 20 and QREG(q) attain their maxima at q = 1 and then decrease as q increases. Again, LBS(1) has better power than the related tests at q = 1. The tests LM(1), LB(1) and QDAN(1) have similar power. As q increases, the power of LBS(q) drops dramatically. The tests LM(q), BP(q), LB(q), QTRUN(q), and QREG(q)perform similarly. BDS1/4(q) performs quite well for small q but is dominated by BDS1/2(q) for large q. Again, the power of QDAN(q) declines slowly. In Figure 3(b), BDS1/4(q) dominates QDAN(q) for q < 6. Figure 4 reports powers against GARCH(1, 1). First, we examine Figure 4, (a) and (b), in which a changes but / is fixed. For (a, /) = (.3, .2), the power patterns and ranking of the tests are similar to those under ARCH(1). In particular, all the tests have maximum power at q = 1, except QDAN(q) and QDAN(q), which have maximum power at q = 2 and 3, respectively. LBS(q) is the most powerful at q = 1, and the other tests have similar power. [As pointed out by Lee (1991) and Lee and King (1993), LM(1) and LBS(1) are the appropriateLM and LBS tests for both ARCH(1) and GARCH(1, 1).] For q > 2, QDAN(q)has the highest power. The power of Qcv is close to QDAN(1).For (a, /) = (.5, .2), we note two primary differences from the preceding. First, the power curves of BDS, (q) and QDAN (q) are concave and each achieves its maximum for q > 5. Second, both the advantage of LBS(1) over the other tests and the cost of using a cross-validated q, are smaller. Last, Figure 4(c) contains results for (a, p) = (.3, .65), where a +/3 is close to 1, a characteristic common in empirical research. All of the power curves now become concave; none of the tests achieves its maximum power at q = 1. Indeed, it is reasonable that LM(1) and LBS(1) cannot be expected to be optimal here. For persistent GARCH, it is better to include more lags in the tests. Here, the power of QDAN(q) decreases very slightly even for large q and achieves its maximum power at q as large as 7. As /3increases, the relative power ranking of the tests remains much the same, though the concavity is more pronounced. Finally, we note that for most GARCH(1, 1) processes, cross-validation delivers Qcv with reasonable power, performing better than LM(1), LB(1), and BP(1) in every case, while tcv has low power. [We also examined power against several other parameter values and variance processes including ARCH(4) and Q S40 Lag (q) (a) 100 ioo - --- Q- -.. • BP and Q .DA LB 60 LMand QRED . 40 20 BDS LBS 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Lag (q) (b) 100 80 BP and Q .Tn QDA 60 BDS" "LBS 40 BDS, 20 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Lag (q) (c) Figure 4. Empirical Power: Rejection Rates of Tests Under the GARCH(1, 1) Alternative at the 5% Level. The simulated model is Yt = + pht-1, Ct N N(0, 1), 1+ mt + et, where et = 5th1/2, ht =1 + e w mt = Amt_1 + vt, A = .8, and vt N(O, 4). Calculations are based on a sample size of 128 and 1,000 replications.Rejectionrates are based on the size-correctedempiricalcriticalvalues for the 5% level. The result forQcv (forwhichqc is chosen via cross-validationand, therefore,may change foreach iteration)is includedforcomparisonas a horizontalline. (a) (0, P) = (.3, .2); (b) (0, 3) = (.5, .2); (c) (0, 3) = (.3, .65) This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions Journalof Business & EconomicStatistics,January1999 100 ARCH(12),as well as the Student'st and gamma distributionsfor ?t. The resultsare largelysimilarto those presentedpreviously.For some of these additionalresults,see Hong and Shehadeh(1996).] In all of our simulations,the mean of q^increaseswith the sample size and ARCH persistence.For Figure 2(a) (a = .3), the mean of q^is 5.8 with a standarddeviation of 3.9. For n = 512, the mean and standarddeviationare 7.7 and 4.7. For Figure4(c) [(a, 0) = (.3, .65)], the means and standarddeviationsof q^are 11.1 and 5.0, respectively, for n = 128, and 17.2 and 3.6 for n = 512. The distributions of q^areskewedleft for weakpersistenceandbecome increasinglysymmetricunderstrongerpersistence. Summing up, we can draw the following conclusions from our simulation: 1. Both QDAN(q) and LB(q) have reasonablesize and performbetter than the uniform-weighttests for most q. The QREG(q)test has bettersize performancethan LM(q) for all q;QTRUN(q)has bettersize performancethanBP(q). BDS, (q) has poor size performance,overrejectingthe null hypothesisin most cases. 2. For each q, our test with nonuniform weights, QDAN(q), has power as good or better than the uniformweightingtest, QTRUN(q),againstall the alternativesunderconsideration.LM(q),BP(q),LB(q),andQREG(q)have powerssimilarto QTRUN(q). 3. Lee and King's LBS(q) test often has the best power for q = 1. For large q, however,LBS(q) is dominatedby all of the other tests, independentof the true alternative. The supremum-normtest QDAN(q) and BDS1/4(q) have the best power for persistentARCH alternativessuch as ARCH(8)but performless well againstGARCHalternatives. For large q, QDAN(q) is often the most powerful. 4. ForpersistentGARCH(1,1) processes,the LM(1)and LBS(1)tests do not havethe best power.Instead,thesetests have betterpower when more lags are employed.On the otherhand,the powerof QDAN(q) is only slightlyaffected by the truncationlag numberq over ratherwide rangesof q and often is the greatestof all the tests. 5. For our test Qcv, cross-validationexhibitssome size overrejectionbut alwaysdeliversgood power.To some extent, cross-validationrevealsinformationaboutthe truealternative.The costs associatedwith misspecificationof an unknowntrue alternativeappearhigh for all of the other tests, thoughlower for QDAN(q). As opposedto Qcv, tcv performspoorlyrelativeto QDAN(q) in each experiment. 5. EMPIRICALAPPLICATIONS We now applyour tests to two datasets,the U.S. implicit pricedeflatorfor grossnationalproduct(GNP)andthe daily nominalU.S. dollar/Deutschemark exchangerate. 5.1 U.S. GNP Deflator We considerinflationas measuredby the log changein the quarterlyimplicit price deflatorof U.S. GNP, 7lt = In(GDt/GDti). (These deflatordata are reportedby the U.S. Bureauof Economic Analysis.) Among others, En- gle and Kraft(1983) and Bollerslev (1986) examinedthis series in detail. These researchersreportedthat the conditionalmean E(7rtlt-1) appearswell specifiedby an autoregressive(AR) model.Standardunivariatetime series methodsleadto identificationof the followingAR(4)model for 7rt: 7rt= .148 + (.076) .355 7rt-1 + .265 (.090) (.094) + .199 (.093) Wt-3+ 7rt-2 .059 (.088) rt-4 + Et The model is estimatedon quarterlydata from the second quarterof 1952 to the firstquarterof 1984, a total of 128 observations.OLS standarderrorsappearin parentheses andthe R2 = .648.The p valuesof the LM,BP,andLB tests for serialcorrelationin the meanare .80 or greaterfor lags between 1 and 20, suggestingan absenceof autocorrelationin the residuals.[Inthe presenceof ARCHeffects, these standardtests for serialcorrelationin meanwill tend to overrejectthe null hypothesisof no serial correlation. This fact does not changethe conclusionhere. But, for an earlierperiodthe BP,LB, andLM tests for serialcorrelation in meanareall significantat the 10%level for a lag of 5. As Bollerslev(1986, p. 323) noted,"[f]romthe late 1940suntil the mid-1950s the inflation rate was ... hard to predict."Al- thoughthe heteroscedasticityin this periodis striking,the mean is difficultto specify. We thereforeomit this period and begin our analysisat the earliestdate at which serial correlationin meanis not evidentwiththese tests.]Wenote thatthe dynamicregressionmodelhereis differentfromthe static regressionmodels with exogenoustime series variables used in the simulationexperiment.Nevertheless,one can expect that ARCH tests will performsimilarlyunder both the regressionmodels with the same ARCH alternative because ARCH tests, based on the squaresof residuals, are not very sensitive to the estimationeffect even in moderatelysmall samples.The estimationeffect has an asymptoticallynegligible impact on both size and power of ARCH tests, whetherthe regressionmodel is static or dynamic. We apply our tests, as well as those of Engle (1982), LB, BP, BDS, and Lee and King (1993) to the estimated squaredresidualsfrom the precedingregression.In Table 1, we report the p values for the tests at different q. With use of the Daniell kernel, cross-validation delivers •? = 6. Our test Qcv = QDAN(gc) = 1.901, giving a p value of .028 using the asymptotic critical value. Thus, our test suggests that ARCH effects exist at the 5% significance level. Engle's LM test statistic LM(O•) = 11.00. This gives a p value of .09; the LM statistic is just significant at the 10% level. Lee and King's test LBS(O•) = -2.75. This has a p value equal to .997, delivering an opposite conclusion. This results from the fact that the sample autocorrelationsof the squared residuals are negative at some lags, although these negative autocorrelations are not significant for any given lag at reasonable levels. Likewise, BDS1/4(4c) = -.12 which fails to reject the null. As can be seen in Table 1, the choices of q = 4, 8, and 12 deliver significant p val- This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions Hong and Shehadeh: A New Test for ARCH Effects 101 Table2. TestStatisticsforARCHEffectsin the Residualsof the MA(1)Regressionfor the DM/U.S.DollarExchangeRate Table1. p ValuesFromTests forARCHEffectsin the Residuals of the AR(4)Regressionof the U.S. GNP Deflator q 1 4 6 8 12 16 BP .442 (.439) .046 (.039) .104 (.077) .172 (.101) .394 (.263) .544 (.361) LB .437 (.439) .040 (.039) .091 (.077) .150 (.106) .351 (.281) .479 (.395) LM .511 (.515) .076 (.055) .088 (.064) .154 (.104) .367 (.282) .340 (.234) LBS .253 (.225) .912 (.863) .997 (.989) .999 (.983) 1.000 (.987) 1.000 (.945) QTRUN QDAN .614 (.439) .022 (.039) .096 (.077) .187 (.101) .446 (.263) .589 (.361) .547 (.362) .052 (.048) .028 (.037) .032 (.041) .070 (.062) .138 (.094) BDS1/4 q BP LB LM LBS QTRUN QDAN .792 (.832) .516 (.587) .905 (.937) .724 (.768) .422 (.619) .007 (.075) 10 20 29 30 40 50 155.50 188.66 191.88 193.69 205.09 208.58 155.92 189.33 192.60 194.44 206.05 209.63 111.45 120.30 125.61 126.30 140.78 144.11 8.11 7.43 6.29 6.32 5.38 4.55 32.53 26.67 21.39 21.13 18.46 15.86 40.44 37.04 34.00 33.62 30.49 27.91 BDS1/4 7.44 9.43 9.72 9.74 9.76 9.87 NOTE: Cross-validation generatedqc = 29. UnderH0, BP,LB,and LMare Xq;BDS1/4,LBS, are N(O, 1). QTRUN,and OQDAN mated a GARCH(3,3) and appliedthe tests to these new standardizedsquaredresiduals.The GARCH parameters = were significant at any reasonablelevel, and all of the NOTE:The bootstrapp values appearin parentheses.Cross-validation generatedqc 6. tests failed to reject the null of homoscedasticityat any ues (smallerthan 10%) for QDAN(q), but the choices of conventionallevel.] Beyond this, we see that QDAN(q) is q = 1 and 16 deliver insignificantp values (largerthan much largerthan Lee and King's LBS(q) and BDS1/4(q). 10%).This suggeststhe importanceof a properchoice of q. Moreover,our QDAN(q)is always larger than the trunWe see thatthe cross-validatedq^gives the smallestp value cated kernel-based test QTRUN(q)for all q, which can be viewed as a normal approximationof Box and Pierce's here. We also performa bootstrapprocedureto bettercompare BP(q) test andhas approximatelythe samepoweras BP(q) the tests in this empiricalsetting.The bootstrapresultsare and LB(q). Of special interesthere is the long lag, q^ = 29, chosen largelysimilarto those using the asymptoticcriticalvalues andarereportedin parenthesesin Table1. In particular,the by cross-validation.This resultis consistentwith otherevidence of strongpersistencein the conditionalvolatilityof relativerankingof the tests is unchanged. we esti- high-frequencydata.We have Qcv = QDAN(qc) = 34.00, [Toconfirmthe sourceof the heteroscedasticity, matedan ARCH(3)model, in which only the interceptand significantat any level. In particular,as we allow the samcoefficienton q = 3 were significant.We appliedthe ARCH ple size to grow, q, grows as well. For sample sizes of tests to the new standardizedsquaredresidualsand failed n = 2m with m = 6,7,..., 11 and a common start date, to reject the null of homoscedasticityat reasonablelevels cross-validationyieldedq^= 1, 12, 8, 11, 19, and29, respecfor q < 20. To check robustnessof the obtainedresults,we tively.Thistrendis consistentwith ourrequirementsfor the appliedthe tests to some later samples of the same size. theoreticalresults. The conclusionsremainlargelyunchanged.] Again, we performeda bootstrapof each test. The conclusion using our test is the same based on the boot5.2 Exchange-RateData strapcriticalvalues.In threeothercases, the conclusionis We now considera high-frequencyseries,the dailynomi- changed.For lags greaterthan20, the LM(q)and QREG(q) nal Deutschemark/U.S.dollarexchangerate.The data,ob- tests arenow significantonly at the 5%level. Withthe boottained from Chicago MercantileExchange,cover the pe- strapcriticalvalues, LBS(q) is significantat the 1%level riod June 8, 1983, to July 24, 1991-2,048 observations. for q < 16, at the 5% level for q < 20, and at the 10% As noted by Bollerslev et al. (1994), most researchers level for q < 25 and q > 35. This is consistentwith the have reacheda consensusthat nominalexchangerates are U-shapedsize curvewe observedfor LBS(q)in the Monte modeledbest as integratedprocesses.We thereforemodel Carlosection. st = 100 I1n(DMt/DMt_1)as a first-ordermoving average [MA(1)], where DMt is the spot exchange rate on day t. Quasi-maximum likelihood estimation of this process yields 6. CONCLUSIONS We propose a new ARCH test based on the sum of weighted squares of the sample autocorrelations, with the weights depending on a kernel function. Typically, the st = .0190 - .0149 et-1 iet.+ kernel function gives greater weight to lower-order lags. (.0149) (.0210) When a relatively large lag is employed, Engle's (1982) We note that neither the constant nor the MA(1) parameter LM test, as well as the tests of Box and Pierce (1970) is significant, consistent with the integration hypothesis. and Ljung and Box (1978), is equivalent to the trunWe test for ARCH effects as in the previous example cated kernel-based test that imposes a uniform weighting using the estimated squared residuals. A summary of our scheme and is less efficient than nonuniform kernel-based results appears in Table 2. All of the tests in Table 2 are tests. Our method permits choice of an appropriate lag significant at any reasonable level for q = 10, 20,..., 50 us- via data-drivenmethods-for example, Beltrio and Blooming the asymptotic critical values. Therefore, all the tests field's (1987) cross-validation. Simulation studies show that under consideration deliver the same conclusion. [Rejec- the new test performs reasonably well in finite samples. tion here is not surprising given the sample size. We esti- The test also is applied in two empirical examples. These This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions 102 Journalof Business & EconomicStatistics,January1999 n-1 empiricalexamples illustratesome merits of the present test. Z k2(j/q)-2(j) Some directionsfor furtherresearchmay be pursued.As j=1 n-1 is well known, it is importantto check the adequacyof = k2(j/q)2(j)/r2 an estimatedGARCHmodel. For example,misspecifica(0) tion of a GARCH-Mmodel will lead to inconsistentquasi j=1 n-1 maximumlikelihoodestimation.Our approachcan be ex_ + {f-2(0)tendedto checkadequacyof a GARCHmodel.If the model T2((0)} k2(j/q)r2(j) is correctlyspecified,then the standardizedsquaredresidj=1 n-1 ual is a white-noiseprocess;otherwise,thereis evidenceof + -2(0) k2(j/q)2(j) - f2(j)}. (A.1) misspecificationfor GARCH.We conjecturethat the proj=1 posed test also will be asymptoticallyN(O,1) undercorrect specificationof the GARCH,but the technicalityinvolved (q/n) appearsnontrivial.These considerationsare left for subse- By Markov'sinequalityand n-1k2(j/q)Ef2(j) where quentwork. r2(0) {q-1E k2(j/q)}= O(q/n), q- j:lnk2 = Op (j/q) --+ f k2(z) dz, we have •?1k2(j/q)i2(j) ACKNOWLEDGMENTS (q/n). Moreover,i(0) - r(0) = Op(n-1/2). It follows that We thanka coeditor,an associateeditor,three referees, the secondtermin (A.1) is Op(q/n3/2) = op(ql/2/n) given and seminarparticipantsat CornellUniversityfor helpful q/n -+ 0. The last termis also op(ql/2/n) by LemmaA.1. commentsandsuggestions.Any remainingerrorsare solely Therefore,we have attributable to the authors.We also thankJau-LianJengfor n--1 us providing with data. (j) Sk2(j/q)2 APPENDIX:MATHEMATICAL PROOFS j=1 n-1 For rigor and completeness,we state explicitlythe con= S (A.2) k2(j/q)i2(j)/r2(0) + op(ql/2/n). ditionsfor the theorems. j=1 = is with AssumptionA.1. (a) {(t} iid, E(?t) 0, E(t2) = 1, and E(t8) < 00. (b) ?t is independentof X, for s By Hong (1996, theoremA.2), AssumptionsA.1 and A.4, < t. and q -+ oco,q/n -+ 0, we have R1, where AssumptionA.2. (a) For each b E B n-1 1 E N, g(., b) is a measurablefunction, and (b) g(Xt,.) 2 - Cn n )ik2(j/q) is twice differentiablewith respect to b in an open con(k) (j)/r2(0) vex neighborhoodN(bo) of bo E I, with lim,,, n-1 Et1 ESUPbEN(b,) IVbg(Xt,b)ll4 < 00 and limno n-1 Etn= + (2Dn(k))1/2 _d N(0, 1). (A.3) EsupbEN(bo) IVg(Xt, b)!2 < 00, where Vb and V2 are the gradientand Hessianoperators,respectively. Both (A.2) and (A.3), togetherwith q-1Dn(k) -+ fOO k4 Assumption A.3. n1/2(b - bo) = Op(l). AssumptionA.4. k: R - [-1, 1] is a symmetricfunction (z) dz, implythatQ(q) _+dN(0, 1). The proof will be comcontinuousat 0 and all but a finite numberof points,with pleted providedLemmaA.1 is proven. Proof of LemmaA.1. Noting ri2(j) - 2(j) = (?(j) k(0) = 1 and fo k2(u)du < 00. < A.5. For IR, f(j))2 + 2f?(j)(^(j)- f(j)), we write Assumption any z1,z2 E !k(zi) k(z2)l - z21 for some 0 < A < oo andIk(z)l? Alzl-' for Alzl n-1 all z gR E and some 7 > . 5 k2(j/q){C2(j) . - i2(j)} Theorem1. Suppose AssumptionsA.1-A.4 hold. Let j=l q - oo, q/n -+ O.Then under Ho, n--1 = Q(q) _d N(0, 1). To proveTheorem1, we first state a lemma. Lemma A.1. For j _ 0, define f(j) = n-1 1 (2_j - 1). Then + 2 -=j+l(s2 _ n-1 Sk2(j/q){2(j j=1 _ 2(j)} = 5 j=l op(ql/2/n). Proof of Theorem 1. Put r(j) = Ef(j). We can write k2(j/q)(?(j) - e(j))2 S j=l k2(j/q)i?(j)(?(j) - i(j)). (A.4) We now considerthe firsttermof (A.4).By straightforward algebra,we have i(j) - i(j) t=j+l This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions 103 Hongand Shehadeh:A New Test forARCHEffects given A.1-A.4, where q-1 n-1 k2(j/q) and n-1 E• 1 IIVbg(Xt,b)114= Op(1). n j) (t2 - 1)( t-j -_t- n-1 k2(z) dz Next, noting t - Et = (bo - b)'Vbg(Xt, b) + b)'V g(Xt, b)(bo b) for some b, we have t=j+1 n t=j+l (bo - n-1 ^2 ( (_?2)( +-1 f -+ 5 2j k2(j/q)22(j) j=1 t=j+1 = At (j) + A2(j) (A.5) say. A3(j), -1 = - utEt-(t-j k2(j/q) j=1 t-j) t=j+l n-1 Recall that it = it/&7n, t = et/co, and ut =-2 have 5 - 1. We - j112 k2(j/q) < 411bo j=1 2 n 2-1 1+lj (/)t2- 1•2 x n-1 • j E t=ij+l utEt-jVbg(Xt-j,bo) n--1 t=j+l Z - b114 + 2 jlbo .(_1W) tE an 0-1 E-2 k2(j/q) S2 j=1 t=j+l + ( .2 t= X 7-1 u•uEt_ g(Xt-,b) 3t-j3. (o').-1 - t=j+l t=j+ n 2 n-(-1)t=j+l A, (A.8) = Op(q/n2) j) _•-_j by Markov'sinequalityand Elln-1ZEnj+1Utt-jVbg t=j+l (Xt-j,Ibo)ll2 + (& 0•o)n-1 UtEt-j• n = O(n-1) given E(utlLt-1) = 0, where Lt-1 is the informationset availableat periodt -1. We also have /(2 t=j+l + ( 1(j) + 222B12(j) -2 _ = ao2)!Bl3(j), say. (A.6) t=ij+ j=1 j=1 -2 = UtEt-j n-1 2(jq) = Sk2(j/q)B3(i) (A.9) Op(q/n) by Markov's inequality and E(n-1 E j+l UtE•j)2 O(n-1) by AssumptionA.1. = By the Cauchy-Schwarz inequality and noting ?t - et jb- bol <IlIb- bo , (bo - b)'Vbg(Xt,b) for some b, where we have Combining (A.6)-(A.9) with an Op(n-1/2), 2 we obtain n-1 Sk2(j/q)A(j) = Op(q/n2). (A.10) j=1 7n-1 j=1 Similarly, we have Sk2 k21(j) =-1 n--1 5 k2(j/q)A 0/n_-+ <^bo211 We now turn to Ij=l-n 1k2(j/q)A(j). By the CauchySchwarz inequality and noting t - t = a•2(?t - t) + (&2 - -2)et, we have 2u(-1 k2 J t=l j=1 x n-1l = Op(q/n2), I (A.11) j=1 t=j+l j=1 (j) = Op(q/n2). n sup la3(j)12 _n-l I Vbg(Xt, b) l<j<n-1 4 (A.7) = •Zn-1 t)2 t=l This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions (I - (• - )2 t=l 104 Journalof Business & EconomicStatistics,January1999 n -2)2 n-1 E + 2(-2 Proof of Lemma A.2. Let integer d = [q"], where [x] denotes the integer part of x. Then d/n d/q -- co. O0, 4 t=1 Write = Op(n-2) d given AssumptionsA.1-A.3. It follows that Cn((k) = Cni(k)+ E Sk2(j/q) A3(j) = Op(q/n2). k2(j/q)} n-1 (A.12) + j=1 n-1 n-1 - f(j))2 = Op(q/n2). k2(j/q)((J) ((1-j/n)k2(j/l) j=d+l Hence, from (A.5) and (A.10)-(A.12),we have E j=1 - (1 - j/n){k2(j/l) j=1 n-1 (A.13) S (1-j/n)k2(j/q) j=d+l = Cn (k) + C + 02n - 3n, say. (A.15) = OF(q/n), (j/q) 2(j) = - j=l k2(jq)f2(j) On the otherhand,becauseEn-1 Op(q/n), We show that the last three terms are op(ql/2). First, we we have considerC2n. Given jk(z)j < Alzl-r in AssumptionA.5 n-1 and O/q--+P0, we have - (j))f(j) = op(ql/2/n) (A.14) E k2(j/q)((J)( j=1 n-1 ? by the Cauchy-Schwarzinequality,(A.13), and q/n -+ 0. The lemma then follows from (A.4), (A.13)-(A.14), and j-27 IC2Tn[ A2q2r(/q)27 q/n -+ 0. This completes the proof. = j=d+l Op(q2r/d2T-1) = Op(ql/2), (A.16) Theorem2. Suppose AssumptionsA.1-A.5 hold. Let where the last equality follows from d = [q"]for v > (27 q be a data-dependentbandwidthsuch that q/q - 1 = - 1). Similarly, where *)/(27 op(q-((3/2)v-1)) q -- oo,q'q/n -- for some v > (27 )/(2,r- 1), 0. Then, under H0, Q(q) - Q(q) = op(l), and C3n = o(qi/2). (A.17) For the secondtermin (A.15), we decompose Q() __d N(0, 1). To show Theorem2, we first state two lemmas. d and Lemma A.2. Let Cn(k) = E=(1 j/n)k2(j/q) - j/n)(1 - (j + 1)/n)k4(j/^). Then D(k) = EI(1 and q-iDn(k) = q-lCn(k) = q-Cn(k) + op(q-1/2), + op(l). q-'1D(k) Lemma A.3. EZl {k2(j/q) - k2(j/q)2(j) = Op Cln < {k(j/) )- k(j/q)}2 j=1 d J{k(j/l) - k(j/q)}k(j/q)l. + 2 j=1 Given the Lipschitzconditionon k (in AssumptionA.5), of we can From the definition 2. Theorem = - = Q(q), Proofof q/q 1 op(q-((3/2)v-1)), and d [qv], we have write (q1/2/nr). d Q(q)-Q(q) = E {k(j/q)- k(j/q)}2 j=i -- Cn(k)} {Dn(k)}-1/2{Cn(k) + Q(q){(Dn(k)/Dn(k))1/2 + {Dni(k)}-1/2 n - d < A2-2(/q)2(t/q 1} (k2(j/O) - k2(i/q))2(J) -- 1)2 5j2 = Op(1). j=i }. Here, the first term vanishes in probability by Lemma A.2 and {D,(k)}-1/2 = Op(q-1/2), the latter follows from Lemma A.2, and q-iD,(k) -- fo k4(z) dz. Next, the second term vanishes in probability because D! (k) /D, (k) --+P 1 by Lemma A.2 and Q(q) = Op(l) by Theorem 1. Finally, the last term vanishes in probability by Lemma A.3 and = {Djn(k)}-1/2 = Op(q-1/2). Consequently, Q(O)- Q(q) so Q(j) --+dN(0, 1) given Q(q) _+dN(0, 1) by TheOp(l), orem 1. The proof is completed provided Lemmas A.2-A.3 are proven. This, together with Ed=l k2(j/q) = O(q) and the Cauchy- Schwarz inequality, implies Z {k(j/4)- j=1 k(j/q)}k(j/q)- op(ql/2). It follows that Cl, = op(ql/2). (A.18) Collecting (A.15)-(A.18) yields q-l(C'n(k) = q-Cn(k)+ (k) = qiD (k) + op(1) can op(q-1/2). The result q-1 be shown analogously. This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions Hongand Shehadeh:A New Test forARCHEffects 105 Proof of Lemma A.3. Recalling the definition of f(j) in have LemmaA.1, we can write 1/2 d n-1 (D11)1/2 JD12nj 1 E {k2(j/q) - k2(j/q)}P2(j) j=1 k2(j/q)f2(j) j=1 n-1 (A.25) op(q1/2/n). {k(2(j/) - k2(j/q)} 2(j) =-2(0) It follows from (A.23)-(A.25)that j=1 n-1 + Din = op(q1/2/n). {k2(j/l) - k2(j/q)} -2(0) E (A.26) j=1 x {12(j) _ (A.19) 2(j)}. For the firsttermof (A.19), we have Combining(A.20)-(A.22) and (A.26), we obtain that, for the firsttermof (A.19), n-1 n-1 S {k2(j/q) - k2(j/q)}f2(j) = op(q1/2/n). {k2(j/l) - k2(j/q)} 2(j) E j=1 n-1 d = I{k(2(j/4) - > k2(j/q)} 2(j) + k2(j/q)f2(j) j=d+l j=1 n-1 Next, we considerthe second term of (A.19). Because _ 2(j)} = op(ql/2/n) by Lemma j= k2(j/q){2( _ f2(j)} A.1, it suffices to show --•1k2(j/){ri2(j) op(ql/2/n). Put - E3k2(j/q)i2(j) n-1 j=d+l = Din + D2n - D3n, say. A2q27r(/q)27 Wn = (n/q1/2) E k2(j/q)lr2(j) (A.20) WefirstconsiderD2n. GivenIk(z)l< Alzj-', we have n-i D2n (A.27) j=1 - 2(j)[. j=1 We shall show WI,= op(l). For any given q > 0, 6 > 0, < P(Wtn> q) P(WTr,> rl,Iq/q- 11< 6) +P(li q 11> 6), -2()}j=dl j=d+1 (A.21) where the last probabilityconvergesto 0 as n -+ oc, given l/q - 1 = op(1). Thus, it remainsto show that the first givenq/q -+P1 andd = [q"]for v > (27--)/(27 - probabilityalso vanishes.Put (j) = Op(dl-2r/n) by Markov's 1), where E -jf-27 if I < zo = 1 and Ef2(j) inequality O(n-1). Similarly,we have k(z) = { Ajzj- if 1z, > zo, = op(ql/2/n), D3n = op(ql/2/n). (A.22) Next, we considerthe firsttermin (A.20). We write d Din {k(j/l) - k(j/q)}2f2(j) - d {k(j/) Then [k(z)l k(z), and k(z) is monotonicallydecreasing on IR+ = [0,00]; that is, k(zi) > k(z2) if 0 < z1 < Z2. Furthermore, we have fo k2(z) dz < 00, given 7 > 2. It follows that [k(j/q^)l k(j/^) <_ k{j/(1 - 6)q} if 2> (1- 6)q.Whence,the firstprobabilitywill vanishas n -+ o00 j=1 + 2 where 0 < zo < oc and A, 7 are given in Assumption A.5. - k(j/q)}k(j/q)f2(j) if j=1 n-1 = Dlln + 2D12n, say. (A.23) Given the Lipschitz condition on k (Assumption A.5), we have D11, < A2q-2q(q/c)2( /q - 1)2 j2r2(j) j=1 = A2 -2op(q-(3v-2))Op(d3/n) = Op(n-1), (n/(1 - 6)q1/2) x j=1 c2{j/(1 - 6)q}li2(j) _ •2(J)l = Op(1) for any given small 5 > 0, which follows by a reasoning analogous to the proof of Lemma A.1 and the fact that k q-1 Ej=I k2(j/q) 2(Z) dZ < 00 as q -+ o00. Therefore, we have Wn = op(1), or equivalently, n-1 (A.24) where c/q - 1 = op(q-((3/2)V-1)) and Zj]lJ2f2(J) Op(d3/n) by Markov's inequality. Next, by the CauchySchwarzinequalityand E k2(J/q) 2(j) = Op(q), we > k2(j/q)r2(j) - i2(j)[ = op(ql/2/n). (A.28) j=1 Combining (A.19), (A.27)-(A.28), and Lemma A.1 then yields the desired result. This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions Journalof Business & EconomicStatistics,January1999 106 Theorem3. Suppose AssumptionsA.1-A.3 hold. Let For the firstterm of (A.30), we have - QREG(q)= q -+ oo, q5/n -+ 0. ThenunderHo, QTRUN(q) q op(1) and 02 n-1 Bl, = sup (AQREG(q)-+d N(0,1). t=l Oeeq j=l Proof of Theorem3. Put Z = (Z1,I...I, )', Zt = n ( ^2 (1, 2/2 7 6_tq) _1,..., = (^2/^2 6 ( 12/ 1,...,O- 1• 2/n Rq = (R(1),.. .,R(q))', where R(j) n- Et+ = 1 - 1), and n 4) _ n11 t (-4_j_4) n t=j+1 - 3). Then by definition(e.g., see Engle 1982), an)(2_j 2 we have 0 ( n-1 sup OEeq j=1 q R2= U'Z(Z'Z)-1Z'U/(U'U). n 5 t=n-j+l By straightforward algebra,we haveR2 = (nR2(0))2RA'-1 Rq,,whereA is a q x q symmetricmatrixwith the (i, j) element 2 -n /2 + j)(n -- 3 n - Aij _ -2 2) (^2 • _- E t) S qn 1 t=max(i,j)+ 3 and 3 =n-1 E 2. BecausenE=l2 R22(j)R (0), it follows that <supj=11 .0 n-1t=n-j+l4 Eeq =1 ()=n q (&+2 Sup q nR2 -n 8EEqj=l P2(j) ( -1 5 - n t=j+l j=1 = nri-2(0)(R(O)RIA-lRq 1- < n- Rq) (A.29) = nR2[r-2(0)Oq(R(0)Iq - A)Oq, is a q x 1 unit vector in Eq = where , = A-1/2q/V {0 E R•:0'0-= n 1}. t=n-q+1 = sup ISnl n 2)6 ( n-1 OEeq i=1 j=1 t=1 q j-1 E ( 2 2 - T Gr < 5 GEeq q j=1 n -1 q j-1 n j=2 i=1 t=j+l -2)(-2-j (-i E i=1 q (n (A.31) t=j+1 E nz-1 n1 C-i-j O, t=max(i,j)+1 sup 12 t=n-q+l j=2 n -2 2 For the secondtermof (A.30), we have B2n < q -4 4 E = Op(q/n). Put Sn =supoeO'(i(O)Iq- A)O.We first show Sn = Op(q2/n1/2). For this, we write q n t +23n&n-• t 5 1 (?2&2)2 n j-1 --1 - 2_ - i=1 j=2 t=j+1 t=l q j-1 j=2 i=1 + n n- t=j+1 E (t-i - ct- ) - t=j+1 +2 sup Z Geeq j=2 ?•xi --)• Zn-1 i=1 3 = B1, + 2B2,, say. q j-1 n j=2 i=1 t=j+1 j=2 i=1 t=j+1 t=j+l - ), (A.30) = CI + C2• + C3n + C4n = Op(q2/nZ/2). This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions (A.32) Hong and Shehadeh:A New Test forARCHEffects 107 Here, REFERENCES q a-i 1/2 n -4 C1 Cln n- j=2 i=1\ 1/2 n -2 n t _ (gSt- t=l t 2) given n-r E t•l2 tn (A.33) 22 - (A.34) eroskedasticity in Time Series Models," Journal of TimeSeries Analysis, 13, 501-519. - (1993),"ARCHModels:Properties,EstimationandTesting,"Jour- nal of Economic Surveys, 7, 305-366. q < U.K.:CambridgeUniversityPress. the Bandwidthof a Beltrdio,K., andBloomfield,P. (1987), "Determining Bera, A. K., and Higgins,M. L. (1992), "A Test for ConditionalHet- We also have cn Bartlett, M. S. (1955), An Introductionto Stochastic Processes, Cambridge, Kernel Spectrum Estimate," Journal of Time Series Analysis, 8, 21-38. Op(n-1). Similarly, = Op(q2/n1/2). C2n The Annals of Statistics, 21, 830-847. andAutocorrelation ConAndrews,D. W. K. (1991),"Heteroskedasticity sistentCovarianceMatrixEstimation," Econometrica,59, 817-858. t=l = op(q2/n1/2) Anderson,T.W. (1993),"Goodnessof Fit Testsfor SpectralDistributions," 2 n-2 ?t - 7q Nonregular Cases in Econometrics," in Quantitative Economics: Theory and Practice,eds. S. Chakravorty, D. Coondoo,and R. Mookherjee, New Delhi:Allied Publishers,pp. 324-356. Bollerslev, T. (1986), "GeneralizedAutoregressiveConditionalHet- j=2 t=l 2l pan2 2 -1n < Bera,A. K., Ra, S., and Sarkar,N. (1998),"HypothesisTestingfor Some 21 eroskedasticity," Journal of Econometrics, 31, 307-327. g t Bollerslev,T., Chou,R. Y., andKroner,K. F. (1992),"ARCHModelingin Finance," Journal of Econometrics, 52, 5-59. t=+ q C tE2 n-1 -2 1-2 t= Bollerslev,T., Engle,R. F., andNelson,D. B. (1994),"ARCHModels,"in Handbookof Econometrics(vol. IV),eds. R. F. EngleandD. L. McFadden, Amsterdam,ElsevierScience,pp. 2959-3038. Likelihood Bollerslev,T., and Wooldridge,J. (1992), "Quasi-Maximum Estimationand Inferencein DynamicModelsWith Time VaryingCo- j=2 variances,"Econometric Reviews, 11, 143-172. 2 -1 n 2 E 1-2 an 2 Box, G. E. P., and Pierce, D. A. (1970), "Distributionof ResidualAutocorrelationsin Autoregressive-Integrated Moving AverageTime Se- 21 _a 0 ries Models," Journal of the American Statistical Association, 65, 1509- t=n-q+l t=l - 1526. Brock,W., Dechert,D., andScheinkman,J. (1987), "ATestfor Independence Based on the CorrelationDimension,"unpublishedmanuscript, Universityof Wisconsin,Dept.of Economics. Diebold,F. X. (1987), "Testingfor SerialCorrelationin the Presenceof n n (A.35) Opp(q2/n/2). ARCH," in Proceedings of the Business and Economic Statistics Section, American Statistical Association, pp. 323-328. Diebold,F. X., andPauly,P. (1989),"SmallSamplePropertiesof AsympFinally,by Markov'sinequalityand the fact that underHo ConditionalHeteroskedastoticallyEquivalentTestsfor Autoregressive ticity,"StatisticalPapers,30, 105-131. Domowitz,I., and Hakkio,C. S. (1987), "Testingfor SerialCorrelation and CommonFactorDynamicsin the Presenceof Heteroskedasticity," unpublishedmanuscript,NorthwesternUniversity,Dept.of Economics. Durbin,J. (1967), "Testsof Serial IndependenceBased on the Cumu- n 2 < E 2 n-1 pleest0,2) j i C tet-proo lated Periodogram,"Bulletin of the International Statistical Institute, 42, 1/2 = O(n-1/2), t= j+ we have C4n = Op(q2/n1/2). Collecting (A.32)-(A.36) yields B2n = Op(q2/n1/2). (A.36) This, togetherwith (A.30)-(A.31) and q5/n -. O, implies Sn = = Op(1). It follows from (A.29) that nR2 is of the same order of magnitude as n •_j=l f2(J); that Op(q2/n1/2) is, nR2 = Op(q). Therefore, from (A.29) and q5/n -• 0 again, we obtain nR2 n q=1j2(j) = Op(q3/n1/2) = op(ql/2). Consequently, QREG(q) - QTRUN(q) = Op(1) and QREG(Q) d N(0, 1) because QTRUN(q)•d N(0, 1) by Theorem 1 and the facts that C,(k) = q + o(q1/2) and 1040-1048. Durlauf,S. (1991),"SpectralBasedTestingof theMartingaleHypothesis," Journal of Econometrics, 50, 355-376. ConditionalHeteroskedasticity With Engle,R. F. (1982),"Autoregressive Estimatesof the Varianceof UnitedKingdomInflation," Econometrica, 50, 987-1007. "Estimatesof the Varianceof U.S. InflationBased on the .-(1983), ARCHModel,"Journalof Money,Creditand Banking,15, 286-301. (1984),"Wald,LikelihoodRatio,andLagrangeMultiplierTestsin in Handbookof Econometrics(vol. II), eds. Z. Griliches Econometrics," ElsevierScience,pp. 776-826. Amsterdam: andM. Intriligator, PropEngle,R. F., Hendry,D. F., andTrumble,D. (1985),"Small-Sample erties of ARCH Estimators and Tests," Canadian Journal ofEconomics, 18, 67-93. ForecastErrorVariances Engle,R. F., andKraft,D. (1983),"Multiperiod of InflationEstimatedFrom ARCHModels,"in AppliedTimeSeries Analysisof EconomicData, ed. A. Zellner,Washington,DC: U.S. Departmentof Commerce,Bureauof the Census,pp. 293-302. Engle, R. F., Lilien, D. M., and Robins,R. P. (1987), "EstimatingTime VaryingRisk Premiain the Term Structure:The ARCH-MModel," Econometrica, 55, 391-407. D,(k) = 2q(1 + o(1)) as q5/n --, O,q -, co. This completes the proof. Granger,C. W. J., and Teriisvirta,T. (1993), ModellingNonlinearEconomicRelationships,New York:OxfordUniversityPress. Testfor Autoregressive CondiGregory,A. W. (1989),"ANonparametric A Markov-Chain tionalHeteroskedasticity: Journalof BusiApproach," [Received January 1996. Revised May 1998.] Grenander,U., and Rosenblatt,M. (1953), "StatisticalSpectralAnalysis ness & Economic Statistics, 7, 107-115. This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions Journalof Business & EconomicStatistics,January1999 108 S ArisingFromStationaryStochasticProcess,"TheAnnalsof Mathemat- McLeod,A. I., andLi, W. K. (1983),"DiagnosticCheckingARMATime ical Statistics, 24, 537-558. Series Models Using SquaredResidualAutocorrelations," Journalof - (1957), Statistical Analysis of Stationary Time Series, New York: Time Series Analysis, 4, 269-273. Chelsea. Milh0j,A. (1985),"TheMomentStructureof ARCHProcesses,"Scandinavian Journal of Statistics, 12, 281-292. Hong, Y (1996), "ConsistentTestingfor SerialCorrelationof Unknown Form," Econometrica, 64, 837-864. Newey,W., andWest,K. (1994),"AutomaticLag Selectionin Covariance ARCHTestingin TimeSeriesModels,"Journal (1997),"One-sided Matrix Estimation,"Review of Economic Studies, 61, 631-653. of Time Series Analysis, 18, 253-277. Parzen, E. (1957), "On ConsistentEstimates of the Spectrumof a Hong,Y., and Shehadeh,R. (1996), "ANew Test for ARCHEffectsand Stationary Time Series," The Annals of Mathematical Statistics, 28, its Finite SamplePerformance," CAE WorkingPaper96-01, Cornell 329-348. University,Dept.of Economics. M. B. (1981), Spectral Analysis and TimeSeries (vol. 1), London: Priestley, Lee, J. H. H. (1991), "ALagrangeMultiplierTestfor GARCHModels," AcademicPress. Economics Letters, 37, 265-271. Lee, J. H. H., and King,M. L. (1993), "A LocallyMost MeanPowerful Robinson,P. M. (1991a),"Testingfor StrongSerialCorrelationand Dyin MultipleRegression," Journal Based Score Test for ARCHand GARCHRegressionDisturbances," namicConditionalHeteroskedasticity Journal of Business & Economic Statistics, 11, 17-27. of Econometrics, 47, 67-84. (1991b), "AutomaticFrequencyDomainInferenceon SemiparaLjung,G. M., andBox, G. E. P. (1978),"Ona Measureof Lackof Fit in SmetricandNonparametric Models,"Econometrica,59, 1329-1364. Time SeriesModels,"Biometrika,65, 297-303. T. (1988),"TestingLinearity Taylor, S. (1984), Modelling Financial Time Series, New York: Wiley. Luukkonen, R., Saikkonen,P.,andTeriisvirta, in Univariate Time Series Models," Scandinavian Journal of Statistics, Weiss,A. (1984), "ARMAModelsWith ARCHErrors,"Journalof Time Series Analysis, 5, 129-143. 15, 161-175. This content downloaded from 128.84.125.184 on Fri, 22 Nov 2013 14:15:07 PM All use subject to JSTOR Terms and Conditions
© Copyright 2025