Accurate ARL Calculation for EWMA Control Charts Monitoring Simultaneously Normal Mean and Variance Sven Knoth Advanced Mask Technology Center, Dresden, Germany Abstract: EWMA control charts designed for monitoring the variance or the mean and the variance of a normally distributed variable are either based on the log transformation of the sample variance S 2 or provide only rough ARL results. Gan (1995), as the most prominent example for the simultaneous case, calculated ARL ¯ S 2 EWMA schemes. The results in Knoth and Schmid values precisely for X-ln 2 ¯ (2002) for X-S ones are less accurate than the former one. The reason behind the lacking precision is that the methods usually applied for ARL calculation are not able to handle the restricted support of the chart statistic (S 2 and, of course, S and the range R are non-negative random variables). While in Knoth (2005) this problem is treated for single variance monitoring by solving integral equations with collocation methods, this paper employs collocation and ideas similar to Gan ¯ 2 EWMA control charts. (1995) in order to obtain accurate ARL values of X-S Additionally, the appropriate choice of the non-symmetric control limits for the S 2 part of the scheme is addressed. Keywords: Average Run Length; Collocation; Computational methods; Control chart design; Exponentially Weighted Moving Average. Subject Classifications: 62L10; 60J22; 47N30; 65M70. Address correspondence to S. Knoth, AMTC Dresden, Postfach 110161, D-01330 Dresden, Germany; Fax: +49-351-4048 9158; E-mail: [email protected] 1 1. INTRODUCTION Control charts – core tools of Statistical Process Control (SPC) – are used to monitor certain parameters of one or more (random) variables over time. The main parameter of interest is the mean of the considered variable. Additionally, the performance of the variance plays a crucial role when applying a control chart. For instance, the setting of control limits is based on reliable, preliminary estimates of the variance. Thus, if the variance changes during monitoring, then the average time until an alarm, false or right, is affected. The most unpleasant case occurs when the mean changes and the variance decreases at the same time. Then, usually, more time is needed to detect the mean change than initially expected. Additionally, a changed (mostly increased) variance is of importance of its own. To sum up, simultaneous monitoring of mean and variance seems to be a quite reasonable task. Eventually, SPC software packages tend to favor most frequently X-MR (MR – ¯ ¯ ¯ control Moving Range), X-R, or X-S control charts over single X or X charts. In practice, many control charts are applied at the same time. In general, their joint behavior is not regarded; this is true even for charts which are designed for the same unit. For example, in the previously mentioned simple Shewhart control charts the control limits are seldom corrected to ensure a certain behavior of the simultaneous charting scheme. It is quite simple to obtain the same performance measures for the combined schemes as for the single ones, if we consider only Shewhart charts. If we are interested in applying more complex control charts such as EWMA (Exponentially Weighted Moving Average) or CUSUM (Cumulative Sum), then we have to face more complex problems. There are several ways to form EWMA control charts with the objective of monitoring mean and variance at the same time. Recall the so-called omnibus charts in Domangue and Patch (1991) that exploit only a single characteristic. Gan (1995) wrote a paper which plays an important rule in the area of simultaneous EWMA charts. He demonstrated that EWMA ¯ and ln S 2 , that is, the natural logarithm of the sample schemes based on X variance, perform quite well and dominate the preceding omnibus charts. Additionally, he equipped the SPC public with a numerical tool to determine the Average Run Length (ARL) of combined EWMA (and CUSUM as well) schemes. The ARL – the average number of subgroups or individual observations counted from monitoring startup, where it is assumed that the change in the mean and/or variance happens only at the beginning – is the most popular performance measure for control charts. Thus, there is high 2 interest in methods to compute this value in a quick and accurate manner. Unfortunately, Gan’s original approach works only for ln S 2 . The same could be observed in papers about ARL computation for single variance control charts (see Knoth (2005) for more details). In Knoth and Schmid ¯ 2 EWMA scheme was analyzed, but with less ac(2002) the combined X-S curacy than Gan (1995) provided. Apart from some practical reasons (ln S 2 looks more “normal” than S 2 , the scale change is transformed to a step change, and the variance of ln S 2 does not depend on the actual value σ 2 ) one major reason drives the usage of ln S 2 – the numerical difficulties with S 2 . These difficulties vanish after the log transformation. For ln S 2 , the transition kernel of the EWMA sequence, more or less the distribution of the chart statistic, is smooth, while the original sample variance leads to a kernel which exhibits a non-smooth point. The latter is problematic for the well-known Brook and Evans (1972) approach (Markov chain approximation) and for the so-called integral equation approach as well. See Reynolds and Stoumbos (2001a,b, 2004) for a discussion of these nasty problems. However, they do not provide any suitable calculation method – their results rely mainly on Monte Carlo studies. Refer to Knoth (2005) for a more comprehensive study of ARL calculation methods. Thereby, mean as well as variance EWMA control charts are considered. Hence, there are two obstacles in the way of calculation of the ARL of combined EWMA control charts for monitoring mean and variance based on S 2 (or S), (i) the difficulties that stem from the coupling of single charts which was suitably solved by Gan (1995), (ii) the subtle character of the S 2 related transition kernel which was treated in Knoth (2005). The objective of this paper is to combine ideas of both papers for getting a feasible calculation ¯ 2 EWMA control charts. method of the joint ARL of X-S ¯ 2 EWMA CONTROL CHARTS 2. X-S Let {Xij } be a sequence of subgroups (samples, batches) of independent and normally distributed data. Each subgroup i consists of n observations Xi1 , . . . , Xin . The subgroup size n can be equal to 1 if we utilize not the usual sample variance S 2 but a variance estimator that employs the actual mean (Reynolds and Stoumbos 2004, who preferred this version). In this paper, which focuses more on calculation methods than on applicational issues, this differentiation is not that important because of the equivalence (in the in-control case, otherwise the non-central χ2 -distribution prevents that) of both approaches in terms of the ARL calculation, if the degrees 3 of freedom in the variance part is re-adjusted. In the sequel we restrict ourselves to the usual sample variance S 2 . The parameters of the normal distribution are the mean µ and the variance σ 2 . Usually, we are interested in detecting changes in both directions of µ and increases of σ 2 . Additionally, we consider here schemes which provide signals in case of decreased variance so that we might be aware of missing signals in the mean chart. For the topic of misleading signals we refer to Morais and Pacheco (2006). To sum up, we consider joint schemes which combine two-sided mean control charts with one-sided upper and two-sided variance control charts, respectively. To begin with, let us collect some suitable notation. The parameter values, which describe the stable performance of the monitored process, are µ0 and σ02 . As long as the stable and the actual parameter values coincide we call the process “in-control”: it is otherwise “out-of-control” (unfortunately, we have to base our decision on data and not on exact knowledge of µ and σ). ¯ 2 EWMA control charts are based on the two EWMA sequences Then, X-S µ {Zi } and {Ziσ }, that is, n n X ¯i = 1 X Xij n Si2 z0µ Z0σ Ziσ j=1 Z0µ Ziµ = = (1 − µ λµ )Zi−1 1 X ¯i 2 , = Xij − X n−1 j=1 ¯i + λµ X = z0σ , σ = (1 − λσ )Zi−1 + λσ Si2 . The design parameters λµ and λσ are chosen from the interval (0, 1] to ensure a certain signaling behavior, where small values increase the detection power for small changes and vice versa. The starting values z0µ and z0σ are usually set to µ0 and σ02 , respectively. Then, as long the process is in control, the expectations of both EWMA sequences are equal to their target values. An alarm is given at time point L if at least one of the single charts signals, L = min{Lµ , Lσ } , ( s ) λµ σ0 √ Lµ = min i ∈ N : − µ0 | > := cµ , 2 − λµ n ( ) r r λ 2 σ Lσ = min i ∈ N : Ziσ − σ02 > c∗σ := cσ σ2 . 2 − λσ n − 1 0 |Ziµ c∗µ As usual, the alarm thresholds on the righthand sides of Lµ and Lσ are normalized with the asymptotic standard deviation of the chart statistics Ziµ 4 and Ziσ , respectively. The values cµ and cσ are called critical values. In the sequel, however, we utilize the starred versions. For the two-sided variance case we have to introduce a further limit because of the non-symmetric behavior of Si2 and consequently Ziσ (see Section 4). The previously mentioned ARL could be described as the expectation of L. To be precise, for fixed values of the so-called head starts z0µ and z0σ , and for given actual values of µ and σ, which are assumed to be valid for the whole averaging process, denote E(L) = E(L|µ, σ, z0µ , z0σ ) . Of course, as in the univariate case the Markov chain as well the integral equation approach could be established for the full 2-dimensional model. Both end in large dimensions of the resulting linear equation system if a reasonable precision is needed. See Knoth and Schmid (2002) for some results in that direction, where it had to be used for autocorrelated data. That approach becomes even more complex if one tries to employ the collocation ideas given in Knoth (2005). 3. ARL CALCULATION Ideas like in Gan (1995) that are based on Waldmann (1986) should be exploited in order to get a more feasible approach than the full 2-dimensional one. First, the parameters µ0 and σ02 are set to 0 and 1, respectively, for the sake of simplicity. Denote P (·) the probability measure of our series for a fixed parameter set of µ, σ, z0µ , z0σ . Then for any j ∈ N regarding the ¯ and S 2 independence of X P (L > j) = P (Lµ > j) P (Lσ > j) . Using the well-known relation E(L) = ∞ X P (L > j) j=0 and the geometric tail behavior of the distribution of L (see Waldmann (1986), Gan (1995), Knoth (1998), and Madsen and Conn (1973) – the essential assumption for the asymptotic geometric tail is some quasi-positiveness of the kernel of the joint scheme, which is mostly fulfilled for the considered control charts) allows us to establish the following ARL bounds: ARL± J = J−1 X j=0 P (L > j) + P (L > J) . 1 − m± J 5 The choice of the truncation point J relies on the distance between the bounds ARL± J and could be rather small for small distances already. The constants m± J are defined as in Waldmann (1986) and Gan (1995), that is, they stand for sup and inf of P (L > j; z0µ , z0σ )/P (L > j − 1; z0µ , z0σ ) over the permissible domains of starting values z0µ and z0σ . These values m± J converge rapidly to a common limit which resembles the eigenvalue of largest magnitude of the joint transition kernel. Waldmann (1986), and later Gan (1995), utilized quadrature rules in order to calculate numerically the values P (L > j) for j = 1, 2, . . . , J by means of the following iteration rule for the single control charts: ∗ P (Lµ > 1)(z0µ ) = Zcµ f (z0µ , z) dz , −c∗µ ∗ P (Lµ > j)(z0µ ) = Zcµ P (Lµ > j − 1)(z) fµ (z0µ , z) dz . −c∗µ Thereby, f (·) denotes the corresponding transition kernel, which is equal to z − (1 − λµ ) z0µ 1 µ fµ (z0 , z) = φµ,σ λµ λµ for the mean chart. φ(·) denotes the probability density function (pdf) of a normally distributed variable with mean µ and variance σ 2 . For the S 2 chart we have to use (and, of course, different limits in the integral) 1 2 n − 1 z − (1 − λσ ) z0σ n−1 σ fσ (z0 , z) = χ . 2 λσ σ λσ σ2 χ2 (·) denotes the pdf of a chi-squared distributed random variable with n − 1 degrees of freedom. Gan (1995) considered ln S 2 and obtained a different version of the variance chart transition kernel. Thus, he was able to approximate the integrals in both iteration sequences P (Lµ > j) and P (Lσ > j) with Gauss-Legendre quadratures with small absciccae numbers to ensure reasonable precision. Unfortunately, if we are interested in S 2 instead of ln S 2 , the same problem arises as for the single variance chart. In Knoth and Schmid (2002) both iteration sequences were approximated by terms, which come from the Markov chain approach. Instead of applying the truncation at a certain J the whole problem is transformed into a 6 Sylvester matrix equation system that is solved by employing a numerical library. Unfortunately, it inherits the precision losses from the S 2 part and, consequently, large matrix dimensions are needed to get higher accuracy. A small study should demonstrate the ability of different approaches to ¯ 2 EWMA control charts. Let deal with the subtle ARL calculation for X-S us consider: 1. Pure transfer of Gan (1995), who utilized Gauss-Legendre quadrature (GL) for calculating the iteration sequences P (Lµ > j) and P (Lσ > j), j = 1, 2, . . . 2. Apply the Simpson (S) rule instead of GL for the S 2 part. 3. Apply the midpoint rule instead of GL, which is equivalent to the popular Markov chain approximation (M 1) – Morais and Pacheco ¯ S 2 EWMA charts (for both the mean and (2000) utilized this for X-ln the variance chart). 4. Collocation (C) similar to Knoth (2005), described in more detail in the Appendix of this paper. 5. Like in Knoth and Schmid (2002), employ Markov chain approximation (M 2) for both charts and solve a Sylvester matrix equation. Note that with one exception (Sylvester matrix equation) all approaches ¯ part. The subgroup size use always Gauss-Legendre quadratures for the X n is equal to 5 like in Gan (1995). The smoothing parameters are taken as λµ = 0.134 from Gan (1995) and λσ = 0.1 that is more a first shot than a result of some optimization. However, it helps to build a control chart which beats Gan’s best combination as will be demonstrated in Section 4. Then c∗µ = 0.345 477 and c∗σ = 0.477 977 are used, which yields an incontrol ARL of 252.3 of the combined scheme and equal ARLs of the single charts. A Monte Carlo study with 109 replicates validates the above joint ARL value by resulting in 252.307 with standard error 0.0078. In Figure 1 the approximated ARL versus matrix dimension N (each method relies on matrices) are plotted. Note that the truncation point J that is needed for approaches 1 to 4 is always chosen in the same way. The distance between the bounds mentioned in the beginning of the section, ARL± J , should be smaller than 10−8 . The matrix dimension N is only related to the quality of the survival function approximation. 7 Figure 1. Precision of different ARL computation methods for the in¯ 2 EWMA control chart monitoring the mean (2-sided) control ARL of a X-S and variance (upper) of normally distributed data, the true value is 252.3 and the smoothing values are λµ = 0.134 and λσ = 0.1; S – Simpson rule, M 1 – Markov chain (midpoint rule), M 2 – Markov chain (Sylvester matrix equation), C – Collocation. The Gauss-Legendre quadrature based results are not drawn in Figure 1, because they oscillate more than all the others (see Table 1 for some numerical results). At first sight of Figure 1, the collocation method exhibits the best performance. For dimension N = 23, collocation already ensures the first 6 digits after the decimal point. No other method in the study attains this accuracy with N below 300. Even if we regard the higher complexity of the collocation approach, which leads to the highest computing times for fixed N among the candidates, collocation is the clear favorite. See Table 1 for the corresponding computing times. The second and third best ones are the two Markov chain approaches, followed by Simpson and Gauss-Legendre quadrature based ones. Table 1. Resulting ARL approximations and CPU times (measured in hundredth seconds on an Athlon XP 1.4 GHz) for selected matrix dimensions N and all methods under consideration; the true value is 252.2999. Method Gauss-Leg. Simpson M. chain 1 M. chain 2 Collocation 13 25 60.8717 1 5.2588 1 232.4026 <1 211.3234 <1 253.3603 2 90.9297 1 -1 1 245.2234 <1 239.9381 1 252.2999 8 matrix dimension N 51 101 143.4348 2 297.8329 2 250.5108 2 249.3160 5 252.2999 35 253.1860 5 249.0502 5 252.2235 5 251.9535 32 252.2999 140 201 301 252.1741 30 252.5606 30 252.3001 31 252.2515 286 252.2999 583 252.6392 65 252.3129 65 252.2729 68 252.2598 1221 252.2999 1323 Thus, if we are interested in getting a quick and nearly accurate result, 8 then we would take the Markov chain approximation M 1. If we prefer high accuracy, then collocation is the first choice. The numbers in Table 2 demonstrate how the collocation approach works for smaller λσ down to 0.001 and less subgroup sizes n. The parameter λµ remains unchanged because it is numerically less subtle. Table 2 gives the matrix dimensions Nσ (only the resolution of the S 2 related approximation, just Nσ , is increased; Nµ is fixed at 51) and the resulting final thresholds, that is, c∗µ and c∗σ . Fortunately, the matrix dimension Nσ does not increase Table 2. Resolution (matrix dimension) Nσ needed to ensure 6 digits accuracy for the in-control ARL (252.3). Additionally, both (finalized) critical values c∗µ and c∗σ (second and third entry) are given. n 5 4 3 2 0.1 0.05 λσ 0.01 23 0.345477 0.477977 23 0.386268 0.563641 17 0.446048 0.713918 23 0.546349 1.081116 31 0.345381 0.284547 31 0.386166 0.332911 21 0.445939 0.416409 31 0.546243 0.614829 60 0.345962 0.075633 62 0.386822 0.087563 37 0.446709 0.107662 61 0.547224 0.153289 0.005 0.001 83 0.347443 0.038398 85 0.388478 0.044359 50 0.448622 0.054313 83 0.549569 0.076655 167 0.356187 0.006213 174 0.398254 0.007149 107 0.459910 0.008701 176 0.563392 0.012100 too fast. Note that small λ’s such as 0.001 are not reasonable at all. Knoth (2006) provides a small study for one-sided S 2 EWMA charts that illustrates it. While the ARL for detecting a given scale change decreases, the in-control probability of an early alarm (right at the first observation) increases heavily. In Section 6 the collocation approach is described in more detail. In the next Section we employ this ARL calculation method to compare Gan’s (ln S 2 based) ARL results with the ARL values of the competing S 2 based simultaneous EWMA chart. 9 ¯ S 2 AND X-S ¯ 2 EWMA CHARTS 4. COMPARE X-ln Gan (1995) compared omnibus charts described in Domangue and Patch (1991) with two joint EWMA charts and one CUSUM combination in terms of their ARL. Both EWMA and CUSUM are built with ln S 2 . The EWMA charts work considerably well. Here, we want to add for both EWMA schemes the corresponding performance of the S 2 based charts. As mentioned in the previous Section, we take the same λµ = 0.134 like Gan, but set λσ = 0.1, which is sufficiently powerful – as we will see soon. The critical limits c∗µ and c∗σ are chosen to provide an in-control ARL 252.3 for the ¯ chart looks similar: combined scheme. Recall that Gan’s limit for the X c∗µ,Gan = 0.345 vs. c∗µ = 0.345477. These limits ensure also equal in-control ARLs for the single charts, namely 499.3. Note that our comparison is not really fair, because Gan’s EWMA sequence exhibits a lower reflection barσ σ + ln Si2 }. Then, Gan calculated = max{0, (1 − λσ )Zi−1,Gan rier, i. e. Zi,Gan σ the ARL for z0,Gan = 0 that corresponds to the worst case for detecting an increase in σ 2 (the smoothing value is λσ,Gan = 0.043). For the S 2 based chart, we calculate the ARL as usual for EWMA charts, that is, for z0 = σ02 = 1, which is not the lowest position of Ziσ . It is difficult to compare these charts in an appropriate manner (we have to switch to the steady-state ARL or we should introduce a reflection for the S 2 chart like in the comparison in Knoth 2005). In Table 3, however, we give ARL values of both (upper) EWMA charts. ¯ S 2 (EEU ), and of Table 3. Subset of ARLs from Gan (1995), that is, X-ln 2 ¯ X-S EWMA charts calculated by collocation. ∆ 0 0.20 0.40 1.00 σ/σ0 1.00 1.05 1.25 1.00 1.05 1.25 1.00 1.05 1.25 1.00 1.05 1.25 ¯ S 2 252.3 113.0 16.6 129.7 78.3 15.9 48.8 39.4 14.2 10.2 10.0 8.3 X-ln ¯ 2 252.3 101.2 13.9 129.3 72.6 13.4 48.6 38.1 12.3 10.1 9.9 7.7 X-S √ Note that the shift ∆ is measured in σ0 / n units (subgroup size n) like in Gan (1995). We see from Table 3 that the S 2 based version works slightly better. Remember that the comparison suffers from the same dilemma as many EWMA-CUSUM comparisons. We consider now a comparison for the double two-sided schemes. It would remain an unfair comparison if CUSUM would be one competitor. ¯ S 2 EWMA chart and our X-S ¯ 2 EWMA chart, Gan’s double two-sided X-ln 10 however, act similarly. According to Gan, the critical limits of his twosided ln S 2 were chosen to minimize the ARL for detecting an increase to σ = 1.25σ0 (now λσ,Gan = 0.106). His resulting critical limits are c∗µ,Gan = 0.345, c∗σ,lower,Gan = −0.867 and c∗σ,upper,Gan = 0.215. Here, we want to ¯ 2 chart, and with so-called compare this optimal chart with a similar X-S ARL-unbiased versions of both. The analysis of the former is based again on collocation methods and provide c∗µ = 0.345313, c∗σ,lower = −0.383153 and c∗σ,upper = 0.53 (the upper variance limit was fixed, the lower one and the mean limit are chosen to give the joint ARL 252.3 and equal ARL values of the mean and the variance chart). The term ARL-unbiased was defined in Pignatiello et al. (1995) and broadly applied for variance control charts in Acosta-Mej´ıa and Pignatiello (2000). Roughly speaking, the max of the ARL for fixed µ = µ0 = 0 and for varying σ should be attained at σ = σ0 = 1. While it is trivial for symmetric distributions (symmetric lower and upper control limits ensure that), it is more demanding to choose the usually non-symmetric limits to provide the max at the right position. Here, we try to solve the following task: 1. Joint ARL is again 252.3. 2. ARLs of the single charts are equal. 3. The slope (of the ARL function) in σ direction at σ = σ0 = 1 is 0. This is done numerically based on Gan’s methods and on collocation. The re¯ S 2 EWMA scheme are c∗ sulting critical limits of Gan’s X-ln µ,Gan = 0.345382, ∗ ∗ cσ,lower,Gan = −0.824585, and cσ,upper,Gan = 0.262703. The respective limits ¯ 2 EWMA chart are c∗ = 0.345271, c∗ of the X-S µ σ,lower = −0.363404, and ∗ cσ,upper = 0.582758. The variance limits are slightly increased compared to the previous designs. Thus, both schemes now detect decreases in σ faster, while the increase detection ability became worse. In Figure 2 we see the √ results for µ = µ0 + ∆/ n , ∆ ∈ {0, 0.2, 0.4, 1}. For the given domain of possible σ values, the S 2 based scheme is better than the ln S 2 one. Only for σ values smaller than 0.7 or larger than 1.3, and for µ = 1 and σ < 1 the ln S 2 based version provides a better ARL performance. The latter demonstrates the advantage of the more symmetric behavior of the ln S 2 version. The S 2 version, however, becomes faster and faster, when the magnitude of the mean shift and the variance increase in the same time. 11 5. CONCLUSIONS The main result of the current paper is that we are able to get accurate ARL ¯ 2 EWMA control charts. To do this, one should use colloresults for X-S cation for the S 2 part. Standard methods like the Markov chain approach provide quick and nearly accurate results, but do not attain high accuracy. Moreover, as already seen in Knoth (2005) and several papers mentioned therein S 2 based schemes perform better than ln S 2 ones in terms of their ARL. However, the current paper does not provide a complete study. A further paper might fill this gap. For truly two-sided variance EWMA charts one has to look for so-called ARL-unbiased designs. With recent computer technology (PCs, not clusters or bigger boxes, which are even faster), this is done with CPU times of some seconds only. To get rid of large delays in detecting mean changes in the presence of simultaneous variance decreases, it is enough to take a slightly non-symmetric design (hence, the scheme becomes “ARL-biased”) as Gan (1995) already did. 6. APPENDIX Here, some more details of the collocation method as a plugin for calculating the iterations of P (Lσ > j) follow. Note that for the whole calculation process we derive equations for an approximation of the run length survival function and not for the true function itself, that is, all terms like P (L > j) refer now to the numerical approximation. ¯ chart the Gauss-Legendre quadrature is employed, Recall that for the X which is here the most powerful one. Thus, take N Gauss-Legendre roots zs and weights ws on the integration interval [−c∗µ , c∗µ ], so that P (Lµ > j)(zr ) = N X ws P (Lµ > j − 1)(zs ) fµ (zr , zs ) , r = 1, 2, . . . , N s=1 provides us with a numerical recipe for the iteration sequence P (Lµ > j). Please do not confuse z0µ (or z0σ ) from Section 3, where the 0 stands for in control, with zs . Here, s points to sth quadrature node. In Section 3 the same was applied for P (Lσ > j) besides the Simpson and midpoint rules as competing quadrature methods. Now, the collocation method should be presented. Collocation means that the solution is built as linear combination of certain base functions 12 {Ts (·)}. Thus, set for each j P (Lσ > j)(z) = N X gjs Ts (z) s=1 via N X gjs Ts (zr ) = s=1 N X ∗ 1+c Z σ gj−1,s s=1 Ts (z)fσ (zr , z) dz , r = 1, 2, . . . , N 0 on a suitably chosen grid z1 < z2 < . . . < zN and an easily derived starting set {g1s } we obtain the wanted iteration sequence for approximating the run length survival function. We calculate the new values {gjs } step by step while solving the above equation systems. If we take step functions as base, then we arrive again at the Markov chain approach. Usual bases are monomials like 1, z, z 2 , . . . , z N −1 , Lagrange or Chebyshev polynomials (some adaption for usage on [0, 1 + c∗σ ] is needed) Ts (z) = cos s arccos(z) , s = 0, 1, . . . , N − 1 , z ∈ [−1, 1] , which ensure more stable numerical quadratures than monomials or Lagrange polynomials. It could be worthwile to compare monomials and Chebyshev polynomials in terms of the resulting ARL computing time. For monomials it could be easier to establish faster iterations. ¯ and for S 2 we are able to approximate the survival function Now, for X µ/σ P (Lµ/σ > j) for arbitrary z and not only at the nodes zr . Recall that the grids used for the single charts differ from each other. Thus, zs stands for the respective grid. In terms of vector arithmetics the methods look like: P~j = MP~j−1 , µ 0 ) , P~j = P (Lµ > j)(z1µ ), . . . , P (Lµ > j)(zN M = Mrs , Mrs = ws fµ (zr , zs ) , for the simple mean part and N ~gj = O~gj−1 , ~gj = gj1 , . . . , gjN , N = Nrs , Nrs = Ts (zrσ ) , ∗ 1+c Z σ O = Ors , Ors = Ts (z σ )fσ (zrσ , z σ ) dz σ 0 13 for the more subtle S 2 chart. Here, we recognize the source of the higher computing times for fixed N compared to the competing methods. Fortunately, the matrices N and O do not depend on j so that the solution of the linear equation system for each j could be accelerated. The entries of O are calculated numerically, where only small numbers of quadrature points are needed (do not confuse these quadratures with the upper ones) for high accuracy. Note that for this paper Chebyshev polynomials are used (just in case when for very small λ large values of N could be necessary). For the monomials base perhaps, faster solution is possible. For the two-sided variance chart the collocation approach becomes even more difficult to describe. Instead of bases which contain smooth, nonconstant functions over the whole interval [1 + c∗σ,lower , 1 + c∗σ,upper ], we use now piecewise defined functions, which vanish outside a small supporting interval. For details about this we refer to Knoth (2005). The reason behind this detour is that the piecewise approach is considerably more precise than the full approach and does not need more computing time for the same N . Eventually, the numerical approach for the ARL-unbiased design will be sketched. As mentioned in Section 4, three conditions have to be fulfilled and three critical values are unknown. The iteration starts with the upper scheme that yields the target joint in-control ARL and equal ARLs of the single charts. Then, a slightly increased upper variance limit is fixed. The remaining limits are calculated to ensure the previous mentioned ARL design. Finally, by means of the secant rule this upper variance limit is changed as long as (Lµ0 ,σ0 −ε − Lµ0 ,σ0 +ε )/(2ε) (this secant should mimic the slope at (µ0 , σ0 ) in σ-direction) is larger than 10−6 . Thereby, ε is set to 10−4 . After updating the upper variance limit, the lower one and the mean limit are again derived to provide the target joint ARL and equal ARL values of the separate charts. Figure 2 demonstrates that the iteration provides reasonable results. References Acosta-Mej´ıa, C. A. and Pignatiello, J. J., Jr. (2000). Monitoring Process Dispersion Dithout Subgrouping, Journal of Quality Technology 32: 89– 102. Brook, D. and Evans, D. A. (1972). An Approach to the Probability Distribution of CUSUM Run Length, Biometrika 59: 539–549. 14 Domangue, R. and Patch, S. (1991). Some Omnibus Exponentially Weighted Moving Average Statistical Process Monitoring Schemes, Technometrics 33: 299–313. Gan, F. F. (1995). Joint Monitoring of Process Mean and Variance Using Exponentially Weighted Moving Average Control Charts, Technometrics 37: 446–453. Knoth, S. (1998). Exact Average Run Lengths of CUSUM Schemes for Erlang Distributions, Sequential Analysis 17: 173–184. Knoth, S. (2005). Accurate ARL Computation for EWMA-S 2 Control Charts, Statistics and Computing 15: 341–352. Knoth, S. (2006). The Art of Evaluating Monitoring Schemes – How to Measure the Performance of Control Charts?, in Frontiers in Statistical Quality Control 8, H.-J. Lenz and P.-T. Wilrich, eds., pp. 74-99, Heidelberg: Springer. Knoth, S. and Schmid, W. (2002). Monitoring the Mean and the Variance of a Stationary Process, Statistica Neerlandica 56: 77–100. Madsen, R. W. and Conn, P. S. (1973). Ergodic Behavior for Nonnegative Kernels, Annals of Probability 1: 995–1013. Morais, M. C. and Pacheco, A. (2000). On the Performance of Combined EWMA Schemes for µ and σ: A Markovian Approach, Communications in Statistics - Simulation & Computation 29: 153–174. Morais, M. C. and Pacheco, A. (2006). Misleading Signals in Joint Schemes for µ and σ, in Frontiers in Statistical Quality Control 8, H.-J. Lenz and P.-T. Wilrich, eds., pp. 100-123, Heidelberg: Springer. Pignatiello, J. J., Jr., Acosta-Mej´ıa, C. A., and Rao, B. V. (1995). The Performance of Control Charts for Monitoring Process Dispersion, 4th Industrial Engineering Research Conference, pp. 320–328. Reynolds, M. R., Jr. and Stoumbos, Z. G. (2001a). Individuals Control Schemes for Monitoring the Mean and Variance of Processes Subject to Drifts, Stochastic Analysis and Applications 19: 863–892. Reynolds, M. R., Jr. and Stoumbos, Z. G. (2001b). Monitoring the Process Mean and Variance Using Individual Observations and Variable Sampling Intervals, Journal of Quality Technology 33: 181–205. 15 Reynolds, M. R., Jr. and Stoumbos, Z. G. (2004). Control Charts and the Efficient Allocation of Sampling Resources, Technometrics 46: 200–214. Waldmann, K.-H. (1986). Bounds for the Distribution of the Run Length of Geometric Moving Average Charts, Journal of Royal Statistical Society, Series C 35: 151–158. 16 Figure 2. ARL function of double two-sided EWMA charts (Gan’s EE ¯ 2 -EWMA), mean is fixed at µ = µ0 + ∆/√n , ∆ ∈ {0, 0.2, 0.4, 1}, and X-S in-control ARL 252.3. 17
© Copyright 2024