Sample Sizes for Attributes Inspection When Strata are Heterogeneous John L. Jaech and Marian Donoho International Atomic Energy Agency Vienna, Austria ABSTRACT that at least one of the M/yx defects that are hypothesized to exist among the N population (or stratum) items will be contained in the sample of size n. This statement implies that IAEA attributes inspection is generally based on zero-acceptance number sampling plans. Such plans require a minimum amount of inspection which is an important consideration when inspection resources are limited. The formula that is in common usage at the International Atomic Energy Agency (IAEA) when calculating sample sizes for inspection by attributes is based on the assumption that the stratum in question is homogeneous with respect to element/isotope weight, i.e., all items in the stratum have the same weight. In practice, this assumption is generally invalid to some extent. This paper examines the effect of heterogeneity in item weights on the probability of detection, and indicates how sample sizes might be altered to account for the heterogeneity. After considering the simplest type of heterogeneity that occurs when the item weights cluster tightly about a small number of modal points, a more general pattern of heterogeneity is examined. Modifications to the sample sizes to account for this general pattern are derived based on a specific diversion strategy. A number of numerical results are presented in the form of plots relating sample size to mean item weight and to the scatter in item weights. An important feature of the sample size formula is its simplicity. The IAEA inspectors quickly commit the formula to memory through repeated usage. The sample size calculation itself is very simple with the aid of a pocket calculator. Notwithstanding the widespread application of formula (1), it is not without potential shortcomings. One question that has reoccurred on frequent occasions is the impact of heterogeneity in item weights on the validity of the formula. Formula (1) is based on the assumption that all item weights are equal to x, although it is realized in practice that this is never exactly true. One purpose of stratification is to limit the scatter in the item weights within each stratum so that formula (1) yields the sample size that will guarantee a detection probability of 1-|3. INTRODUCTION In calculating inspection sample sizes for the detection of gross/partial defects by attributes inspection, the formula in common usage at the IAEA is of the form where y = 1 for gross defects and is some value less than one for partial defects. In (1), (3 is the nondetection probability, x is the mean item weight (element or isotope), M is the goal amount, N the number of items in the stratum, and n the sample size. In practice, limitations on the number of strata that can be reasonably accommodated result in strata that may well be quite heterogeneous with respect to item weights. This introduces concern about the validity of applying the sample size formula (1), and about how sample sizes should be calculated to provide the desired detection probability if the degree of heterogeneity is excessive. Of course, a basic question to consider is just how much heterogeneity can be tolerated without concern, i.e., how much is "excessive". Formula (1) has been applied virtually since the beginning of IAEA inspections. It is based on an approximation to the hypergeometric probability density function and is the sample size needed to assure with probability (1 - |3) In addressing the question, a particular type of heterogeneity is examined first with respect to its effect on the sample size detection probability question. The type of heterogeneity in question is that in which n/N = (1 - P (1) 173 respect to permissible variation in item weights within a stratum on the one hand, and problems of implementation and data evaluation that are associated with moderately large heterogeneity in item weights on the other hand. In practice, item weights within a stratum may vary by a considerable amount. A particular pattern of heterogeneity is considered here. individual item weights may cluster tightly around a small number of mean weights. This would occur, e.g., when two or more homogeneous strata that differ only in mean item weights are combined into a single stratum in order to reduce the number of strata. The obvious solution to the sample size problem in this instance is to compute the total sample size by summing the sample sizes calculated separately for each "sub-stratum". Under appropriate assumptions, this is equivalent to simply calculating the sample size for the combined stratum. However, the detection probability is not necessarily > (1-0), as will be illustrated. Characterize this specific pattern as one in which item weights are clustered around m modal points. In the limiting case, Nj items have item weight xj; i = 1, 2, ..., m. In essence, the stratum consists of m sub-strata. Without loss of generality, set y = 1 in formula (1), in which case the sample size in substratum i would be The major focus of the paper is to consider a general pattern of heterogeneity in item weights in which the heterogeneity is characterized by the standard deviation in stratum item weights. No further distributional assumptions are made. A conservative approach is to insert some value other than x in formula (1), where this value exceeds x by some amount, the amount depending on the value of the standard deviation. This conservative approach assumes inherently that the adversary will choose to falsify the items with the largest weights, thus minimizing the number of items that need be falsified. The increased sample size may be quite excessive, but the desired detection probability of (1-P) should be virtually guaranteed. nj and £ nj is the total sample size. For small ij/M, say x^/M < 0.1 as is often the case, the following approximation holds: (1 - P>*i/M) * - (Xi/MHnp (3) so that Another approach is to assume a random strategy for the adversary, one in which he selects items to falsify at random from the stratum, stopping when the sum of the selected item weights equals or exceeds the goal amount. One benefit of this approach from the adversary's viewpoint is that the particular items that are defected are completely unpredictable to the inspector, and he must therefore tailor his sampling plan to cover all possibilities. It is also noted that this model is certainly a reasonable one in a non- adversarial situation in which the defects are the results of mistakes in the records as opposed to data falsifications. In this situation, it is entirely reasonable to assume that the defected items are selected at random. (4) Now, the stratum average is m (5) = I and it is noted that if x In the next section, the pattern of heterogeneity in which individual item weights cluster about a small number of mean weights is considered. In the section to follow, the effect of a general pattern of heterogeneity is examined. and N = are used in formula (1), the sample size is - pX/M> CMJSTKK HETEKOGKNKITY Whenever strata are created within some component of a material balance, decisions must be made regarding the definition of specific criteria for stratification. The IAKA has developed guidelines for stratification in attempting to strike a compromise between the excessively large number of strata that would result in the event of rigid criteria with - ififi I NjXi (6) which is identical to (4). Thus, the sample size for the stratum is the same as the total sample size for m pseudo- substrata. A numerical example may enhance 174 understanding of this result. Table 1 gives the frequency distribution for grams pluLonium in containers of mixed oxide powders. The frequencies are given in terms of Nj/N. for P = 0.05, the sample size calculations are also indicated in the table. The goal amount, M = 8000 g Pu. T a b l e PATTKKN OK In Table 1, a l l N^ itenis in substratum i are assumed to have the same item weight. . In generalizing the heterogeneity pattern, this assumption is no longer made. Rather, the heterogeneity is characterized by the standard deviation in stratum item weights. No other distributional assumptions are made, i.e., the density function for item weights is not specified. 1 Frequency Distribution of Grains Plutonium in Containers of MOX Powder As previously indicated, for this general model a random strategy is assumed for the adversary. Items to falsify (defects) are selected at random from the stratum, this process continuing until the goal amount is achieved. Nj/N 700-800 600- 700 500-600 400-500 300-400 200-300 100- 200 0-100 Pu. 750 650 550 450 350 250 150 50 0.28 0.30 0.10 0.02 0.02 0.20 0.06 0.02 1.00 0.2486 0.2160 0..1861 0..1551 0..1228 0.0893 0.0546 0.0185 0..0696 0..0648 0.0186 0.0031 0..0024 0..0179 0..0033 0.0004 0.1801 Under this assumption the number of defects among the N items is a random variable, denoted by Z. The distribution of 2 is difficult to determine, but it may be inferred by proceeding as follows. For a given value of Z, denoted by zj, the probability that the sum of the item weights exceeds M may be calculated assuming, (by appealing to the Central Limit Theorem) that this sum is normally distributed. Let this probability be denoted by fz{- For the next integer, the corresponding probability is Pzi_l, and hence, the probability that Z = Zi is qzi = pzi - Pzi.iThen, for given zj, the nondetection probability is approximately The stratum average item weight is 536 g From (1), n/N = (1 - pS36/8°°°) = 0.1819 as compared with n/N = 0.1801 pseudo-substrata. NKTK.WC l-'NK IT Y when forming the (1 - n/N) l = a * It is noted that although the total sample size is essentially the same whether or not the substrata are actually formed, the item weights for the items sampled at random from the entire stratum will not have the same frequency distribution as indicated in the last column of Table 1 except on an expectation basis. For example, the expected number of items sampled that have item weights between 700-800 g Pu is 0.0696N; the actual number sampled may be quite different. For example, if N = 100, on the average 7 items would be sampled from this substratum. The actual number sampled in a given realization of the sample could easily range between 3 and 12. The overall nondetection probability is then z v I qzi a (8) and a is found by trial and error such that the indicated sum is <_ P. This process is illustrated. data, For the Table 1 x = 536 g Pu The conclusion with respect to combining the substrata into a single stratum and selecting the n items at random from this single stratum may be stated from another equivalent perceptive. s = 220 g Pu and also, Conclusion: (7) If sampling is performed within each substratum by repeated application of formula (1), the overall nondetection probability probability is P independent of how the defects may be distributed. If sampling is performed within the overall stratum, then the expected nondetection probability is P; the actual probability may be cither greater than or less than p. M = 8000 g Pu. (It is noted that x and s, being population values rather than sample values should be replaced by M and a respectively. The results are, of course, independent of the notation, and since the accepted notation at the IAEA for a stratum average weight is x, the notation is preserved in this discussion. for 175 = 13, say. p l3 = Prob( I mean item weight, the sample size given by application of equation (1) is actually slightly larger than needed to achieve the desired value of P. Only when the standard deviation exceeds roughly 80 % of the mean need additional items be inspected. Xi > M) = Prob(t /13s = Prob(t > 1.301). where t is N<0,1). From a table of the standardized normal distribution function, P13 = 0.097. For z A simple example will serve to explain this phenomenon. Consider the case where M/x = 4 and c = 0. In this case, all items in the stratum weigh M/4 units (e.g., g Pu) and no matter which items are included in the random sample, exactly 4 would have to be falsified to accumulate to the goal amount, H. Now suppose that c is just slightly larger than 0, i.e., s it 0 but is small relative to x. Then the required number of items needed to accumulate to M units is either 4 or 5, and each value of Z is equally likely. Thus, with probability 0.5, Z = 4 and the required sample size is the same as that required for s = c = 0. But also with probability 0.5, Z = 5, and the required sample is smaller than that needed for s = c = 0 , there being more defects in the population. Thus, on an expectation basis, the required sample size is actually smaller for c > 0 by some amount that it is for c = 0. At some point, however, at about c = s/x = 0.8, the required sample size is greater than for c = 0 because the standard deviation of Z becomes so large relative to its mean. It is noted that c ^ 0.8 corresponds to a stratum with a considerable degree of heterogeneity, and very few strata have coefficients of variation that exceed 0.8. =14, i P14 = Prob(t > --JUX) = Prob(t > 0.603) = 0.273 vO4~ s Since 0.097 is the probability that the goal amount would be exceeded by the sum of the weights of 13 randomly selected items, and 0.273 is the corresponding probability for 14 items, the probability that z = 14 is Pl4 0.176 In this way the probability distribution of the random variable Z is inferred. Some small scale simulations were performed with good results to verify the validity of this approach to finding the distribution of Z. Now, for a given value of Z - zj, detection probability is approximately SUMMARY AND CONCLUSIONS non- the A computer program was written to permit calculation of a large number of results. The results are displayed in graphical form, Figures (l)-(4). In the figures, "SQ" refers to a significant quantity, the usual value for M in the previous discussion. The parameter c is the ratio s/x, i.e., is the coefficient of variation. The parameter re is the ratio of the sample size for given c to that given by formula (1), i.e., at c = 0. Two approaches were used to study the effect of heterogeneity in item weights on required inspection sample sizes for attributes inspection. In the event that item weights are clustered tightly around a number of modal points, the overall sample size found by forming pseudo-substrata as defined by the modal points is essentially the same as that found for the overall stratum when the stratum average item weight is used in the sample size formula (1). When sample items are selected randomly from the entire stratum, the detection probability is (1 - p) on an expectation basis, although in a given realization of the sample, the detection probability is a function of the diversion strategy. If the sample items were to be drawn at random from each of the pseudo-substrata by applying formula (1) to each substratum m, then the detection probability is essentially (1 - P) independent of the diversion strategy. Under the assumption that it is appropriate to calculate the detection probability on an expectation basis, it is valid to apply formula (1) to the entire stratum in spite of the degreee of heterogeneity when the item weights exhibit the "cluster" pattern of heterogeneity. Rather surprisingly, perhaps, the analysis shows that for values of the standard deviation of item weights that are as large as 80 % of the For a more general pattern of heterogeneity characterized by the coefficient of variation, c = s/x, and for a diversion strategy in which - n/N)Zi (9) Since Z = zj with probability qz,, the overall expected nondetection probability, P, is equal to the sum z i (10) For a desired value of |3, equation (10) may be solved by trial and error for the fractional sample size, n/N. 176 weights. This conclusion holds as long as it is judged satisfactory to characterize detection probabilities on an expectation basis. Otherwise, one can as one alternative form the pseudo substrata indicated in the discussion on the cluster pattern of heterogeneity and inspect the appropriate number of items from each substratum by repeated application of formula (1). This obviously may greatly increase the inspection planning implementation, and evaluation effort. As another alternative, given inspection resources, the conservative approach of adjusting x upwards to characterize the items with the large weights may be utilized. Of course, it would be necessary in this latter instance to randomly sample the items from the entire stratum, and not just inspect those with the largest item weights. The required sample size with this conservative approach might well exceed that given by formula (1) by a large amount. items are selected at random until the goal amount is attained, an algorithm was developed that gives the sample size needed to achieve the required detection probability on an expectation basis. The necessary computer software was developed and calculations performed for a large number of cases. The results indicate that even for reasonably large values of s/x, i.e., as large as 0.8, the sample size is actually smaller for the heterogeneous stratum than for the limiting case in which s = 0. Thus, formula (1) tends to be conservative on the high side for the very few instances in which s/x becomes quite large, i.e., > 0.8. In such instances, figures (l)-(4) may be used to calculate required sample sizes. The overall major general, the commonly (1) may be applied in being concerned about f conclusion is that in used sample size formula most instances without heterogeneity in item «.l •.4 ••• Fig. Fig. 1 3 9Q/XBAH far • • ! • • • .I E 1.1 " ^_ Fig. 2 177 S
© Copyright 2024