Review of Choice Based and Matched Sample Studies In Auditing Research* Donald P. Cram Unaffiliated Iris Stuart California State University Fullerton Vijay Karan California State University Fullerton March 7, 2007 PRELIMINARY AND INCOMPLETE * Please address all correspondence to Donald P. Cram ([email protected]). 1 Review of Choice Based and Matched Sample Studies In Auditing Research Summary Three methodological errors in the analysis of choice based and matched samples in accounting research have been identified by Cram, Karan, Stuart (CKS, 2007). CKS demonstrate that non-random samples, when collected, must be analyzed taking their non-randomness properly into account. We discuss the misanalysis in nine streams of audit research: auditor litigation, audit fees, impact of audit committees, quality of audits and auditors, prediction of financial distress involving audit variables such as a going concern qualification, other audit qualifications, and experiments on auditor judgment. We tabulate and discuss 76 papers published in these research streams between 1980 and 2003 that used choice-based or matched samples, and we identify errors in 71 of these. We provide evidence that the broad course of research in some of these streams may have been misdirected due to reliance upon unjustified results of these works. Key words: Matched sample, choice-based designs, auditing literature Data availability: Data are available from public sources 2 INTRODUCTION Audit researchers have used choice based and matched sample research designs frequently in 76 papers from 1980-2003. They do so primarily for their power to reveal statistically significant findings following collection of relatively small data sets. Choice based and matched samples are frequently used to economize when data collection is costly, especially when outcomes of one sort are rare and few would be obtained under random selection. The research design of these non-random samples provides for efficient collection of fewer data points. For example, all firms experiencing auditor litigation during a period may be identified, and for comparison, rather than gathering data for all non-litigation firms, a control sample containing matched firms, matching to each litigation firm by industry and firm size, may be collected. This is appropriate if a factor such as industry or firm size is likely to have a large effect on the likelihood of auditor litigation but not itself be of primary research interest. In such a case, the use of a matched sample design allows the researcher to focus power on estimating parameters for variables of interest while applying control for those “nuisance” variables. Or, if nuisance variables are likely to have a nonlinear effect, it suffices to match on those variables without modeling and estimating their effects explicitly. Cram, Karan, Stuart (CKS, 2007) identify, however, that choice based and matched samples have often been misanalysed in accounting research, from Beaver (1966, 1968) and Altman (1968) until now, and identify three errors. These errors, to some extent, have been noted before, but CKS suggest sufficient guidance has not been provided within accounting research. CKS are the first to identify six categories of studies using choice-based and matching techniques which differ in the potential problems and most efficient solutions that apply. The presence of these errors undermines both the internal and external validity of the research. Recent studies seek to identify determinants of the outcomes observed, e.g. of change of auditors or no auditor change, and look at the sign, magnitude, and significance 3 of the coefficients in a model in order to accept or reject hypotheses on their relationships. However, as Cram, Karan, Stuart (2007) demonstrate when improper methods of analysis are used, the estimated coefficients’ signs, magnitudes, and significance are unreliable. When miss-specified models are being applied, research streams are more at risk from the effect of “fishing” for results. Lys and Watts (1994) measure 14 client-specific variable and 9 auditor-specific variables, but present a model involving just 5 and 3 variables, of which 4 variables are significant. Bell and Carcello (2000), in exploratory work, find just 7 of 46 hypothesized characteristics to be significant in explaining auditor dismissals. Other studies depend on this literature by use of coefficients estimated in choicebased and matched samples, extending errors and introducing a new kind of error. For example, a researcher may take and apply the estimated coefficients from a study of determinants of audit litigation, in order to calculate a Z-score for use in other work. This extends any errors from within the choice-based and matched analyses that derived the coefficients. And, further it introduces a new error: it constructs an unconditional estimate, which is not justified, from conditional results. For example, Krishnan and Krishnan (1997) used the coefficients from Stice (1991) matched sample to assess the likelihood of lawsuits against auditors. We assess that Stice’s paper’s coefficients are not reliable due to Errors 1, 2, and 3 all occuring in the analysis. Second, the Stice paper provides a conditional analysis, yielding coefficient estimates for some variables, but not providing coefficients on matching variables of industry and size that are needed in an unconditional estimate. The Z-score is incomplete. We do not search out and review these studies relying upon choice-based and matched samples, unless they happened to be in the auditing area and also included a new matched or choice-based sample. The present paper explores the extent of misanalysis in audit research. We seek to alert audit researchers to the need for reanalysis of previously published results, as they 4 suffer from problems of both internal and external validity. We also suggest that many unpublished studies may benefit from correct reanalysis. Auditing researchers often reason persuasively that industry or size or other factors have large effects that must be controlled for, use those factors in selecting their sample, but continue to perform analysis that, erroneously, does not account for the matching. Matching on an effect does not accomplish the desired control if an unmatched method is then used to analyze the sample. Therefore the analysis performed is misspecified, and the researchers have created a strong possibility that their discussion of the relative importance of other factors of research interest is not justified. Briefly, their analysis suffers from the omission of multiple correlated variables which they themselves have identified as being important, and hence all estimated coefficients and standard errors in their analysis are biased unpredictably. Cram, Karan, Stuart (2007) term this omission to be Error 1. A related, further complication in the analysis arises if the matching has not been perfect, e.g., if matching is done on a continuous variable such as assets or sales, and “closest” rather than exact matches are accepted. Then, the remaining gap in size between treatment and control observations could have a substantial effect, and a variable measuring that difference in size needs to be included in the analysis of differences to “soak up” its effect. Or, equivalently, that effect could be controlled for by including the size variable itself in an analysis where matching is otherwise controlled for by including pairwise dummy variables. Audit researchers often do not employ this refinement of simpler matched analysis when it is needed. Cram, Karan, Stuart (2007) term this Error 2 in the discussion that follows. Choice based and matched samples are not randomly selected. So, in addition to controlling properly for matching variables, it is also necessary to adjust analyses to take into account the differing sampling rates in strata of the collected data. This can be done by reweighting each observation according to its stratum’s representativeness of the 5 general population. Analyses employing logit regression may be exempt from the need for reweighting. Also exempt are some within-subject designs, such as studies of audit fees paid by individual firms before and after a given event. Cram, Karan, Stuart (2007) term the omission of necessary reweighting as Error 3. The goals of this article, then, are to review the use of choice based and matched sample designs in published auditing research. We identify model specification issues, and clarify the correct use of matched sample research design in order to promote its use as an effective tool. In this review of 73 papers, we establish that the extent of incorrect analysis in auditing research—and by extension in other areas of accounting and financial research--is large. In some research streams it is possible to do cross-sectional studies on random samples, so it is possible to perform meta-analysis across those and the ones employing choice-based and matched samples. Hay Knechel Wong perform such a meta-analysis across audit fee literature. In other streams, it is less feasible to collect random samples, matched and choice-based samples comprise the majority of studies. When these are the dominant approach, misanalysis is likely lead the research stream awry. In bankruptcy prediction, the early studies such as Beaver (1966, 1968) and Altman (1968) and many hundreds thereafter used matched samples and were misanalysed. From Ohlson (1980) on, however, the use of random samples has largely supplanted the use of matched samples. This paper is organized as follows. In the next section, we discuss the research designs used in choice based and matched sample studies in auditing research and categorize them into six distinct groups. Then we discuss three types of errors that we find to be common in many of these studies. Groups vary in terms of being subject to the various errors and their remedies vary correspondingly. Then we establish the scope of the problem by discussing the 76 studies in auditing research where we note choice based, matched and stratified sample approaches have been applied. To be most useful to 6 audit researchers, we discuss these studies categorized within nine streams of audit research. Also, we report on the frequency of these errors by category of the research design used, by journals where these papers have been published, and by year of publication. Finally, we summarize and conclude. RESEARCH DESIGNS IN AUDIT RESEARCH In audit research, Cram, Karan, Stuart (CKS 2007) identified six distinct research designs involving samples that are nonrandom in that they are choice-based and/or matched. We discuss, in addition, one more research design. A choice-based research design is one where a subsample consisting of cases having one outcome (e.g., firm-year observations where a bankruptcy or a going concern qualified audit opinion occurs) is collected, and then a comparison sample of control observations is selected from available data having different outcomes, and then the analysis to follow uses the outcome as the dependent variable to be explained by other variables. “Outcome-based” or “state-based” might be more descriptive, but choice-based is the term generally used, as if the research is explaining discretionary management decisions to declare bankruptcy or not, for example. Choice-based sampling is useful when data collection is costly and one category of the outcome to be explained is rare, so random sampling from the population would not yield very many observations of the rare type unless very costly, large samples were collected. Medical research often takes advantage of this method to examine determinants of patient outcomes such as cancers that rarely occur. A matched sample research design is one which incorporates non-proportional sampling by its selection of pairs, triples, or other clusters of observations that are similar in certain respects. Matched clusters may consist of “within-subject” data, e.g., the pair of before-treatment and after-treatment observations of the same firm or other subject.1 1 Within-subject, before-and-after studies are not usually self-described as being matched samples in accounting research; we found relatively few in our review of audit research. Unlike between-subject studies, accounting researchers have usually analysed these taking into account the matching of subject with itself at a later time. Many within-subject studies appear in experimental work and are appropriately analysed: in analysis the term “blocking” refers to the control for matching that is accounted for. 7 Or, in a cross-sectional study, clusters may consist of pairs of firm-year observations that a researcher assesses are similar on observable and available characteristics such as year, industry, and size. These are “between-subject” studies and are usually misanalysed. In medical research, there are many studies of twins, as many genetic and/or environmental factors can be well controlled for, leaving power to discern the effects, across twins, of differences of dietary or smoking or other habits. Within the choice based research design, the control sample is often chosen based not only upon the outcome but also by matching on other variables. Thus a choice-based sample may or may not also be a matched sample; to be both, the control sample must be selected on the outcome variable to be explained in analysis as well as matched to the case sample on other variables. A sample of litigated firms paired to non-litigated firms, with the pair-matching by industry and closest size, is both. If the comparison firms were chosen randomly from the stratum of non-litigated firms, this would be just choice based. We distinguish between “fully-matched” samples in which each case observation is uniquely associated with one or more controls, and “semi-matched” samples. The latter are samples that have pairings of case and controls that are nominally but not meaningfully unique. For example, if several case observations in one industry are each matched to a different randomly selected control from that industry, what is achieved is what we term “semi-matching”; the pairs can just as well be combined into matched many-to-many clusters. Each industry-case cluster and each industry-control cluster are strata. A different kind of nonrandom design is one that is merely stratified, meaning that samples from different subpopulations are selected at varying sampling rates. To generalize to the larger population, the data must be reweighted in any analysis so that the true subpopulation sizes are proportionally represented. Kreutzfeldt and Wallace (1986), Kreutzfeld and Wallace (1990), and Wallace and Kreutzfeldt (1995) use the same sample, a sample from Arthur Anderson of 260 audit clients by stratified sampling in groupings by size, industry, and publicly vs. privately held. The authors stated that hey wanted a more representative sample than one chosen by pure random selection. Effectively, 8 however, they over-sampled in some strata. Hence this is a stratified sample, although it is not choice-based and it does not involve matching. These approaches to research design lead to seven distinct categories (see Figure 1), here described as (1) Choice Based Non Matched (CB-NM), (2) Choice Based Semi Matched (CB-SM), (3) Choice Based Fully Matched (CB-FM), (4) Non Choice Based Semi Matched (NCB-SM), and (5) Non Choice Based Fully Matched within-subjects (NCB-FM-W), (6) Non Choice Based Fully Matched between-subjects (NCB-FM-B), and (7) Stratified but neither choice-based nor matched. All seven categories are nonrandom samples, suffer from different potential errors and require differing forms of analysis. Previous researchers have not recognized the need for differing forms of analysis and have often analyzed their data incorrectly. THREE COMMON ERRORS IN DATA ANALYSIS Cram, Karan, Stuart (2007) identified three errors that can apply to one or more of the seven research design categories. We summarize the research designs and the potential for each error in Table 1. Error 1: the Use of Unconditional Analysis, when Analysis Conditional upon Effects of Matching Variables is Needed, can apply to all the matched sample categories, but not to choice-based samples which do not involve matching. Error 2: Failure to Control for Effect of Imperfectly Matched Variables, can apply to CB-FM and NCB-FM-B categories. Matching is exact for NCB-FM-W category. We chose not to review this error in the category of CB-SM, because we estimated the effect of the error would likely be minor. Error 3: Failure to Reweight Observations According to Differing Sampling Rates, can apply to all research categories but NCB-FM-W. However, a “logit exemption” to the need for reweighting applies to logit regression models in CB-NM, CB-SM, and CB-FM categories, as long as fully saturated models are used. In auditing research, we note there are just a few studies employing weighting to correct for such problems. A limitation of the weighted exogenous sampling maximum likelihood (WESML) method that these studies use, and of other weighting approaches, is 9 that exogenous population information is needed. In many research settings the required information-- the proportions of all individuals from the population that fall in each of the matching set categories-- might be readily available, such as when sampling from the finite population of publicly traded firm-year observations appearing in CRSP and Compustat databases, but often it is not. The vast majority of auditing studies using choice-based and matched samples do not, however, employ exogenous weighting. Many, instead, rely upon a special result which we term the “logit exemption”. The logit exemption, at least as it applies to simple choice-based sampling, has been known in accounting research since Palepu (1986). Notably, Maddala (1991) who described the simple version of the exemption in an invited paper in The Accounting Review, has been oft-cited and is at times paraphrased, imprecisely, to represent that as long as logit regression is used, coefficients other than the intercept will not be biased.2 Zmijewski (1984) also has been cited on this point. It has not been understood that if matching is used within choice-based samples, however, it is then necessary for an intercept to be estimated for each matched set. Then each of those intercepts are likewise biased, although coefficient estimates on included variables will be consistent. Accounting researchers have also applied unweighted estimation to probit, OLS, discriminant, and univariate analyses where the logit exemption obviously does not apply. These analyses do not yield valid estimates. Five of the six research design categories can include papers suffering from Error 3, because they involve nonrandom sampling. 2 An extended version of the logit exemption is proven in CKS (2007). The logit exemption allows the use of unweighted logit regressions to analyse samples that are choice-based, whether matched or not, data, delivering asymptotically unbiased coefficient estimates and standard errors on non-intercept variables, providing that the model is “fully saturated”, i.e. that an intercept is included for every level of each matching variable. The typical choice-based application in accounting research , however, involves matching, and erroneously estimates an unweighted and unsaturated model, which does not control for the matching variables’ effects and does not enjoy the logit exemption from need to weight data to reflect population proportions. 10 DISCUSSION OF AUDIT RESEARCH STUDIES The use and frequent misanalysis of choice based and matched sample studies in accounting research is not limited to the auditing area. However, these nonrandom samples are widely used in auditing research, appearing in 24 articles during 2000-2003 alone, of which seven are in AJPT, the primary journal for American Accounting Association audit researchers. We have collected, and summarize in Table 3, a list of 76 papers that is intended to be a fairly complete list of publications in auditing research, from 1980 on, that feature choice based, matched, and stratified samples.3 3 We do not seek to include experimental studies that employed within-subjects designs although those can be interpreted as matched sample studies. We would have included such a study if it was self-described as a matched sample study, but found none. Our search to identify published papers in auditing research included the following steps. We searched in ABI-Inform, JSTOR, and other literature databases on “matched sample”, “choice-based”, “matching”, “stratified sample” and similar key words and phrases. Also, we extensively browsed through back issues of The Accounting Review, the Journal of Accounting Research, the Journal of Accounting and Economics, Auditing: A Journal of Practice & Theory, Accounting and Business Research, the Journal of Accounting Auditing and Finance, and the International Journal of Auditing to identify additional articles using matched sample designs, presented less prominently. We observe that it is relatively rare for researchers to highlight their use of a choice-based or matched sample design in title or abstract, perhaps because the method is so well accepted. We collected complete copies of all articles identified, and from those identified other articles that these cited, and collected those, obtaining those from outside our library where necessary. For authors who were found to employ matched or choice-based sampling, as defined in this paper, we frequently made further efforts to review their other papers. Although we do not include unpublished papers in our target set, we also searched for working papers in accounting and finance by use of the Google search engine and the SSRN 11 The 76 papers listed in Table 3 comprise an interesting set of papers on its own. Current work in the audit literature relies upon results of these papers. We believe that, although it is beyond the scope of what we can prove in this paper, patterns prevailing in this area extend to other areas in accounting and finance research. We feel that the main results are not adequately supported in 71 of these 76 papers, and explain our general concerns in this section. More details of each individual study are included in Table 3. We attempt to be nearly exhaustive in sampling from one area of accounting research in order to avoid an appearance of criticizing authors selectively. This review enables us to establish patterns that in fact apply more broadly. We find one or more technical deficiencies along the lines identified above with almost all of the choice based and matched sample analyses performed. Correct re-analysis in most cases would be straightforward, were the original data available. Hence we recommend reanalysis of many of these works before they are relied upon further. We do not present replications, but at least in all of the studies using logit regression, correct reanalysis would be easy, if the original data were available. Reanalysis of many others website. Working papers yielded more citations of published papers using matching. From this set of more than 400 papers in accounting and finance, we then restricted our attention to the 76 papers that we classify as being in audit research. We deem a paper to be in audit research, for the purpose of this review, if it focuses on audit litigation, uses auditors as subjects in a survey, or prominently employs audit-related variables. As an example of a decision on the margin, we chose to include Beasley (1996), whose model includes a variable denoting the existence or not of an audit committee, in this review, while we exclude numerous other papers in corporate governance not having audit committee variables. 12 would also be feasible if exogenous sampling rate information also remains available. Our speculation is that many results will be significantly refined upon replication. We find that the vast majority of these papers suffer one or more of the three technical deficiencies. Of the papers, 56 of the matched sample papers do not explicitly control for matching in their analysis, thus suffering from Error 1. In the pair-matched papers among these 56, the researchers should have evaluated pair-wise differences rather than pooling all the data. The correct analysis also could have been implemented, loosely speaking, by included dummy variables for each matched set. Of the remaining 20 papers that do not suffer from that error, seven are choice-based but not matched samples, and five are merely stratified. Eight are matched samples which do control for the matching, for example, by their having taken pairwise differences of the data, and regressing pair-wise differences in outcomes upon those. Although these eight suffer other errors, they do utilize the matching and effectively avoid Error 1. Papers suffering from Error 1 lack internal validity. To be clear, there is no set of statistical assumptions which would justify the standard errors and t-statistics reported. CKS show by simulation that the effect of Error 1 can be to yield entirely erroneous conclusions. On the second technical criticism, Error 2, we note 29 of the papers suffer from lack of explicit control for “closest” imperfect matching. A closest-matched variable such as size can still have influence. It might be controlled for by including a linear term. But, as size or another variables contribution might be non-linear, in general, there is no fully satisfactory resolution. The researcher, we argue, must show some effort to eliminate the possibility that their results are merely driven by the omitted residual effect. Sensitivity analyses including linear and quadratic terms, for example, might be 13 performed and given some discussion. Otherwise, the researcher has not established that their effects of research interest are not merely the result of an omitted variable problem. Papers suffering from Error 2 may lack internal validity. The main statistical assumption which would justify excluding the residual effect is that size has no influence upon the outcome. In a OLS regression, the omission of residual size would also be justifiable if size was not correlated with any included variables (it is only omitted correlated variables that cause bias in all other coefficients). However in logit, probit and other analyses, the omission of even an uncorrelated variable that should be in the model will bias estimates of all other coefficients. On the third technical criticism, that of the need to reweight for sampling rates in each stratum, numerous papers would be exempt for their use of logit regression, at least insofar as for inferences based on non-intercept coefficients. (All of the matched sample papers using logit, however, suffer from Error 1). Additionally, five are exempt from the need to reweight for the reason that their matching is a within-subjects design, such as before-and-after study of audit fee levels, where the subject is a firm which is randomly selected from all firms having data available. And one study, G. Krishnan (2003), is exempt from need to reweight because the researcher’s sampling rate is effectively 100% in each of the strata sampled. The results, when presented as conditional to the stratification variables, are then already weighted to reflect the available population. The remainder do not enjoy the logit exemption from the need to reweight sampled data to reflect population proportions, but only two of those, both being probit analyses, perform the necessary reweighting. So 44 are in error for employing univariate, OLS regression, 14 discriminant analysis, and other analyses without necessary reweighting. These studies suffer from Error 3 and lack external validity. Overall, only five papers are free of all three errors that we classify. Two of these, Colbert and Murray (1998) and Foster et al (1998), are logit regressions of nonmatched choice-based samples. The three others, Iyer and Iyer (1996), Maher et al. (1992), and Sanders et al. (1996), are “before-and-after” studies of audit fee levels, employing OLS regression of within-subject differences in audit fees upon differences in other variables. A total of 30 suffer just one error, 24 suffer from just two, and 17 suffer from all three errors. On average 1.7 of these three types of errors appear. DISCUSSION OF AUDIT RESEARCH STUDIES BY RESEARCH STREAM We identified nine research areas in auditing and discuss matched sample and choice-based research in each of these areas. Research Stream 1: Determinants of Auditor Litigation Five papers investigate determinants of auditor litigation using choice-based samples. Lys and Watts (1994)’s CB-FM sample consists of 163 firms whose auditors were sued and 163 firm-year observations pair-matched by year, industry, Compustat delisting code (if any), and then by closest logarithm of size. They report OLS regressions explaining auditor litigation or not. Stice (1991), Shu (2000), and Heninger (2001) are CB-SM studies employing logit regressions, and Krishnan and Krishnan (1997) is a paper that extends the work in this area. Stice creates two semi-matched samples for comparison to 49 firms having auditor litigation: one is matched on year only, the other is matched by year and by 15 industry, and then one is selected randomly, achieving less than 49 year-industry sets. Shu (2000) uses a list of audit litigation firms from Watts, augmented by additional searching, yielding 282 firm-year observations, and constructs a semi-matched sample by selecting 10 controls matched by year, for each litigation outcome. Heninger (2001) identifies 67 auditee firms with lawsuits against auditors, and semi-matches to 67 firms by year and industry, achieving somewhat fewer than 67 year-industry sets. These four papers suffer from Error 1: they essentially suffer from the omission of dummy variables that are required to be included. The authors argue convincingly that industry and/or year are important but the analyses does not control for them (Error 1). Lys and Watts include control for imperfect matching on logsize by including size as a variable in their analysis, mitigating concern for Error 2 in their work, although including logsize would be more internally consistent. All of these papers suffer Error 3, the need for reweighting to population proportions. Since saturated models are not employed in the logit analyses, they do not enjoy the logit exemption for reweighting. Univariate analyses in these papers and the OLS analysis in Lys and Watts also suffer from Error 3. The impact of erroneous analysis in these papers carries through many other papers that rely upon them for their measurement of audit litigation risk. As one example, Krishnan and Krishnan (1997), itself a CB-SM paper, relies upon Stice’s reported coefficients for a Z-score-type index of auditor litigation likelihood. This application is inappropriate. If Stice’s sample had been choice-based but not matched (CB-NM), then the coefficients other than the intercept would have been accurate and higher index values would have represented higher auditor litigation 16 likelihood. However, use of an unconditional index based on the conditional estimates from a matched sample is invalid. The index omits year-industry-specific intercepts that cannot be estimated in a matched sample and that would likely be important (as Lys and Watts, Stice, Shu, and Heninger do or would argue). And even if Stice had accounted for the matching by including year-industry dummies in the logit regression, the intercepts estimated would not be valid, for the same reason that the overall intercept in logit regression of CB samples is not valid. Estimation of those intercepts for an index requires a random sample, or reweighting of each observation according to the sampling rate in each outcome-year-industry vis-a-vis the general population. Would such an index nonetheless be informative about litigation likelihood, in the same way that Altman’s Z-score turns out to be informative about bankruptcy, although it is equivalently miss-specified? The answer to that is unclear: Altman’s Z-score has been proven to be informative by classification performance tests by Altman and many others; an auditor litigation score has not been proven informative in such ways. Further, it is arguable that industry-year effects could be relatively stronger for auditor litigation and omitting them would undermine the scores validity in two ways: first the direct omission of the important effect of industry and year, second the coefficients included in the model would be more severely miss-estimated. As another example, more recent than the 76 that we tabulate, Krishnan and Zhang (2005) similarly use a Z-score-like measure of audit litigation risk, but one based on Shu (2000)’s coefficients instead of Stice’s. It is again incorrect to use the conditional estimates from Shu to construct an unconditional index of audit litigation risk, one which 17 necessarily omits the year effect that Shu’s work deemed important, besides the fact of error 1 in Shu’s work. It seems that the impact of misanalysis of choice based and matched samples significantly affects the estimation of audit litigation risk, which then continues to influence research in corporate governance and other areas. Research Stream 2: Impact of Audit Committees The research in this area considers the impact of audit committees and the board of directors on financial statement fraud. Research studies include three choice-based samples and two matched but non-choice-based samples. McMullen (1996) examined the reliability of financial reporting to determine if the presence of an audit committee is associated with one or more of five outcomes related to financial reporting problems (shareholder litigation alleging management fraud, quarterly earnings restatement, SEC violation, illegal act, auditor change due to an accounting disagreement between auditor and management). She collected five samples of firm-year observations involving such outcomes over seven years, and constructed five comparison samples from a set of 100 firms that was itself randomly selected. For each outcome, she discarded comparison candidates that themselves experienced the outcome, and for the remainder selected firm-year observations that semi-matched by year to the treatment sample: each comparison sample had approximately the same distribution by year as its treatment sample. She runs separate logit regressions to explain each of the five 0-1 outcomes. The analysis failed to account for the semi-matching. To correct the analysis, the researcher should have included stratification by year in the analysis. Effectively, she omitted up to six dummy variables for year. Her result that audit 18 committee presence is negatively associated with each of the five financial reporting problems is somewhat under question for the misanalysis. Beasley (1996) examines the relation between the composition of the board of directors and financial statement fraud. He selects a sample of 75 companies reporting fraud between 1980 and 1991. Non-fraud firms are matched with fraud firms by closest size, industry, and time period. The analysis fails to incorporate matching, so the results may not be reliable. To emphasize the importance of these works, take note of a recent paper relying partially upon their contibutions, Srinivasan (2005), which itself uses a matched sample analysed imperfectly. For evidence “that the likelihood of financial reporting problems can be diminished with greater independence and expertise on boards and audit committees” (p. 292), the previous publications he cites are all matched sample studies (Beasley (1996), Dechow, Sloan, Sweeney (1996), Farber (200_), and Agrawal and Chadha (200_).4 Srinivasan examines consequences for outside directors, on the audit committee vs. not, when financial reporting failures occur. This bears on impact of the audit committee, because incentives for board members tie to the effectiveness of the audit committee work. In analysis across restating firms and non-restating firms matched by industry, year, and performance, which a discussant deems to be the “central result” of the paper, Srinivasan suffers Error 3 and, for some models, also Error 1, even though he attempted a differences-in-differences approach that should have avoided Error 1, but did not implement that thoroughly for all models. 4 The latter two were working papers at the time of his citation. Dechow, Sloan, Sweeney (1996) suffers Error 1. We did not include it in our 75 papers reviewed herein as it appeared insufficiently focused upon audit research. Farber (200 ) 19 These are but a few of many corporate governance papers that used matched samples. We included these because of the prominence of the audit committee variable. Research Stream 3: Determinants of Audit Fees A literature on pricing audit services is potentially important in providing guidance in fee negotiation for audit firms and their clients. Six papers from our sample, three papers using between-subjects matching and three using within-subjects matching, contribute to this literature. This is the only research stream which has been studied by a meta-analysis: Hay, Knechel, and Wong (HKW 2006) provide a review of 100 or so papers including these six. HKW’s collected data could possibly be used to judge the accuracy of the 6, by testing whether their results differ from the others’. Simunic (1980) develops a model for determinants of audit fees and also empirically tests it by OLS regressions explaining fees deflated by assets. He collected a stratified sample by surveying financial officers at some of the 8,077 public U.S. companies within four categories (OtherBig8 vs. nonBig8 auditor crossed with auditee size greater or lesser than $125 million), selecting at random within each category. We term this a NCB-SM sample. The OLS regression included intercepts for Big8 vs. NonBig8 (literally a dummy for Big8, and an overall intercept.) However to fully account for the stratification, the OLS regression should have included dummy variables for each category, so three dummies plus the overall intercept were required. (Error 1). Including one additional dummy for large size within Big8, and another for large size within nonBig8 would have been consistent with the sampling done, and would have achieved more efficient, consistent estimation of the coefficient on Big8 that was the main result of interest. 20 We identify a sampling-related issue5: he had sampled at a different rate in each category, sampling at a relatively high rate among the Big8 large auditee subuniverse while collecting less than a proportional share of the nonBig8 small firms. For his results to be generalizable to the universe of public companies, the regression would have to be reweighted: each observation should be given weight equal to the inverse of the sampling rate for its category. That requires information exogenous to the sample collected, namely the numbers of available firms in each of four categories, out of the universe of 8,077 from which he drew. For example, consider the 70 observations collected in the NonBig8 small auditee category, and the 172 observations of Big8 large auditees. If the universe of 8,077 had contained, say 6,000 and 500 of each of those categories respectively, then weighting the NonBig8 small at 6000/70 and the Big8 Large at 500/172 would be appropriate. Thus we find two “errors” in the model specification, Error 1 and Error 3. The reported results are technically incorrect in the sense that they are not the maximum likelihood estimates that they are interpreted to be, and the corresponding standard errors and p-values reported are not what would be yielded by the appropriately weighted OLS regression that included the two omitted intercept variables. Reestimating the model in this way would yield different coefficient estimates and standard errors for all variables, given the non-zero correlations amongst them. Could the results be significantly different? It is certainly possible that some coefficients would drop under or swing over the hurdle of achieving a .05 statistical significance level, and that 5 Simunic expresses concern about response bias, a distinct sampling problem that we do not address. 21 then the interpretation of results would be changed, and hypotheses advanced by following researchers would differ, and so on. Menon and Williams (2001) note: “Simunic (1980), in a seminal study, finds evidence of a Big 6 discount, attributable to scale economies. However, subsequent studies, such as Francis (1984), Craswell et al. (1995), and Palmrose (1986), detect a Big 6 price premium, plausibly attributable to a higher quality of assurance offered by these firms.” (p.117). Perhaps a re-estimation of Simunic’s data could reconcile the reported results to those reported by other researchers who used entirely random samples. Two following studies that use low-balling to explain audit fees also suffer from Error 1: Turpen (1990) employed a stratified sample similar to Simunic’s sample. Turpen’s OLS analysis omitted the dummy variable for the OTC versus exchange traded securities. Walker and Castarella (2000) collected a sample of 80 pairs matching by size and by industry code. Their analysis omitted 80 indicator variables. In addition, for their results to be generalizable, the OLS regressions in each needed to be reweighted using inverse sampling rates as described above. Three studies exploring changes in audit fees do not suffer from any of the three errors. Maher et al. (1992), Sanders et al. (1995), and Iyer and Iyer (1996) each employ matching only in the sense that they are within-subject studies. They collect data for 78 firms, 159 cities, and 270 U.K. firms at two dates: each firm-year observation is “matched” to a later year’s observation for the same firm. They take pairwise differences, and regress changes in fees on variables that are each changes in some other measure. This accounts properly for the matching. In the OLS regression setting, this is 22 equivalent to including 78, 159, and 270 pair-indicator variables, respectively, in a regression on the data for both years pooled together. These regressions are not missweighted either: the first firm-year observation for each pair can be viewed as randomly selected from a universe of firms that exist at both the earlier and later dates. The matching of corresponding later firm-years is collected at a 100% rate. If the pairs are themselves viewed as strata, the sampling rates within each one is the same, so no reweighting is needed. Survivorship bias is a form of selection bias that would apply in considering generalizing their results to all firms. However, the results can be generalized directly to the population of firms that existed at both dates. The most recent of these studies, Walker and Castrella (2000), is the most deficient with respect to the errors wet review, as it omits the greatest number of required variables. We assess that its two main results, that audit fee discounting of new engagements continues into more recent years, and that auditors charge more for firms experiencing losses (perhaps as compensation for higher audit litigation risk) have not been established by this work. The research stream should revisit these questions, rather than rely on this study. Hay, Knechel, and Wong’s (2006) meta-analysis of the audit fee research literature includes all six of these papers. Its powerful conclusions on which variables affect audit fee levels, based on meta-analysis of about a hundred papers, then, are partly misinformed by the effect of misanalysis in these that we review here. 23 Research Stream 4: Prediction of Financial Distress and Fraud Twenty-three papers in our census predict levels of financial distress, measured by Going Concern Qualification (GCQ) or Bankruptcy, in choice-based studies. Some of these papers use GCQ as a predictor of bankruptcy. While studies that do this are numbered in the hundreds, we identify only 23 having an audit research focus or incorporating audit measures. Of these, six employ a CB-NM design, i.e. they have one subsample collected by random selection from financially distressed firms, another by random selection from non-distressed firm, with matching not employed. Obtaining the same number of observations in each subsample is not required. Mutchler (1985) and Lenard et al (1995, 1998, 2000) collect equally sized samples; Foster et al (1998) and Cormier et al (1995) do not. Only one, Foster et al 1998, is exempt from the need to employ reweighting because it uses a logit regression. The others fail to apply necessary reweighting in their statistical analysis. The purposes of the Lenard et al and Cormier et al papers include comparing the performance among logit regression, discriminant analysis, recursive partitioning, and/or neural network methods, and their authors advocate use of the non-logit methods to achieve higher classification accuracy, based on the apparent performance of the models applied without reweighting. We suggest the comparisons should have been performed with each approach incorporating reweighting to address the different sampling rates in the two subsamples. We must readily concede that the non-logit methods, even without reweighting, may well yield models that achieve higher prediction performance than logit models, but we note that, at least without reweighting, they are in effect “black box” methods. In choice-based samples only the logit regressions yield coefficient and p-value 24 estimates that are valid for discussing the contribution of specific variables. The earliest of these papers, Mutchler (1985), first discusses apparent univariate differences between her GCQ vs. non-GCQ subsamples; this is based either on reported means of continuous variables or percentages of firms exhibiting each of several 0-1 characteristics. With hindsight, we note that extended discussion along these lines seems unjustified as statistical significance of differences is not measured (as might be done by unmatched ttests). She goes on to apply discriminant analysis to various combinations of the variables without reweighting.6 An unexpected result is that adding a good vs. bad news identifier variable, constructed by evaluation of management discussion and analysis, causes performance (measured by classification in hold-out samples) to deteriorate, while it was strongly supposed to provide improvement. We conjecture that miss-weighting of the observations could possible contribute to such an anomaly occurring, even when all other statistical assumptions of the discriminant model (normal distribution of each independent variable) are satisfied. We would expect that a Z-score computed from Mutchler’s coefficients, such as Fleak and Wilson (1994) employ, would be informative about GCQ likelihood, just not as informative as a comparable Z-score computed from a reweighted analysis of Mutchler’s data. Fleak and Wilson (1994) consider the information content of the going-concern opinion. Their premise is that some going-concern qualifications provide information affecting security returns and some do not. They created a stratified sample of distressed firms. One stratum is all firms receiving going-concern qualified opinions from 19791986, the second is approximately 25 percent of all other available firms meeting a 6 Tatsuoka(19__) provides discussion of modification of discriminant analyses to avoid Error 1. 25 “troubled firm” criteria. They found that going-concern opinions that are unexpected are associated with negative abnormal security returns. Their analysis suffers from error 3 only. To gain generalizability of their results, reweighting is needed. Twelve papers explaining financial distress employ a CB-FM design: Kida (1980), Levitan and Knoblett (1985), Koh (1991), Ponemon and Schick (1991), Citron and Taffler (1992), Carlson et al (1998), Koh and Tan (1999), Citron and Taffler (2001), Vanstraelen (2002 and 2003), Gaeremynck and Willekens (2003), and Geiger and Rama (2003). They each identified a set of financially distressed firms, and then selected an equally-sized, pair-matched comparison sample of non-distressed firm-year observations, with pairing typically by industry, size, and year. The matching was not accounted for, in any of these papers, in the various statistical models they estimated (all suffer from Error 1;) the authors erroneously believed that they had controlled for industry, size, year and/or other matching variables merely by using matching in the sample selection stage, while in fact it also needs to be controlled for in the analysis. As such, we argue all these papers results should be disregarded: as Maddala (1991) put it, “nothing can be learned from” such miss-specified models. Further, many of these suffer from Error 2 and/or Error 3, although Koh (1991) stands out for applying appropriate reweighting in his probit analysis. Four of the financial distress papers employ a CB-SM design: Chen and Church (1992), Kleinman and Anandarajan (1999), Morris and Strawser (1999), and Behn et al (2001). Chen and Church use matching only by year to select an equal sized comparison sample for their 127 GCQ firm-year observations, thereby creating a matched set for each of five years. Morris and Strawser similarly use matching only by year to select an equal 26 sized comparison sample for 116 Texas banks bankrupted in 1990 and 1991. Behn et al (2001) similarly create year-matched sets for 148 GCQ firms during four years. These three studies apply logit regression to explain GCQ and fail to account for the semimatching (Error 1): essentially they needed to include a dummy variable for each year’s set. Their results are only skewed by the omission of 4, 1, and 3 year-dummy variables, respectively; this may seem minor but is required for the statistical reasoning to be valid, and the authors had themselves identified that year was an important variable to be controlled for. Kleinman and Andarajan report that they match by size to their 61 GCQ firms, but do not explain why they then obtain 173 control firm-year observations; we infer their sample selection was equivalent to semi-matching on year and on size groupings. Kleinman and Andarajan apply stepwise discriminant analysis which suffers from both Error 1 and Error 3. Research Stream 5: Prediction of Audit Qualification A somewhat overlapping set of papers are those that explain audit qualification. There are nine papers that address audit qualifications more generally than those, above, addressing going concern qualifications alone. An early CB-SM study in this area is Dopuch et al (1987) They find 275 firms that experienced audit qualifications from 1969-1980, and semimatched on year alone. The only concern we have is that they did not control for the semimatching in the analysis by including year dummies. Wilkerson (1987) identified 16 firms receiving uncertainty qualifications related to allegations of federal antitrust law violations. He matched the firms receiving 27 uncertainty qualifications to firms identified with allegations of federal antitrust law violations and not receiving uncertainty qualifications. A holdout sample from each group was used to determine the predictive ability of a model based on financial and market variables. Wilkerson’s analysis fails to incorporate matching. Elliott (1992) identified 145 firms receiving “subject to” audit qualifications between 1973 and 1978 and selected control firms with the closest earnings per share measure (calculated by taking a year-to-year change in the EPS in the event year minus the average change in EPS during prior years divided by the estimated standard deviation of the change in EPS) to the sample firms. The analysis failed to control for the “closest” match in the EPS measure. DeFond and Jiambalvo (1993) examined factors related to auditor-client disagreements. They identified 58 companies that reported auditor-client disagreements between 1982 and 1986 and identified control firms that changed auditors but did not report a disagreement. The control firms were matched by industry code (2-digit, 3-digit or 4-digit.) The analysis failed to incorporate matching in the analysis and control for the “closest” match was needed and was not done. Kinney and McDaniel (1993) identified 85 firms from 1976-1988 making yearend announcements of corrections in previously reported interim earnings. The sample was matched to control firms with no corrections in previously reported interim earnings by industry and closest size. While the authors correctly used matching in the analysis of the data, they failed to add additional controls for “closest” match. Given the differences in occurrence rates in the two groups sampled, re-weighting is needed to generalize the results beyond the current sample. 28 Buchman and Collins (1998) identified 60 firms receiving qualified opinions for litigation from 1969-1977. Firms receiving unqualified opinions disclosing litigation uncertainty in a footnote was matched to the firms receiving qualified opinion for litigation by 3-digit SIC codes. The analysis failed to incorporate matching in the analysis. Re-weighting of the sample in both populations was needed for generalization of the results and was not reported. Seipel and Tunnell (2000) reported results from a study examining the use of a “subject to” audit opinion from 1983 to 1987. They identified 135 firms with “subject to” opinions during this time period. The controls firms were matched by 2-digit SIC code, and financial condition measured by Altman’s Z-score. The analysis fails to incorporate matching. Additional control variables are need for “closest” matching and are not reported. To generalize the results beyond the firms in the sample, re-weighting is needed in the analysis. Bartov et al (2001) identified 173 firms with qualified audit reports from 1980 to 1997. Each firm year in the test sample is matched with a control firm with an unqualified opinion in the event year. Matching was done by 2-digit SIC code, auditor type, and nearest asset amount. The analysis fails to incorporate matching. Additional control is needed for “closest” matching and is not reported. Peasnell et al (2001) identified firms with defective financial statements as judged by the Financial Reporting Review Panel. The authors found that the firms judged to have defective financial statements reported weaker performance than firms in the control sample. The authors failed to incorporate matching in the analysis or to add additional controls for “closest” matching. 29 Research Stream 6: Audit Quality We group together eight papers in which audit quality is saliently discussed. Audit firm size has been considered a reasonable proxy for perceived audit quality since DeAngelo (1981). One of these papers directly tests that by comparing peer assessed audit quality to audit firm size measured by number of CPA’s. Eight papers assume that Big 8 / Big 6 / Big 4 audit firms are high quality and that non-Big 8 / Big 6 / Big 4 audits are low quality. Of these one examines determinants of this proxy for audit quality choice, while seven test whether audit quality, so measured, is associated with some outcome. These latter papers implicitly test whether this proxy for audit quality has merit as a proxy, by jointly testing whether the proxy enters with the expected sign into their models. The research stream perhaps is important for auditors and firms, to verify that higher quality audits are worthwhile and justify higher fees. For investors, the research could verify that auditor quality proxy is a valid signal bearing on interpretation of financial statements. What is the purpose of using matched samples in these studies? In contrast to other research streams, the purpose is not to economize on data collection costs. In fact here data is discarded that has already been collected. The authors state reasons which do not hold up as valid reasons, and the matching in these studies together with failure to account for the matching in the analyses render their stated results to be invalid. For example, Becker Defond Jiambalvo Subramanyam (___) state that they are attempting “to increase comparability” by selecting a semi-matched sample, dropping Big 8 audited firms in industry / year / cash flow size regions of the population where there are not also non Big 6 audited firms . And Feltham, Hughes, Simunic (____) similarly state “we 30 control for the size effects ...by selecting two subsamples of firms, dichotomized by their choice of small versus large auditor, such that the firm size distribution is the same in both subsamples. This is accomplished by including a firm in a particular subsample only if we can ‘match’ it with a firm of corresponding size in the other subsample.” (p.381) This reasoning is invalid. It serves no purpose to drop firms selectively and then apply an unmatched analysis. If data is selectively dropped to derive a fully- or semimatched sample, then only conditional results can be obtained, rather than the unconditional results that are asserted. The empirical relationship between size and audit quality is assessed by only one paper, Colbert and Murray (1998), in this set. Using a CB-NM design Colbert and Murray study small audit firms (half having 3 or fewer CPA’s) that are evaluated for quality by state societies of CPAs or by the AICPA through a peer review process. They measure size by the number of CPA’s working for the audit firm. Using ordered logit to explain assessed quality in on-site reviews, they find the size variable to be significant. As the study does not use a matched sample and uses logit regression to analyse the data, the study is not subject to any of the three errors. Clarkson and Simunic (1994) is the one CB-FM study. Clarkson and Simunic (1994) use logit regression to explain firms’ choices of auditor type over 44 matched pairs of data. It did not incorporate matching. Given the authors’ discussion, the correct analysis incorporating the matching would be best conceptualized, for them, as running the pairwise difference in audit quality (always 1) as a no-intercept logit regression over 44 pairwise differences of independent variables. This would have been appropriately avoided Error 1 and is equivalent to running their logit regression modified to include 87 31 pair-indicator variables, over 88 observations. Other principal analyses include univariate tests of differences in means; it is not clear in our readings whether they applied matched t-tests which would be correct; if they applied unmatched t-tests their reasoning was not valid. OLS regression results over a non-matched sample would be valid; the same OLS regression over the pooled matched pairs is not. All of the analyses performed suffered from one or both of Error 2 and Error 3, as well, because need to account for imperfection in closest matching and need to provide reweighting to reflect population proportions were not addressed. In two papers, the authors create two samples, run models separately in each sample, and claim results from comparisons made. Feltham, Hughes, Simunic (1991) select three samples of 50, 40, and 37 fully matched pairs from 251 high- and 141 low-audit-quality firms identified in prior research, matching on just one variable for each: closest assets, closest IPO proceeds, and closest MVE, respectively. They run an OLS regression explaining market value, which do not incorporate matching, simultaneously within the high quality and low quality subsamples, and report Chi-squared statistics from Chow tests whether coefficients estimated in the different subsamples are different. For the tests to be valid, the regression would need to include fixed effects for the pairings. Allen matched cities with Moody’s bond ratings from 1978-1986 by size and Big 8 vs. non-Big 8 auditor. He ran separate logistic regressions within the 125 Big 8 Cities subsample and the 125 non-Big 8 cities to predict bond ratings for the cities, as a function of accounting ratios. He found that the model for Big 8 cities had higher explanatory power. His primary hypothesis was that “accounting information audited by Big 8 32 auditors is better able to predict municipal bond ratings than is accounting information audited by non-Big 8 auditors.” (p 119) He found rank correlation of .448 between predicted and actual ratings for the Big 8 cities versus .185 correlation for the non Big 8, and reported two tests. However, these analyses failed to account for the matching in sample selection, and because the cities were matched by population size, additional control for “closest” matching was also needed. Therefore Allen’s study suffers from Error 1 and Error 2. His result, while reasonable sounding, is not established by his analysis. We cannot ourselves imagine how to adjust his tests to be valid, given his sample selection, but the onus is on the researcher, not us, to conduct the research in manner so that appropriate tests can be performed. Taking a random sample and controlling for city size explicitly in the model, then applying a bootstrapping approach to statistically measure the difference in R-squareds (as Joos and Lang 1994 do) would be appropriate. Three papers are NCB-FM which use auditor quality, proxied by Big 8 / Big 6 / Big 4 or not, as an explanatory variable in explaining various outcomes. Teoh and Wong (1993) use matching by year, industry, and closest in size to select 1282 pairs, and then run an OLS regression explaining CAR over a window from earnings forecast to earnings announcement; their analysis does not incorporate matching. Again, those not incorporating the matching should be disregarded. Teoh and Wong essentially omitted 1,262 dummy variables that would have implemented control for year, industry and size. Although Teoh and Wong use assets as their measure of size in their sample selection, they include log of market value as a firm size measure in their analysis; matching on log of market value originally would have been more appropriate. 33 Both studies suffer from Error 2 and Error 3: control for closest matching (in Teoh and Wong’s case, by including at least a linear term for assets) was needed and also reweighting was needed, for the results to be generalizable. Bauwhede, Willekens, and Gaeremynck (2003) matched companies listed on the Brussels Stock Exchange by industry and size with nonlisted firms in the same industry and size to determine if Belgian companies engage in earnings management. The analysis fails to incorporate matching. Reweighting is needed to generalize results to the population and is not done. Finally, two papers, Becker, DeFond, Jiambalvo, and Subramanyam (1998) and Krishnan (2003) use a NCB-SM design. Becker, Defond, Jiambalvo, and Subramanyam specifically that they do not use a matched sample design, but they nonetheless proceed to select what we term a semi-matched sample. They match companies with Big 6 auditors and non-Big 6 auditors by year, industry, and cash flows to determine if discretionary accruals are greater for companies with Big 6 or non-Big 6 auditors. Their exposition of the selection process is a negative one: they discuss how they exclude observations for firms not having a match, which yields the same result that including only firms having matches would accomplish. It is not pair-matching, but this accomplishes a many-to-many matching.7 Their analysis, not surprisingly, fails to incorporate matching for year and industry. To obtain valid results they needed to have included indicator variables for each year-industry-cash flow decile combination to 7 They state “We emphasize that our sample selection procedure is not a matched sample design. Rather, the constraints on year, industry, and cash flow are employed in an attempt to induce a reasonable amount of comparability across the Big Six and non-Big Six samples. Because the Big Six clients vastly outnumber non-Big Six clients in the COMPUSTAT database, a one-to-one match does not yield a sample that is reflective of the population. Our main result (that Big Six auditors have lower discretionary 34 incorporate matching in the analysis. Also, reweighting is needed to generalize their results to the population and is not done. Krishnan similiarly applies an exclusion process which amounts to matching companies having Big 6 and non-Big 6 auditors by year, industry, and cash flows. He seeks to determine if discretionary accruals are associated with stock returns or future profitability. His analysis likewise fails to incorporate matching and omits reweighting needed to generalize the results to the population. In this grouping of research papers, all but one (Colbert and Murray) suffers Error 1, so the miscellaneous results stated (that audit quality is related to market value of IPO’s, to ERC’s, to municipal bond ratings, etc.) have not in fact been supported. Becker DeFond Jiambalvo and Subramanyam state in a footnote that their main result (that Big Six audited firms have lower discretionary accruals) holds in non-reported analysis over all Compustat firms, but there is no way for us to evaluate whether other results are robust to the misspecifications that have been imposed. Research Stream 7: Auditor Judgments Research on auditor judgments examines the way auditors make decisions. The use of matched samples may be a strategic decision because data collection from audit personnel is very costly. The field may include such studies where there is matching in the sense of within-subject, before and after data collection, but none are highlighted as using matching. Use of blocking in ANOVA to correctly account for within-subject matching is well understood and is equivalent to the use of dummy variables for each matched set. accruals) is not dependent on the sample selection procedure. Our results hold even when we use the 35 We identified only two papers in this field where the matching was across subjects. Jamal and Tan (2001) use a sample consisting of 14 threesomes of auditors (manager, “top” auditor, and “mediocre” auditor) who work together on client engagements. They test for differences in performance on a judgment task across roles, eg. managers’ vs. “top” seniors’ performance, by t-tests that appear to be unmatched. Accounting for the matching within their sample selection would require use of matched t-tests, instead, that would report results with seven fewer degrees of freedom. It is not possible for us to assess whether doing so would yield higher or lower statistical significance than what was reported, but the matched test is what is required by the situation (the degrees of freedom are not “used up” by a choice after the fact, they were not available in the sample collected). Unmatched tests do not have validity for generalizing about the performance differences across roles. Bedard and Graham (2002) sample contains 23 pairs of auditors who work together on the same engagement. They treat one auditor with an aid intended to give a negative orientation in a risk assessment judgment task, the other with one to give a positive orientation. OLS regressions are applied to explain number of risk factors identified as a function of negative vs. positive orientation as well as auditor experience. Accounting for the pairing by including fixed effects is mandated by the data selection process, but appears not to have been done. Coefficients and p-values would differ; it is not discernable whether the “loss” of 23 degrees of freedom would decrease apparent precision or whether precision would be enhanced by implementing the control for within-client-engagement experience that may be obscuring precision. population of COMPUSTAT firms. 36 Chung, Firth, and Kim, (2003), a NCB-FM-B study considers whether Big 6 auditing firms use more conservative accounting than non-Big 6 firms, when the company reports bad news. They develop a sample of firms with a Big 6 auditor during the 10-year period 1988-1997 and for each year selected and matched them to a sample of firms with non-Big 6 auditors by 2-digit SIC code and closest in size. This paper suffers from error 1, 2 and 3. The analysis fails to incorporate matching. Additional control is needed for “closest size.” To generalize the results to the population, the researcher needs to reweight the sample. Research Stream 8: Auditor Changes Auditor changes are events of concern to firms and investors because auditor change may contain information content. Researchers are interested in auditor changes because identifying factors associated with changes may be useful for predicting auditor change. Knowing when clients change auditors or when auditors are likely to resign from an engagement may allow an audit firm to control the overall audit risk. The fourteen papers in this research stream can be grouped into three categories: (1) market reaction to auditor change; (2) causes of auditor change; and (3) consequences of auditor change. Seven papers are CB-FM and six papers are CB-SM. Eleven of the thirteen papers suffer from error 1, a failure to take matching into account in the analysis of the data. The results of all these papers are suspect, both in the results found and the results not found. A number of researchers discuss areas where results were expected, but not found (e.g. Dhaliwal, Schatzberg, and Trombley 1991; Schwartz and Soo 1995.) In these situations, the failure to find significant results may be a result of either miss- 37 analyzing data (this biases the researcher against finding results) or in not controlling for other factors in the analysis. Several studies used univariate analysis, rather than multivariate analysis. Univariate analysis does not allow a researcher to control for other variables that may affect the analysis and provides very inconclusive results in a situation where many variables impact the result. Matched sample studies are used in this area of research because data collection is costly. Data is hand-collected from annual reports, 8K’s, 10-K’s, proxy statements, Moody’s Manuals, and Standard & Poor’s Register of Corporations. Using a matched sample when data collection is costly can be an efficient way to analyze data. Market Reaction to Auditor Changes Three papers consider market reactions to auditor changes: Fried and Schiff (1981), Schwartz and Soo (1995), and Dunn, Hillier and Marshall (1999.) Schartz and Soo and Dunn, Hillier, and Marshall suffer from error 1 making the results reported in their papers highly questionable. Fried and Schiff (1981) use univariate pairwise t-tests of difference in market returns of firms that switch auditors vs. beta- and industry-matched firms that do not switch auditors. Fried and Schiff report a negative market reaction to a change in auditors. Schwartz and Soo (1995) consider whether 8-K disclosures of auditor changes by firms approaching bankruptcy are systematically different from 8-K disclosures of non-bankrupt firms changing auditors. The results reported in the paper support the researchers'contention that disclosures of bankrupt firms differ from the disclosures of non-bankrupt firms. The researchers fail to find evidence that the market penalizes bankrupt companies that delay the filing of the 8-K report, contrary to expectations. 38 Dunn, Hillier and Marshall (1999) consider the reaction of the stock market in the United Kingdom to auditor resignation letters. Their results indicate that the market reacts negatively to the auditor resignation letter, even though few auditors indicated client problems in the letter. Because the last two studies suffer from error 1, the conclusions reported by the researchers may not be supported by re-analysis of the evidence. Causes of Auditor Changes Seven papers examine causes for auditor changes. All seven papers in this area suffer from error 1, so the results reported are highly questionable. While the conclusions reached by the researchers may appear to be reasonable or consistent with a “commonsense” understanding of the phenomena of auditor change, the data analysis reported in the papers does not provide evidence to support the research theories. Schwartz and Menon (1985), consider the reasons bankrupt companies change auditors. Possible factors associated with a change might include: audit qualifications, reporting disputes, management changes, audit fees, increased need for larger auditors as firm grows, but their analysis fails to support any of these explanations. Results indicate only that failing firms have a greater tendency to switch auditors. Seabright, Levinthal, and Fichman (1992) consider the role that personal attachments between management and the auditor play in auditor changes. They found that changes in the resource requirements of a company increased the liklihood of switching auditors. Attachments of the CFO and Audit Committee to the auditor decreased the liklihood of ending the relationship. Some of the reported results were contrary to predictions. Re-analysis taking the matching into account may provide support for their predictions. 39 Krishnan and Krishnan (1997) examine factors influencing auditor changes by auditor resignation or by dismissal. The study uses two semi-matched samples, by industry and year or by year alone. The logit analysis fails to account for the matching variables and hence suffers Error 1 and 3. An unpublished partial replication of this work suggests that Krishnan and Krishnan’s main result (that an audit litigation risk variable is significant in determining auditor resignation) is negated when the matching is accounted for as it should be.8 Menon and Williams (1999), consider audit changes initiated by the auditor rather than the client. They hypothesize that auditor decisions to resign or decline to continue an engagement can be explained by audit error cost. Although their results support their belief, the analysis suffers from error 1. The reported results may change on re-analysis of the data. Shu (2000) hypothesizes that auditor resignation is positively related to increased client legal exposure and to a mismatch between the auditor and the client. Her results provide support for her hypotheses. She finds significant and negative stock reactions associated with auditor resignations. The greater the increase in litigation risk, the larger the drop in stock price, if the resignation occurs during the audit. Because Shu' s paper suffers from error 1, her results may not be supported by re-analysis. Woo and Koh (2001) examines the reasons for auditor changes in Singapore. Their sample includes companies changing auditors and companies not changing auditors from 1986-1995. They find that auditor changes are more likely in cases where firms engage smaller audit firms, change their management composition, experience a lower diffusion of ownership, experience an increase in income manipulation opportunities, have higher leverage, have 8 We do not have the partial replication results to share, as they have not been made available to us. 40 many subsidiaries or have more rapid growth. The researchers tested sixteen characteristics to explain the direction of auditor change. Of the sixteen variables tested, ten were found to be significant. Because their results failed to incorporate matching, the variables identified may be inaccurate. The insignificant variables may also prove to play a role in the audit change process if the data was subject to re-analysis. Carcello and Neal (2003) examine auditor dismissals following a going concern opinion report issued by Big 6 auditors between 1988 and 1999. They find that audit committees with greater independence, greater governance expertise, and lower stockholdings are more effective in shielding auditors from dismissal after issuance of a going concern opinion. The researchers report a counter-intuitive positive relationship between audit committee members’ financial expertise and dismissals following clean opinions. Because this study suffers from error 1, the results reported in the paper may change when re-analysed. Consequences of Change in Auditors Four papers consider the consequences of changing auditors in terms of auditorclient fit, the level of non-audit services consumed, the level of performance of the company changing auditors, and the quality of financial statements of the company changing auditors. DeBerg, Kaplan, and Pany (1993) does not suffer from error 1. Eichenseher and Shields (1989), Dhaliwal, Schatzberg, and Trombley (1993), and Johnson, Khurana, and Reynolds (2002) suffer from error 1, rendering the results from their studies questionable. Eichenseher and Shields (1989) posit that the demand for audit services is a function of the firm' s capital structure. The researchers consider the relationship between 41 the auditor (Big 8 or not) and four capital structure variables. None of the samples generated models that were predictive. The paper suffered from error 1 so reanalysis may be useful. DeBerg, Kaplan and Pany investigated whether there was an association between the decision to change auditors and the level of non-audit services provided. They failed to find an association in their analysis. The researchers did report that companies reduced their consumption of non-audit services after a change in auditors. Dhaliwal, Schatzberg, and Trombley (1993) investigate the information content of disagreement disclosures surrounding auditor changes. They find that clients changing auditors after a disagreement have poorer earnings performance, more debt, lower levels of current assets, and poorer stock price performance than firms changing auditors without a disagreement and other firms in the same industry. Because the research suffers from error 1, the variables identified and the relationships between the variables and auditor change are unreliable. Johnson, Khurana, and Reynolds consider whether audit firm tenure is associated with financial reporting quality. Johnson, Khurana, and Reynolds selected a COMPUSTAT sample of companies audited by short-tenure firms(2-3 years), medium tenure firms (4-8 years)and long tenure firms (9 years or longer) with companies matched on industry and size. The analysis fails to incorporate matching so the results do not reflect the condition nature of the research design. Reweighting is needed to generalize to the population and is not performed. Bears on whether mandatory rotation of audit firms is good policy. 42 Research Stream 9: Detecting Management Fraud Research in the area of fraud detection is designed to identify factors present when fraud occurs. This information would be useful for managers interested in preventing fraud, for auditors responsible for designing the audit so material fraud is found, and for the regulators making institutional changes likely to lessen the occurrence of fraud within a company. Results of research done in this area indicate that analytical procedures might be useful for identifying situations where fraud might be present. Research conclusions related to the impact of the independence of the board of directors or the presence of an audit committee are subject to error 1. Therefore the conclusions of the research may not be supported during re-analysis. One research paper identified fraud risk factors that might be associated with fraud, but this paper was also subject to error 1, so the fraud risk factors identified may not stand up to reanalysis. Matched sample and choice-based research designs have been used to investigate fraud due to the costs associated with manual data collection. Four research studies consider the potential for detecting management fraud in the planning stage of an audit. Two of the studies are choice-based fully matched, one is choice-based semi-matched, and one is a between subjects non-choice-based study. Three of the four studies suffer from Error 1. Two of the studies use closest matching (on size) in pair selection, hence would be required to account for that in the analysis, but do not (suffering error 2.) Two of the studies use logit and hence are exempt from the need to reweight data; the other two suffer error 3. Green and Calderon (1995) identify 86 instances of 10-K statements subsequently found to be fraudulent, and match to 86 non-fraud firm year observations, with matching 43 by industry, year, and “comparable” size. The work tests whether simple analytical procedures in the early stage of an audit are effective in signaling management fraud. Their analysis is by univariate ranked sums of differences, takes matching into account, but suffers Error 2 for not addressing residual difference in size, and suffers Error 3 because reweighting is not done that would allow the results to generalize to any larger population. The results of the study indicate that analytical procedures are fairly effective at detecting management fraud. Beasley (1996), also discussed in Research Stream 3 on Audit Committee Impact, employs 75 matched pairs of fraud and non-fraud firms matched by year, stock exchange, size (market value within 30%), and industry to consider whether including a larger proportion of outside members on the board of directors reduces the likelihood of fraud. The results indicate that firms without fraud have a significantly higher percentage of outside members than firms with fraud. Audit committee presence was found not to be significant, seemingly contrary to Beasley’s priors. This work was viewed as important, innovative work on an important topic and received the 1995 AAA Competitive Manuscript Award. The error suffered Error 1, which Cram, Karan, and Stuart (2007) show is severe. Sarbanes-Oxley requires an independent audit committee, rather than an independent board of directors implementing the reverse of what was implied by this paper. It seems possible that, in reanalysis, Beasley’s main result, that percentage of outside directors affects the likelihood of financial fraud, could be reversed, and that the presence of audit committee instead is a significant driver. Green and Choi (1997) use the sample of firms from Green and Calderon (1995) to apply a neural network approach which appears to pool the data and not account for 44 the pairings; hence suffers Error 1 in addition to Errors 2 and 3. They assert that their neural network model achieves higher reliability levels than previous models developed to detect the presence of fraud in financial statements. Bell and Carcello (2000) is a CB-SM study that develops a model to estimate the likelihood of fraudulent financial reporting for an audit client based on the presence or absence of fraud-risk factors. The researchers identified 7 factors from the list of 46 fraud risk factors that were associated with the likelihood of fraudulent financial statements. Because this paper suffers from Error 1, the variables identified may not prove to be significant after re-analysis. Also, variable believed to be insignificant might be found to be significant, in reanalysis. DISCUSSION OF AUDIT RESEARCH STUDIES BY JOURNAL AND YEAR Frequency of Errors by Journal: In Table 4, we tabulate the occurrence of errors across journals in which the articles appear. The top three journals in terms of numbers of papers in our sample are Auditing: a Journal of Practice & Theory, with 22 papers, The Accounting Review, with 10 papers, and Accounting and Business Research, with 9 papers. This is reflective of where auditing papers appear. There is no obvious evidence of editorial differences across journals relating to the errors addressed in this paper. Frequency of Errors by Year: Examination of trends over time, tabulated in Table 5, is cause for concern. Overall, 75% of papers have suffered Error 1, the most essential error, and all but one of 24 recent papers (published during 2000-2003) suffer from it. The exception appeared in a journal outside the mainstream of accounting research, in Decision Sciences, it 45 employed a Choice-Based Non-Matched sample, so did not require matching to be accounted for in the analysis. The average number of conceptual errors has hovered around two per paper. Cross Tabulation by Journal Quality against Time Range: When we group journals by quality category (Higher tier accounting journals, AJPT, lower tier accounting journals, vs. non-accounting journals) and by year ranges, as tabulated in Table 6, we begin to reveal some trends in the dispersion of the matched and choice-based research designs. Audit researchers used matched and choice-based samples first in the Upper tier journals (defined as CAR, JAE, JAR, TAR). Usage in AJPT and in lower tier journals followed, and the cumulative usage is now about equal. Audit research has appeared in journals that are not primarily accounting journals only more recently. However, the average number of errors does not yet appear to differ across journal quality categories. With the distribution of CKS (2007) and the present research, we are interested to see which journal editors enforce changes going forward. SUMMARY AND CONCLUDING REMARKS This article examined the use of choice based and matched sample designs in published auditing research, identified model specification issues, and clarified the matched sample research design approach in order to promote its use as an effective tool. Researchers have often selected matched samples based on industry and size, and assumed this effectively controls for industry and size effects in their studies. But, as has been demonstrated with simulations and formal proofs, this approach can and does lead to incorrect conclusions. This leaves us with uncertainty in the accuracy of results of a 46 number of published papers within auditing and beyond (in other accounting and finance research). Perhaps more importantly, the power to detect statistically significant relationships may have been lost in numerous research studies that have gone unpublished, despite researchers having collected data that may have held important results. We hope that our identification of the need to reanalyze various auditing research studies will spur expanded use of the choice based and matched sample design in future accounting research. Data analysis using properly specified models might allow researchers to find new results in previously published choice based and matched sample studies, or to resolve discrepancies of findings within and across papers. Additionally, a better understanding of matched sample design may facilitate reanalysis of datasets that have yet to yield publishable results, perhaps due to incorrect analysis. Further the improved understanding should facilitate more rapid penetration of entirely new areas of research interest where data collection is initially costly. 47 References Abbott, L. J., Y. Park, and S. Parker. 2000. The Effects of Audit Committee Activity and Independence on Corporate Fraud. Managerial Finance 26 (11): 55-67. Agrawal, A. and S. Chadha. 2005. Corporate Governance and Accounting Scandals. Journal of Law, Economics & Organization. 48: 371-406. Allen, A. C. 1994. The Effect of Large-Firm Audits on Municipal Bond Rating Decisions. Auditing: A Journal of Practice & Theory 13 (1): 115-125. Altman, E.I. 1968. Financial Ratios as Predictors of Failure. Journal of Finance 23(4): 589-609. Bartov, E., F. A. Gul, and J. S. L. Tsui. 2001. Discretionary-Accruals Models and Audit Qualifications. Journal of Accounting and Economics 30: 421-452. Bauwhede, H. V., M. Willekens, and A. Gaeremynck. 2003. Audit Firm Size, Public Ownership, and Firms’ Discretionary Accruals Management”. International Journal of Accounting 38:1-22. Beasley, M. S. 1996. An Empirical Analysis of the Relation Between the Board of Director Composition and Financial Statement Fraud. The Accounting Review 71 (4): 443-465. Beaver, W. 1966. Financial Ratios as Predictors of Failure. Empirical Research in Accounting: Selected Studies, 1966, supplement to Journal of Accounting Research: 71-111. Beaver, W. 1968. Market Prices, Financial Ratios, and the Prediction of Failure. Journal of Accounting Research 6(2): 179-192. Becker, C. L., M. L. DeFond, J. Jiambalvo, and K. R. Subramanyam. 1998. The Effect of Audit Quality on Earnings Management. Contemporary Accounting Research 15 (1): 1-24. Bedard, J. C., and L. E. Graham. 2002. The Effects of Decision Aid Orientation on Risk Factor Identification and Audit Test Planning. Auditing: A Journal of Practice & Theory 21 (2): 39-56. Behn, B. K., S.E. Kaplan, and K. R. Krumwiede. 2001. Further Evidence on the Auditor’s Going-Concern Report: The Influence of Management Plans. Auditing: A Journal of Practice & Theory 20 (1): 13-28. 48 Bell, T. B., and J. V. Carcello. 2000. A Decision Aid for Assessing the Likelihood of Fraudulent Financial Reporting. Auditing: A Journal of Practice & Theory 19 (1): 169-184. Buchman, T., and D. Collins. 1998. Uncertainty about Litigation Losses and Auditors. Journal of Business Research 43: 57-63. Carcello, J. V., and T. L. Neal. 2003. Audit Committee Characteristics and Auditor Dismissals following ‘New’ Going-Concern Reports. The Accounting Review 78 (1): 95-117. Carcello, J. V., and A. L. Nagy. 2004. Audit Firm Tenure and Fraudulent Financial Reporting. Auditing: A Journal of Practice & Theory 23(2): 55-69. Carlson, S. J., G. W. Glezen, and M. E. Benefield. 1998. An Investigation of Investor Reaction to the Information Content of a Going Concern Audit Report while Controlling for Concurrent Financial Statement Disclosures. Quarterly Journal of Business and Economics 37 (3): 25-39. Chen, K. C. W., and K. B. Church. 1992. Default on Debt Oblication and the Issuance of Going-Concern Opinions. Auditing: A Journal of Practice & Theory 11 (2): 3049. Chung, Firth, Kim . 2003. Auditor Conservatism and Reported Earnings. Accounting and Business Research 33 (1): 19-. Citron, D. and R. Taffler. 1992. The Auditor Report under Going Concern Uncertainties: an Empirical Analysis. Accounting and Business Research 22: 337-345. Citron, D. and R. Taffler. 2001. Ethical Behavior in the U.K. Audit Profession: The Case of the Self-Fulfilling Prophecy Under Going-Concern Uncertainties. Journal of Business Ethics 29: 353-363. Clarkson and Simunic. 1994. The Association Between Audit Quality, Retained Ownership, and Firm-Specific Risk in U.S. vs. Canadian IPO Markets. Journal of Accounting and Economics 17: 207-228. Colbert, G., and D. Murray. 1998. The Association Between Auditor Quality and Auditor Size: An Analysis of Small CPA Firms. Journal of Accounting, Auditing, and Finance 13 (2): 135-150. Cormier, D., M. Magnan, and B. Morard. 1995. The auditor’s consideration of the going concern assumption. Journal of Accounting, Auditing, and Finance 10 (2): 201222. 49 Cram, D., V. Karan, and I. Stuart. 2005. Three Threats to Validity of Choice-Based and Matched Sample Studies in Accounting Research. Working paper. Available at the Social Science Research Network at http://ssrn.com/abstract=955031. DeAngelo, L. 1981. Auditor Size and Audit Quality. Journal of Accounting and Economics 3(3): 183-199. DeBerg, C., S. Kaplan, and K. Pany. 1991. An Examination of Some Relationships Between Non-Audit Services and Auditor Change. Accounting Horizons (March): 17-28. Dechow, P.M., R.G. Sloan and A.P. Sweeney. 1996. Causes and Consequences of Earnings Manipulation: An Analysis of Firms Subject to Enforcement Actions by the SEC. Contemporary Accounting Research 13 (1):1-36. DeFond, M. L., and J. Jiambalvo. 1993. Factors related to auditor-client disagreements over income-increasing accounting methods. Contemporary Accounting Research 9 (2): 415-431. Dhaliwal, D. S., J. W. Schatzberg, and M. A. Trombley. 1993. An Analysis of the Economic Factors Related to Auditor-Client Disagreements Preceding Auditor Change. Auditing: A Journal of Practice & Theory 12 (2): 22-38. Dopuch, N., R. W. Holthausen, and W. Leftwich. 1987. Predicting Audit Qualifications with Financial and Market Variables. The Accounting Review 62 (3): 431-454. Dunn, J., D. Hillier, and A. P. Marshall. 1999. The Market Reaction to Auditor Resignations. Accounting and Business Research 29 (2): 95-108. Eichenseher, J. W., and D. Shields. 1989. Corporate capital structure and auditor ‘fit’. Advances in Accounting, Supplement 1: 39-56. Elliott, J. A. 1992. “Subject to” Audit Opinions and Abnormal Security Returns— Outcomes and Ambiguities. Journal of Accounting Research 20 (2): 617-638. Farber, D.B. 2005. Restoring Trust After Fraud: Does Corporate Governance Matter? The Accounting Review 80(2): 539-561. Feltham, G. A, J. S. Hughes, and D. A. Simunic. 1991. Empirical Assessment of the Impact of Auditor Quality on the Valuation of New Issues. Journal of Accounting and Economics 14: 375-399. Fleak, S. K., and E. R. Wilson. 1994. The Incremental Information Content of the GoingConcern Audit Opinion. Journal of Accounting, Auditing, and Finance 9 (1): 149166. 50 Foster, B. P., T. J. Ward, and J. Woodroof. 1998. An Analysis of the Usefulness of Debt Defaults and Going Concern Opinions in Bankruptcy Risk Assessment. Journal of Accounting, Auditing and Finance 13 (3): 351-376. Fried, D., and A. Schiff. 1981. CPA Switches and Associated Market Reactions. The Accounting Review 56 (2): 326-341. Gaeremynck, A., and M. Willekens. 2003. The Endogenous Relationship Between AuditReport Type and Business Termination: Evidence on Private Firms in a NonLitigious Environment. Accounting and Business Research 33 (1): 65-79. Geiger, M.A. and D.V. Rama. 2003. Audit Fees, Nonaudit Fees, and Auditor Reporting on Stressed Companies. Auditing: A Journal of Practice & Theory 22(2): 53-69. Green, B. P., and T. G. Calderon. 1995. Analytical Procedures and Auditors’ Capacity to Detect Management Fraud. Accounting Enquiries 5 (1): 1-48. Green, B. P., and J. H. Choi. 1997. Assessing the Risk of Management Fraud Through Neural Network Technology. Auditing: A Journal of Practice & Theory 16 (1):14-28. Hay, D. C., W. R. Knechel, and N. Wong. 2006. Audit Fees: A Meta-analysis of the Effect of Supply and Demand Attributes. Contemporary Accounting Research 23(1):141-91. Heninger, W. G. 2001. The Association between Auditor Litigation and Abnormal Accruals. The Accounting Review 76 (1): 111-126. Iyer, V. M., and G. S. Iyer. 1996. Effect of Big 8 Mergers on Audit Fees: Evidence from the United Kingdom. Auditing: A Journal of Practice & Theory 15 (2): 123-132. Jamal, K., and H. Tan. 2001. Can Auditors Predict the Choices Made by Other Auditors? Journal of Accounting Research 39 (3): 583-597. Johnson, V. E., I. K. Khurana, and J. K. Reynolds. 2002. Audit-firm Tenure and the Quality of Financial Reports. Contemporary Accounting Research 19 (4):637660. Kida, T. 1980. An Investigation into Auditors’ Continuity and Related Qualification Judgments. Journal of Accounting Research 18 (2): 506-523. Kinney, W. R., and L. S. McDaniel. 1993. Audit Delay for Firms Correcting Quarterly Earnings. Auditing: A Journal of Practice & Theory 12 (2): 135-142. 51 Kleinman, G., and A. Anandarajan. 1999. The Usefulness of Off-Balance Sheet Variables as Predictors of Auditor’s Going Concern Opinions. Managerial Auditing Journal 14 (6): 273-285. Koh, H. C. 1991. Model Predictions and Auditor Assessments of Going Concern Status. Accounting and Business Research 21 (84): 331-338. Koh, H. C., and S. S. Tan. 1999. A Neural Network Approach to the Prediction of Going Concern Status. Accounting and Business Research 29 (3): 211-216. Krishnan, G. 2003. Audit Quality and the Pricing of Discretionary Accruals. Auditing: A Journal of Practice & Theory 22 (1):109-126. Krishnan, J., and Krishnan, J. 1997. Litigation Risk and Auditor Resignations. The Accounting Review 72 (4): 539-560. Kreutzfeldt, R.W., and Wallace, W. A. 1986. Error Characteristics in Audit Populations: Their Profile and Relationship to Environmental Factors. Auditing: A Journal of Practice & Theory 6 (1):20-43. Kreutzfeldt, R.W., and Wallace, W. A. 1990. Control Risk Assessments: Do They Relate to Errors? Auditing: A Journal of Practice & Theory. Krogstad, L., and G. Smith. 2003. Assessing the Influence of Auditing: A Journal of practice & Theory: 1985-2000. Auditing: A Journal of Practice & Theory 22 (1): 195-204. Lenard, M. J., P. Alam, and G. R. Madey. 1995. The Application of Neural Networks and a Qualitative Response Model to the Auditor’s Going Concern Uncertainty Decision. Decision Sciences 26 (2): 209-227. Lenard, M. J., P. Alam, and G. R. Madey. 1998. The Design and Validation of a Hybrid Information System for the Auditor’s Going Concern Decision. The Journal of Management Information Systems 14 (4): 219-237. Lenard, M. J., P. Alam, and D. Booth. 2000. An Analysis of Fuzzy Clustering and a Hybrid Model for the Auditor’s Going Concern Assessment. Decision Sciences 31 (4): 861-884. Levitan, A. S., and J. A. Knoblett. 1985. Indicators of Exceptions to the Going Concern Assumption. Auditing: A Journal of Practice & Theory 5 (1): 26-39. Lys, T. and R. Watts. 1994. Lawsuits Against Auditors. Journal of Accounting Research 32 (Supplement): 65-93. 52 Maddala, G. S. 1991. A Perspective on the Use of Limited-Dependent and Qualitative Variables Models in Accounting Research. The Accounting Review 66: 788-807. Maher, M., P. Tiessen, R. Colson, and A. Broman. 1992. Competition and Audit Fees. The Accounting Review 67: 199-211. McMullen, D. A. 1996. Audit Committee Performance: An Investigation of the Consequences Associated with Audit Committees. Auditing: A Journal of Practice & Theory 15 (1): 87-103. Menon, K., and D. D. Williams. 1999. Error Cost and Auditors’ Termination Decisions. Journal of Accounting, Auditing and Finance: 95-123. Morris, R. E.,and J. R. Strawser. 1999. An Examination of the Effect of CPA Firm Type on Bank Regulators’ Disclosure Decisions. Auditing: A Journal of Practice & Theory 18 (2): 143-158. Mutchler, J. 1985. A Multivariate Analysis of the Auditor’s Going-Concern Opinion. Journal of Accounting Research 23(2): 668-682. Palepu, K.G. 1986. Predicting Takeover Targets: a Methodological and Empirical Analysis. Journal of Accounting and Economics 8: 3-35. Peasnell, K. V., P. F. Pope, and S. Young. 2001. The Characteristics of Firms Subject to Adverse Ratings by the Financial Reporting Review Board. Accounting and Business Research 31(4): 291-311. Ponemon, L. and A. Schick. 1991. Financially Distressed Companies and Auditor Perceptions of the Twelve Characteristics of Decline. Auditing: A Journal of Practice & Theory 10: 70-83. Richardson, S.A. 2005. Discussion of Consequences of Financial Reporting Failure for Outside Directors: Evidence from Accounting Restatements and Audit Committee Members. Journal of Accounting Research 43(2): 334-342.. Sanders, G., A. Allen, and L. Korte. 1995. Municipal Audit Fees: Has Increased Competition Made a Difference? Auditing: A Journal of Practice & Theory 14 (1): 105-114. Schwartz, K. and K. Menon. 1985. Auditor Switches by Failing Firms. The Accounting Review 60 (April): 248-261. Schwartz, K. B., and B. S. Soo. 1995. An Analysis of Form 8-K Disclosures of Auditor Changes by Firms Approaching Bankruptcy. Auditing: A Journal of Practice & Theory 14 (1): 125-136. 53 Seabright, M. A., D. A. Levinthal, and M. Fichman. 1992. Role of Individual Attachments in the Dissolution of Interorganizational Relationships. Academy of Management Journal 35 (1): 122-160. Seipel, C., and L. Tunnell. 2000. A Stochastic dominance analysis of the issuance of qualified opinions. American Business Review: 32-39. Shu, S. Z. 2000. Auditor Resignations: Clientele Effects and Legal Liability. Journal of Accounting and Economics 29: 173-205. Simunic, D. A. 1980. The Pricing of Audit Services: Theory and Evidence. Journal of Accounting Research 18(1): 161-190. Srinivasan, S. 2005. Consequences of Financial Reporting Failure for Outside Directors: Evidence from Accounting Restatements and Audit Committee Members. Journal of Accounting Research 43(2): 291-334. (Discussed by Richardson) Stice, J. 1991. Using Financial and Market Information to Identify Pre-Engagement Factors Associated with Lawsuits Against Auditors. The Accounting Review 65: 516-533. Teoh, S. and T. J. Wong. 1993. Perceived Auditor Quality and the Earnings Response Coefficient. The Accounting Review 68: 346-366. Turpen, R. A. 1990. Differential Pricing on Auditors Initial Engagements: Further Evidence. Auditing: A Journal of Practice & Theory 9(2): 60-76. Vafeas, N. 2001. On Audit Committee Appointments. Auditing: A Journal of Practice & Theory 20: 197-207. Vanstraelen, A. 2002. Auditor Economic Incentives and Going-Concern in a Limited Litigious Continental European Business Environment: Empirical Evidence from Belgium. Accounting and Business Research 21(3): 171-186. Vanstraelen, A. 2003. Going Concern Opinions, Auditor Switching, and the SelfFulfilling Prophecy Effects Examined in the Regulatory Context of Belgium. Journal of Accounting Auditing and Finance: 231-252. Walker, P. L., and J. R. Casterella. 2000. The Role of Auditee Profitability in Pricing New Audit Engagements. Auditing: A Journal of Practice & Theory 19(1): 157167. Wallace, W. A. and R. W. Kreutzfeldt. 1995. The Relation of Inherent and Control Risks to Audit Adjustments. Journal of Accounting, Auditing and Finance: 459-481. 54 Wilkerson, J. E., Jr. 1987. Selecting Experimental and Comparison Samples for Use in Studies of Auditor Reporting Decisions. Journal of Accounting Research 25: 161167. Woo, E., and H. C. Koh. 2001. Factors Associated with Auditor Changes: A Singapore Study. Accounting and Business Research 31(2): 133-144. Zmijewski, M. 1984. Methodological Issues Related to the Estimation of Financial Distress Prediction Models. Journal of Accounting Research 22: 59-82. 55 Figure 1: Research Design Categories for Choice Based and Matched Samples9 Choice Based Matched NCB-FM-W CB-NM CB-SM CB-FM NCB-SM NCB-FM-B Fully Matched Category Example Count in Audit Research CB-NM Choice Based Non-Matched Palepu (1986) CB-SM Choice Based Semi-Matched Heninger (2001) 16 CB-FM Choice Based Fully Matched Lys and Watts (1994) 21 NCB-FM-W Non Choice Based Fully Matched 7 Teoh and Wong (1993) 5 Iyer and Iyer (1996) 15 7 Within-Subject NCB-FM-B Non Choice Based Fully Matched Between Subject NCB-SM Non Choice Based Semi-Matched Krishnan (2003) STRATIF Stratified only (NCB, NM) Kreutzfeldt and Wallace _5 Total 76 9 Adapted from CKS (2007). 56 Table 1: By Research Stream Research Stream Paper Count Has Error 2 0 Has Error 3 2 Suffer 0 errors 0 Suffer 1 error 3 Suffer 2 errors 2 Suffer 3 errors 0 Avg # Errors 5 Has Error 1 5 5 5 1 1 0 4 0 1 1.4 7 3 0 3 3 2 2 0 .86 23 16 10 13 1 11 5 6 1.69 1-Determinants of Audit Litigation 2-Impact of Audit Committees 3-Determinants of Audit Fees 4-Prediction of Financial Distress Including GCQ 5-Prediction of Audit Qualifications Other than GCG 6-Audit Quality 7-Auditor Judgments 8-Auditor Changes 9-Detecting Mgmt Fraud Subtotal (Duplications) 1.4 9 7 6 4 0 2 6 1 1.89 8 6 14 4 81 (5) 7 3 12 3 61 (5) 4 1 5 2 29 (0) 7 4 8 2 44 (0) 1 0 0 0 5 (0) 0 5 6 2 35 (5) 3 0 5 1 24 (0) 4 1 3 1 17 (0) 2.25 1.33 1.79 1.75 1.66 -- Total 76 56 29 44 5 30 24 17 1.70 57 Table 2 Errors by Research Design10 CB-FM 21 papers in audit research sample CB-SM 16 papers CB-NM 7 papers NCB-FM-W 5 papers NCB-FM-B 15 papers NCB-SM 7 papers STRATIF 5 papers Total: 76 10 Treatment Group Selected on basis of outcome Selected on basis of outcome Selected on basis of outcome Randomly selected sample of firms Randomly selected sample of firms Randomly selected sample of firms Stratified only Control Group One firm selected as match for each firm in treatment group from set of firms having similar characteristics by matching on “closest” values Randomly selected from firms not having the same outcome, but matching by industry, year, size or group level Randomly selected from firms not having the same outcome Same subject, usually before and after One firm selected as match for each firm in treatment group from set of firms having similar characteristics by matching on “closest” values Randomly selected from firms not having the same outcome, but matching by industry, year, size or group level Select at different rates, but within every group Error 1 Error 2 Error 3 √ √ √ 21 17 11 √ √ √ 16 1 N/A N/A 3 √ 5 √ N/A N/A √ √ √ 10 11 15 √ 7 √ 0 √ 5 0 0 5 56 29 44 2 Adapted, with modifications, from CKS (2007). See that for additional guidance for researchers, by category. 58 Insert Table 3 here. (Note to reader: This Table 3 may appear out of order, at end, or be attached in a separate PDF document file.) 59 Table 4: Categorization of Studies by Journal of Publication Journal ABR ADVA AE AH AJPT AMBR AMJ CAR DS IJOA JAAF JAE JAR JBR JMIS JOBE MAJ MF TAR QJBE Total Count of papers 9 1 1 1 22 1 1 3 2 1 7 4 6 2 1 1 1 1 10 1 Suffer Error 1 Suffer Error 2 Suffer 0 errors Suffer 1 error Suffer 2 errors 8 0 1 1 6 1 0 1 0 0 1 3 2 0 0 0 1 1 3 0 Suffer Error 3 4 1 1 1 12 1 0 2 2 1 3 2 5 1 1 0 1 1 4 1 9 1 0 0 17 1 1 3 0 1 2 4 3 2 0 1 1 1 8 1 76 55 Avg # Errors 4 1 1 1 5 0 0 3 0 1 1 1 2 1 0 0 0 0 2 1 Suffer 3 errors 4 0 0 0 5 1 0 0 0 0 0 2 1 0 0 0 1 1 2 0 0 0 0 0 2 0 0 0 0 0 2 0 0 0 0 0 0 0 1 0 1 0 0 0 10 0 1 0 2 0 4 1 3 1 1 1 0 0 5 0 29 44 5 30 24 17 1.70 2.33 2.00 2.00 2.00 1.59 3.00 1.00 2.00 1.00 2.00 .86 2.25 1.67 1.50 1.00 1.00 3.00 3.00 1.50 2.00 Journals, in order of abbreviated titles above, are Accounting and Business Research, Advances in Accounting, Accounting Enquiries, Accounting Horizons, Auditing: A Journal of Practice & Theory, American Business Review, Academy of Management Journal, Contemporary Accounting Research, Decision Sciences, International Journal of Accounting, Journal of Accounting Auditing and Finance, Journal of Accounting and Economics, Journal of Accounting Research, Journal of Business Research, Journal of Management Information Systems, Journal of Business Ethics, Managerial Auditing Journal, Managerial Finance, The Accounting Review, and Quarterly Journal of Business and Economics. 60 Table 5: Categorization of Studies by Year of Publication Suffer Error 1 Suffer Error 2 Suffer Error 3 Suffer 0 errors Suffer 1 error <1980 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 Count of papers 0 2 1 0 0 0 3 1 2 0 1 2 5 5 4 4 6 3 2 6 5 7 7 3 7 1 0 2 0 2 1 1 4 3 3 3 1 2 2 3 5 6 7 3 7 1 1 2 0 0 0 0 4 2 3 2 2 0 1 0 3 3 1 1 3 2 1 3 1 0 1 2 4 2 3 4 5 0 1 4 3 4 0 1 3 0 0 0 0 0 0 0 0 1 0 0 1 1 0 2 0 0 0 0 0 Total 76 56 29 44 ’80-‘84 3 1 2 ‘85-‘89 7 5 ‘90-‘94 20 ’95-‘99 ’00-‘03 Year Suffer 3 errors Avg # Errors 1 0 1 1 2 0 1 0 2 0 1 3 2 1 1 2 3 6 1 2 Suffer 2 errors 0 1 0 0 0 1 1 3 1 3 1 1 0 0 3 0 2 1 2 4 1 0 2 0 0 0 0 2 1 1 2 1 0 1 0 3 2 0 0 1 2.00 2.00 2.33 1.00 1.00 2.00 1.50 2.40 1.40 2.25 2.25 1.33 0.67 2.00 1.17 2.20 1.86 1.14 1.67 1.85 5 30 24 17 1.70 3 0 1 1 1 2.00 2 5 0 4 1 2 1.71 14 11 15 1 4 9 6 2.00 22 13 6 13 4 9 4 5 1.45 24 23 8 8 0 12 9 3 1.63 61 Table 6: Categorization of Studies by Journal Quality Level and Year Range Journal Quality Level/ Yr Range Count of papers Suffer Error 1 Suffer Error 2 Suffer Error 3 ’80-‘84 ’85-‘89 ’90-‘94 ’95-‘99 ’00-‘03 Subtotal 3 4 8 3 5 23 1 3 6 3 5 18 2 1 5 0 1 9 3 2 6 1 1 13 ’80-‘84 ’85-‘89 ’90-‘94 ’95-‘99 ’00-‘03 Subtotal 0 2 7 6 7 22 0 1 5 4 7 17 0 1 3 2 0 6 0 2 6 2 2 12 ’80-‘84 ’85-‘89 ’90-‘94 ’95-‘99 ’00-‘03 Subtotal 0 1 4 10 7 22 0 1 2 4 7 14 0 0 3 4 5 12 0 1 3 7 2 13 ’80-‘84 ’85-‘89 ’90-‘94 ’95-‘99 ’00-‘03 Subtotal 0 0 1 3 5 9 0 0 1 2 4 7 0 0 0 0 2 2 0 0 0 3 3 6 Total 76 55 29 44 Suffer 0 errors Upper AJPT Lower Other 5 Suffer 1 error Suffer 2 errors Suffer 3 errors Avg # Errors 0 0 1 0 0 1 1 3 0 2 3 9 1 0 4 1 2 8 1 1 3 0 0 5 2.00 1.50 2.13 1.33 1.40 1.74 0 0 0 2 0 2 0 1 2 2 5 10 0 0 3 0 2 5 0 1 2 2 0 5 -2.00 2.00 1.33 1.29 1.59 0 0 0 2 0 2 0 0 1 4 1 6 0 1 2 1 5 9 0 0 1 3 1 5 -2.00 2.00 1.50 2.00 1.77 0 0 0 0 0 0 0 0 1 1 3 5 0 0 0 2 0 2 0 0 0 0 2 2 --1.00 1.67 1.80 1.67 30 24 17 Upper Tier Accounting Journals defined as: CAR, JAE, JAR, TAR Auditing defined as AJPT alone Lower Tier Accounting Journals defined as: ABR, ADVA, AE, AH, IJOA, JAAF, JMIS, MAJ Other Journals defined as: ACADMJ, AMBR, DS, JBR, JOBE, MF, QJBE. 1.70 Research Stream Author(s), Journal, Paper Title Stice (1991). The Accounting Review, “Using Financial and Market Information to Identify Pre-Engagement Factors Associated with Lawsuits Against Auditors”. Lys and Watts (1994).Journal of Accounting Research, “Lawsuits Against Auditors”. This paper was discussed by Jennifer Francis’ “Discussion of Lawsuits against Auditors”. Krishnan, J. and Krishnan, J. (1997). The Accounting Review , “Litigation Risk and Auditor Resignations” Shu (2000). Journal of Accounting and Economics . “Auditor Resignations: Clientele Effects and Legal Liability” SSCI citation count 1/26/07 60 Research Design Category auditor litigation auditor litigation CB-FM dup - 21 cites auditor litigation CB-SM 15 auditor litigation CB-SM Heninger (2001). The Accounting Review , “The Association Between Auditor Litigation and Abnormal Accruals”. 11 Beasley (1996). The Accounting Review , “An Empirical Analysis of the Relation Between the Board of Director Composition and Financial Statement Fraud” McMullen (1996). Auditing: A Journal of Practice & Theory ,“Audit Committee Performance: An Investigation of the Consequences Associated with Audit Committees” 60 Abbott, Park, Parker (2000). Managerial Finance . “The Effects of Audit Committee Activity and Independence on Corporate Fraud” Vafeas (2001). Auditing: A Journal of Practice and Theory , “On Audit Committee Appointments”. 5 14 auditor litigation audit committee impact CB-SM CB-SM audit committee impact NCB-SM 0 Principal Analyses Explaining Choice and/or Using Matching Identification of Cases, or Stratified Sample CB-SM 32 Choice-Based and/or Matched Sample Selection audit committee impact audit committee impact CB-FM NCB-SM Create two semi-matched samples from Compustat firms: one is matched on year only Identify 49 cases of auditor litigation, excluding and then random selection, the other is matched financial and service firms. on year and industry (SIC3). Probit regression of auditor litigation or not Fully-match 163 firm-year observations: match by year, industry (3 digit SIC), and Compustat Identify 163 auditee firms whose auditors were delisting code, if any, and then select firm of sued during 1955-1994, and for which Compustat closest size (total assets). data was available OLS (and logit not reported) regressions of litigation or not Identify 67 auditee firms with lawsuits against auditors having 8 years of Compustat data. Logit regression of auditor resignations with respect to litigation risk and clientele effects. Identify 262 cases of new appointments of directors to the audit committee during 19941998. Fully-match 238 nonexecutive director appointments: match on firm and year and on professional affiliation of director (inside vs. outside director), then select closest in age. Yes No No No Yes No Yes No Yes Yes Yes No Yes No No No No No Yes No No No No No Yes No No No No No Yes No No No No No Yes No No No No No Yes No Yes No Yes No Yes No Yes Yes No No Logit regression explaining fraud Logit regression explaining Semi-match firms: create 5 corresponding samples of approximately the same size, matched litigation or not, with bootstrap Identify 5 sets of firms having 5 financial on year. Further, discussion of the determination standard errors reporting issues during 1982-1988 such as of one variable (audit committee presence or not) quarterly earnings corrections, ranging in number suggests a pair-wise correspondence, not fully from 62 to 96 firms. described, underla Fully-match to 78 firm-year observations in nonIdentify 78 firms subject to SEC Accounting and sanctioned firms matched by size, industry, trading exchange, and time period. Auditing Enforcement Releases. Reweight-ing for generalizing Logistic regressions of resignation versus dismissal Semi-match 67 firm-year observations: match on Logit regression of litigation or year, industry ( SIC4: 62; SIC3: 5), and then not randomly select one. Semi-match 75 firm-year observations having Identify 75 public firms having an occurrence of complete data: Match on year, stock exchange, financial statement fraud publicly reported during firm size within +/-30% in market value, industry (SIC4 or 3 or 2) 1980-1991. Additional control for “closest” matching Needed Applied Needed Applied Needed Applied ? ? ? ? ? ? Selection of Controls or Comparison Sample, If Any Semi-match firm-year observations of firms who dismissed auditors (auditors did not resign), Identify 141 firms whose auditors resigned during following Stice (1991): one sample of 141 matched on year only and then random selection, 1989-1995 and all required data available. a second set of 141 is matched on year and industry. Two control groups, 1) for each auditor resignation firm ten firmsare randomly drawn from Compustat for the same year resulting in 269 auditor resignations over the years 1988 to 2,690 firms ,2) from 1263 randomly selected 1996. client initiated auditor changes between 1987 and 1955, 433 firms selected based Analyses Incorp. Matching/Stratif ication? Linear probability model (OLS regression) explaining 1-0 sanction or not Logit regression of director serving on audit committee or not Research Stream Author(s), Journal, Paper Title SSCI citation count 1/26/07 Carcello and Neal (2003). The Accounting dup - 10 Review , “Audit Committee Characteristics cites and Auditor Dismissals following ‘New’ Going-Concern Reports” Simunic (1980), Journal of Accounting Research, " The Pricing of Audit Services: Theory and Evidence" 182 Turpen (1990). Auditing: A Journal of Practice and Theory , “Differential Pricing on Auditors Initial Engagements: Further Evidence” 13 Maher, Tiessen, Colson, Broman (1992). The Accounting Review , “Competition and Audit Fees”. 14 Sanders, Allen & Korte (1995). Auditing: A Journal of Practice and Theory , “Municipal Audit Fees: Has Increased Competition Made a Difference?” 3 Iyer and Iyer (1996). Auditing: A Journal of Practice & Theory , “Effect of Big 8 Mergers on Audit Fees: Evidence from the United Kingdom” Walker and Casterella (2000). Auditing: A Journal of Practice & Theory , “The Role of Auditee Profitability in Pricing New Audit Engagements” 2 Geiger and Rama (2003). Auditing: A Journal of Practice & Theory , "Audit Fees, Nonaudit Fees, and Auditor Reporting on Stressed Companies" Kida (1980). Journal of Accounting Research . “An Investigation into Auditors’ Continuity and Related Qualification Judgments” 3 Research Design Category 55 Principal Analyses Explaining Choice and/or Using Matching Identification of Cases, or Stratified Sample audit committee impact CB-FM Identify 66 nonfinancial public firms that received a Going concern report from a big 6 auditor and then dismissed the auditor during the period 19881999, but did not go bankrupt, and which had an audit committee and information available. Also, randomly STRATIF Survey firms in 4 strata: Big 8 vs Non Big 8 Auditor, small vs. large (large is > $125 million sales). Obtain useful responses in each strata of: Big 8 Large (172), Big 8 Small (117), Non Big 8 Large (38), and Non Big 8 Small (70) Determinants of audit fees Determinants of audit fees NCB-SM 3 Choice-Based and/or Matched Sample Selection Identify 327 public companies that changed auditors during 1982-1984. Collect 57 usable surveys from these. Determinants of audit fees On the pooled sample of 187 First, fully-match each GC firm with a firm receiving a going concern report but that did not pairs, run logit regression of dismiss its auditor: Match by year, industry and whether dismissed auditor or closest size. Drop four as matches not available, not, on variables including yielding 62 pairs. Second, fully-match clean interactions with whether opinion firms received a going concern or not NCB-FM-W NCB-SM Identify 80 firms with auditor tenure of 1 to 3 years Determinants of audit fees Determinants of audit fees Prediction of Fin Distress including GCQ CB-FM Semi-match 327 firm-year observations from Compustat firms that did not change auditors during 1980-1984: match proportionally by exchange (Exchange-traded vs. OTC). Collect 89 usable surveys from these, then match proportionally by year. CB-FM Yes No Yes Yes No No No No No No Yes No Yes No No No Yes No Yes Yes No No No No Yes Yes No No No No Yes Yes No No No No Yes No Yes Yes Yes No Yes No Yes Yes No No Yes No Yes No Yes No OLS regression of Log(Audit Fees) over 57+89=146 firms. OLS regression of change in audit fees on other changes OLS of change in audit fees regressed on differences, over 159 observations. Fully-match, prospectively: Each auditee is OLS of change in audit fees on regarded as its own control before and after big 8 differences auditors become the big 6 by mergers during 1989. Match to 80 firms that have auditor tenure of Regression explaining audit fees other than 1 to 3 years, based on total assets and SIC Fully-match 66 non-GCQ but financially stressed 66 firms receiving a first-time GCQ opinion and firms by a financial stress measure, industry and size measured by sales. meeting data requirements Identify manufacturing firms having 10 goingconcern potential indicators, from those filing during May 1974-April 1975, having complete Moody’s data. Get sample size down to 20 by eliminations “due to size, date of problem, or finally on a random basis”. Reweight-ing for generalizing OLS model explaining audit fees NCB-FM-W 270 UK firms audited by Big 8 auditors and available data in 1987 and 1991. Additional control for “closest” matching Needed Applied Needed Applied Needed Applied ? ? ? ? ? ? Selection of Controls or Comparison Sample, If Any Fully-match, prospectively: Each auditee is 78 firms in a convenience sample (a database for regarded its own control before and after legislative, enforcement, and rule change events; which 1977 and 1981 audit fees are known) no further data collection required. In 1989, 289 cities responded to another survey. Determinants of audit fees Of these 159 appeared in 1985 survey. NCB-FM-W 642 cities responding to survey request in 1985. Determinants of audit fees Analyses Incorp. Matching/Stratif ication? Identify 20 non-problem firms matched by year, industry and asset size. Logit regression explaining going concern report Discriminant analysis of problem vs. non-problem firms, using 5 financial ratios. Also, compare this model’s performance compared to experimental subject auditor judgments on same firms’ data. Research Stream Author(s), Journal, Paper Title SSCI citation count 1/26/07 Levitan and Knoblett (1985). Auditing: A 14 Prediction of Journal of Practice & Theory , “Indicators Fin Distress of Exceptions to the Going Concern including Assumption” GCQ Mutchler (1985). Journal of Accounting Research. “A Multivariate Analysis of the Auditor’s Going-Concern Opinion” 45 Koh (1991) Accounting and Business Research , “Model Predictions and Auditor Assessments of Going Concern Status” Not in SSCI Ponemon and Schick [1991]: Auditing: A Journal of Practice and Theory, “Financially Distressed Companies and Auditor Perceptions of the Twelve Characteristics of Decline”. Chen and Church (1992). Auditing: A Journal of Practice & Theory , “Default on Debt Obligations and the Issuance of Going-Concern Opinions” 2 25 Citron and Taffler (1992). Accounting and Business Research, “The Auditor Report Under Going Concern Uncertainties: An Empirical Analysis”. Not in SSCI Fleak and Wilson (1994). Journal of Accounting, Auditing, and Finance . “The Incremental Information Content of the Going-Concern Audit Opinion” Not in SSCI Cormier, Magnan, Morard (1995) Journal of Accounting, Auditing, and Finance," The auditor’s consideration of the going concern assumption" Not in SSCI Lenard, Alam, Madey (1995) Decision Sciences . “The Application of Neural Networks and a Qualitative Response Model to the Auditor’s Going Concern Uncertainty Decision” 36 Prediction of Fin Distress including GCQ Prediction of Fin Distress including GCQ Prediction of Fin Distress including GCQ Prediction of Fin Distress including GCQ Prediction of Fin Distress including GCQ Prediction of Fin Distress including GCQ Prediction of Fin Distress including GCQ Prediction of Fin Distress including GCQ Research Design Category Choice-Based and/or Matched Sample Selection Principal Analyses Explaining Choice and/or Using Matching Identification of Cases, or Stratified Sample CB-FM CB-NM 35 Compustat firms filing Chapter 11 bankruptcy in 1980-1981 Identify 165 firms receiving going concern audit qualifications during 1978-95. CB-FM Of one accounting firm’s clients, identify 43 firms in financial distress, have each audit mgr rate 12 characteristics. CB-SM Identify 127 public industrial firms receiving a going-concern opinion for the first time during 1982-1986. These get weight 1 in analysis. CB-FM STRATIF CB-NM CB-NM Additional control for “closest” matching Reweight-ing for generalizing Needed Applied Needed Applied Needed Applied ? ? ? ? ? ? Selection of Controls or Comparison Sample, If Any Fully-match 35 non-bankrupt Compustat firms by Discriminant analysis explaining year, SIC code, and closest size (assets) bankruptcy or not Identify 119 firms having distress but not going concern qualification, by random selection of firms for review from 2,855 available Identify 119 manufacturing firms having a Going manufacturing firms. 684 were reviewed to find Concern qualification during an 11 month period. 119 meeting distress criteria. CB-FM Analyses Incorp. Matching/Stratif ication? Fully-match to 165 non-GC firms:matched on industry, size (assets) and year. Match according to size (revenues within 5%), public vs. closely held control, and 2 digit SIC code. Yes No Yes No No No No No Yes No Yes No Yes No Yes Yes Yes No Yes No (size only coarsely ) Yes No Yes No No No No No Yes No Yes No Yes No No No No No Yes No No No No No Yes No No No No No Yes No Probit regression explaining bankruptcy, using WESML ANOVA and MANOVA on 12 constructs of organization decline surveyed 2x2 tabulations, e.g. ,of auditor switch or not vs. going concern or not, (chi-square tests possible but not performed) OLS explaining Cumulative Selected controls from all Compustat firms distressed but not GC, selecting 25% of those, Abnormal Returns around audit opinion release dates 572, and then dropping for data reasons, yielding 325. Choice-based in that controls selected at 25% sampling rate. Identify 112 firms apparently not facing Logit, discriminant analysis, financial difficulties (stock return > 0%) recursive partitioning Identify 138 Canadian firms that potentially face explaining financial difficulty financial difficulties (stock return < -50%) status. Identify 153 firms having Going-Concern qualifications during 1979-1986 and all data acceptable. Selected 40 firms with unqualified audit opinions from Compustat PST firms (presumably in the From 1988 Disclosure II Database, identified 40 same year?) firms having going conceren audit qualification No Discriminant analysis of goingconcern qualification or not Semi-match 127 firm-year observations matched Logit regression of Going on year, from 1,015 Compustat firms having Concern Opinion received or not negative net worth or other “problem” status. (Weighting of 1015/127 used in logit regression) Fully-match two subsamples of 61 and 25 Identify 61 firm-year observations (for 38 UK unqualified firm-year observations: Match by firms) receiving going concern audit qualifications year, industry, size, and financial distress status but not going bankrupt; identify 25 failed firms (percentile ranking of an Altman Z-score). that received GC qualification. Yes Logit and Neural network models explaining GC audit qualifications Research Stream Author(s), Journal, Paper Title Carlson, Glezen, Benefield (1998) Quarterly Journal of Business and Economics. “An Investigation of Investor Reaction to the Information Content of a Going Concern Audit Report while Controlling for Concurrent Financial Statement Disclosures” SSCI citation count 1/26/07 Not in Prediction of SSCI Fin Distress including GCQ Foster, Ward, Woodroof (1998). Journal of Accounting, Auditing and Finance . “An Analysis of the Usefulness of Debt Defaults and Going Concern Opinions in Bankruptcy Risk Assessment” Not in SSCI Lenard, Madey, Alam (1998) The Journal of Management Information Systems , “The Design and Validation of a Hybrid Information System for the Auditor’s Going Concern Decision” Kleinman and Anandarajan (1999). Managerial Auditing Journal. “The Usefulness of Off-Balance Sheet Variables as Predictors of Auditor’s Going Concern Opinions” Koh and Tan (1999). Accounting and Business Research , “A Neural Network Approach to the Prediction of Going Concern Status” Not in SSCI Morris and Strawser (1999). Auditing: A Journal of Practice and Theory, “An Examination of the Effect of CPA Firm Type on Bank Regulators’ Disclosure Decisions” Lenard, Alam, Booth (2000) Decision Sciences . “An Analysis of Fuzzy Clustering and a Hybrid Model for the Auditor’s Going Concern Assessment” Behn, Kaplan, Krumwiede (2001). Auditing: A Journal of Practice & Theory , “Further Evidence on the Auditor’s GoingConcern Report: The Influence of Management Plans” Citron and Taffler (2001). Journal of Business Ethics . “Ethical Behaviour in the U.K. Audit Profession: The Case of the Self-Fulfilling Prophecy Under GoingConcern Uncertainties” Not in SSCI Not in SSCI 0 3 0 1 Prediction of Fin Distress including GCQ Prediction of Fin Distress including GCQ Prediction of Fin Distress including GCQ Prediction of Fin Distress including GCQ Prediction of Fin Distress including GCQ Prediction of Fin Distress including GCQ Prediction of Fin Distress including GCQ Prediction of Fin Distress including GCQ Research Design Category Choice-Based and/or Matched Sample Selection Principal Analyses Explaining Choice and/or Using Matching Identification of Cases, or Stratified Sample NCB-FM-B Identify 88 firms that received Going Concern Audit Reports (GCAR) during 1981-1988, using NAARS. CB-NM Identify 82 firms going bankrupt during 19881991 for an estimation sample, and 44 bankrupcies during 1992-1993 for a holdout sample. Select the firm-year the year before bankruptcy. CB-NM 32 bankrupt firms from 1989 and 26 bankrupt firms from 1990 CB-SM CB-FM CB-SM Identify 55 distressed but not bankrupt firms during first period, and 40 during second period. Selection of which firm-year observation was mostly 1990. Choice-based, random selection of firms, and analysis is one obs per firm during time period apparen Reweight-ing for generalizing ANCOVA explaining market returns around report dates Logit regression of bankruptcy as a function of audit opinion and other variables 32 firms randomly selected from 1562 nonbankrupt firms from 1989and 26 firms randomly selected from 1602 nonbankrupt firms from 1990 Stepwise and non-stepwise From nonGCAR Big 6 audited firms, identify From Big 6 audited clients in 1992 Compact 173 firms matched to GCAR firms on size (both discriminant analysis to explain Disclosure database, Identify 61 firms receiving a GCAR status assets and revenues). Authors do not explain this specific Going Concern audit report (GCAR) matching procedure in detail. It is not clear how during 1990-1992 and having complete data. this matching was Use sample of Koh (1991) 165 matched. Neural network explaining bankruptcy Use sample of Koh (1991) Identify 116 Texas banks closed during 19901991. Additional control for “closest” matching Needed Applied Needed Applied Needed Applied ? ? ? ? ? ? Selection of Controls or Comparison Sample, If Any Fully-match to 88 non-GCAR firm-year observations by fiscal year, industry, and a Zscore measure of financial distress. Analyses Incorp. Matching/Stratif ication? Semi-match to 116 firm-year observations of non- Logit regression of bankruptcy closed Texas banks: match by year, then (closure) or not randomly select one. Yes No Yes Yes Yes No No No No No No No No No No No Yes No Yes No No No Yes No Yes No Yes No Yes No Yes No No No No No No No No No Yes No Yes No No No No No Yes No Yes Yes No No Apparently same control sample as Lenard et al (1998) CB-NM CB-SM CB-FM Apparently same sample as Lenard et al (1998) Semi-match 148 firm-year observations of manufacturing firms: match only on year, then 148 publicly traded manufacturing firms receiving randomly select one. Going concern report within an unqualified audit Identify 99 firms receiving GC qualifications during 1987-1994 (of which 17 went bankrupt before publishing next accounts (i.e. financial statements, said British-wise) Fully-match to 99(?) firms based on year, exchange listing status, industry, size (“turnover”, i.e. sales) and Z-score (of which 8 went bankrupt) Logit regression explaining going concern report Logit regression explaining bankruptcy by sales, Z-score, opinion (GC or not), listing status, loss Research Stream Author(s), Journal, Paper Title Vanstraelen (2002). Accounting and Business Research , “Auditor Economic Incentives and Going-Concern in a Limited Litigious Continental European Business Environment: Empirical Evidence from Belgium” Gaeremynck and Willekens (2003). Accounting and Business Research , “The Endogenous Relationship Between AuditReport Type and Business Termination: Evidence on Private Firms in a NonLitigious Environment” SSCI citation count 1/26/07 Not in Prediction of SSCI Fin Distress including GCQ Not in SSCI Prediction of Fin Distress including GCQ Geiger and Rama (2003). Auditing: A Journal of Practice & Theory , "Audit Fees, Nonaudit Fees, and Auditor Reporting on Stressed Companies" dup - 3 Prediction of cites Fin Distress including GCQ Vanstraelen (2003). Journal of Accounting Auditing and Finance , “Going Concern Opinions, Auditor Switching, and the Self-Fulfilling Prophecy Effects Dopuch, Holthausen, Leftwich (1987). The Accounting Review , “Predicting Audit Qualifications with Financial and Market Variables” Not in SSCI Wilkerson (1987). Journal of Accounting Research , “Selecting Experimental and Comparison Samples for Use in Studies of Auditor Reporting Decisions” 6 Elliott (1992).Journal of Accounting Research , “’Subject to’ Audit Opinions and Abnormal Security Returns—Outcomes and Ambiguities”. DeFond and Jiambalvo (1993). Contemporary Accounting Research , “Factors related to auditor-client disagreements over income-increasing accounting methods” Kinney and McDaniel (1993). Auditing: A Journal of Practice & Theory. “Audit Delay for Firms Correcting Quarterly Earnings” Buchman and Collins (1998). Journal of Business Research , “Uncertainty About Litigation Losses and Auditors”. 45 Not in SSCI !!?! Not in SSCI 1 0 Prediction of Fin Distress including GCQ Prediction of Audit Qualification other than GCQ Prediction of Audit Qualification other than GCQ Research Design Category Choice-Based and/or Matched Sample Selection Principal Analyses Explaining Choice and/or Using Matching Identification of Cases, or Stratified Sample CB-FM CB-FM CB-FM CB-FM CB-SM CB-SM Identify 392 bankrupt firms Identify 114 Belgian firm bankruptcies in 1995 and 1996. Identify 392 bankrupt firms Fully-match by year, industry (4 digit NACE code) and size (assets). Prediction of From NAARS, fully match 85 firms by 4 digit Audit SIC, sign of earnings change, having no Identify 85 firms announcing, at year-end, NCB-FM-B Qualification corrections of previously reported interim earnings extraordinary items, and closest in size other than (revenues). Semi-match firm-year observations from firms Prediction of Audit 60 firms having audit opinions qualified due to disclosing litigation uncertainty but unqualified: CB-SM Match by year???, by industry (SIC3), and then Qualification litigation uncertainty and having full data. other than randomly select one. No Yes No No No Yes No Yes No No No Yes No Yes Yes No No Yes No Yes No No No Yes No No No Yes Yes Yes No No No No No Yes Yes Yes No Yes No Yes No Yes No No No Yes Yes Yes No Yes No Yes No No No Yes No Logit regression Semi-match to 33 firms also investigated by the Logit regression explaining audit qualification or not, on Identify 16 firms investigated by the SEC for price same 9 SEC investigations, but not receiving fixing during 1972-1981, and receiving audit audit qualifications. Each of 9 investigations had Cumulative Abnormal Returns and other variables qualifications due to the uncertainty, within 9 both outcomes represented. separate investigations. Fully-match to 145 firm-year observations not having audit qualifications: match by year (as close to same year-end as possible), industry (usually 4 digit SIC), and closest by a measure of Fully-match 40 firm-year observations of firms changing auditors for other reasons: match by closest year of change, and closest industry. Yes Logit regression explaining going concern report Probit regression with WESML Semi-match to N firm-year observations of NYSE and AMEX firms not having audit Identify some number N of firm-years having an qualifications (allowing consistency exceptions): auditor qualification for the first time during 1969- Match on year, then select randomly. After dropping firms having incomplete CRSP and 1980, of NYSE and AMEX firms. Compustat, and setting aside holdout sa Prediction of Audit Randomly select 145 firms having a “Subject To” NCB-FM-B Qualification audit qualification and CRSP data availability. other than Prediction of Audit Of firms changing auditors, identify 40 firms 8-K Qualification CB-FM reporting disputes where auditor objected to other than management attempts to report higher earnings GCQ Reweight-ing for generalizing Logit regression of Going Concern Fully match 114 continuing firm-year Logit regression of audit report observations: Match by size, industry, and year. (clean or not) NOT SO SIMPLE has 2 equation modeling endogeneity Fully-match 66 non-GCQ but financially stressed firms by a financial stress measure, industry and 66 firms receiving a first-time GCQ opinion and size measured by sales. meeting data requirements Additional control for “closest” matching Needed Applied Needed Applied Needed Applied ? ? ? ? ? ? Selection of Controls or Comparison Sample, If Any Fully-match to create two control samples, one of financially stressed and one of non-stressed firms: Match by year, industry (4 digit NACE code) and size (assets). Analyses Incorp. Matching/Stratif ication? Univariate matched t-tests of cumulative pair-wise returns differences Logit regression explaining disagreements OLS of paired differences in audit delay upon pairwise difference in explanaotry variables. Univariate tests, and also a pooled logit regression of material loss or not Research Stream Author(s), Journal, Paper Title Bartov, Gul, Tsui [2000] Journal of Accounting and Economics, “ Discretionary-Accruals Models and Audit Qualifications”. Seipel and Tunnell (2000) American Business Review . “A Stochastic dominance analysis of the issuance of qualified opinions” SSCI citation count 1/26/07 20 Prediction of Audit Qualification other than Not in Prediction of SSCI Audit Qualification other than GCQ Peasnell, Pope, Young (2001). Accounting and Business Research , “The Characteristics of Firms Subject to Adverse Ratings by the Financial Reporting Review Board” Not in SSCI Feltham, Hughes, Simunic (1991).Journal of Accounting and Economics , “Empirical Assessment of the Impact of Auditor Quality on the Valuation of New Issues”. 24 Teoh and Wong [1993]:The Accounting Review, “Perceived Auditor Quality and the Earnings Response Coefficient”. 54 Allen (1994). Auditing: A Journal of Practice & Theory , “The Effect of LargeFirm Audits on Municipal Bond Rating Decisions” 0 Clarkson and Simunic (1994). Journal of Accounting and Economics . “The Association Between Audit Quality, Retained Ownership, and Firm-Specific Risk in U.S. vs. Canadian IPO Markets” 21 Becker, DeFond, Jiambalvo, and Subramanyam (1998). Contemporary Accounting Research , “The Effect of Audit Quality on Earnings Management” Not in SSCI Colbert and Murray (1998). Journal of Accounting, Auditing, and Finance. “The Association Between Auditor Quality and Auditor Size: An Analysis of Small CPA Firms” Not in SSCI Prediction of Audit Qualification other than GCQ Research Design Category Choice-Based and/or Matched Sample Selection Principal Analyses Explaining Choice and/or Using Matching Identification of Cases, or Stratified Sample CB-FM CB-FM CB-FM Fully match 173 firm-year observations Match on Logit regression explaining year, 2 digit SIC, and Big 6 vs. non-Big-6 whether audit opinions are auditor, then select nearest in assets (or, for one qualified or not. analysis, nearest in total accruals). Identify control firm year observation for each, Chi squared test of pair-wise Stochastic Dominance status of Identify 3 case samples of firms having audit matched by size, industry, year-end and/or CARs qualifications : 75 going concern, 37 litigation, 27 financial condition (Z-score) asset valuations Identify 173 Compustat firms having qualified audits. Identify 47 adverse FRRP rulings. audit quality Fully-matched pairs are chosen from within the sample, prospectively: 110, 102, and 85 pairs are 251 cases of firms employing high quality (large) found with adequate matching by assets, IPO NCB-FM-B auditors and 141 firms employing low quality proceeds, or MVE. Within those, 50, 40, and 37 (small) auditors are identified. pairs have market returns data available for some analyses. audit quality Fully-match 1,263 big 8 audited firm-yearobservations to nonBig8 ones matching by year, Identify 1,297 Compustat and CRSP firms having industry (4 digit SIC, or 3 or 2 digit if necessary), NCB-FM-B non-big 8 auditors during 1980-1989, and 16,000 and then select closest in size (total assets). having big 8 auditors. audit quality CB-FM audit quality NCB-SM audit quality CB-NM Identify 174 IPOs in Canada during 1984 and 1987 Yes No Yes No No No Yes No Yes No Yes No Yes No Yes No No No Yes No Yes No Yes No Yes No Yes No Yes No Yes No Yes No No No Yes No Yes No Yes No Yes No No No Yes No No No No No No No OLS explaining market value OLS regression of cumulative abnormal returns between forecast and earnings dates Logit regression explaining Compare 44 IPOs having high quality audit to 44 firms having low quality audit matched on size high/low quality as function of variables including firm-specific (assets), underwriter prestige (high/low), and risk measures. terms of offering ("firm commitment" or "best efforts") OLS regression of est. Semi-match 2,179 firm year observations of firms having non-big 6 auditors and match on discretionary accruals on big 6 10,397 firm-year observations of non-financial dummy and other variables year, industry, and decile of operating cash flows firms having big 6 auditors over 1989-1992 and (each selected control has at least one big 6 sufficient data. counterpart). Compare to 325 audit firms not receiving qualified or adverse reports, selected by interval Identify 97 audit firms that received a qualified or sampling (essentially randomly) from adverse report from AICPA’s Private Companies alphabetical files of the PCPS as of 1996. Practice Section peer review program. Reweight-ing for generalizing Logit regression of FRRP censure Fully-match to non-big 8 audited cities matching Ordered logit models explaining within 3% on population, then selecting closest bond ratings are run within each sample separately Identify 212 cities having a Moody’s bond rating match on population. Yield is 125 city pairs. NCB-FM-B during 1978-1986 and audited by a big 8 firm audit quality Additional control for “closest” matching Needed Applied Needed Applied Needed Applied ? ? ? ? ? ? Selection of Controls or Comparison Sample, If Any Fully-match to firm-year observations not receiving adverse FRRP rulings: Match by same level-6 industry group, year, and closest assets (require assets within 25%, else go to level-4 industry match), and data available. Analyses Incorp. Matching/Stratif ication? Ordered logit explaining peer review outcomes: unqualified / unqualified with comment / qualified / adverse. Research Stream Author(s), Journal, Paper Title Bauwhede, Willekens, Gaeremynck (2003). International Journal of Accounting , “Audit Firm Size, Public Ownership, and Firms’ Discretionary Accruals Management”. SSCI citation count 1/26/07 Not in audit quality SSCI Krishnan, G. (2003). Auditing: A Journal of Practice & Theory. “Audit Quality and the Pricing of Discretionary Accruals” 2 Kreutzfeldt and Wallace (1986) Auditing: A Journal of Practice & Theory , "Error Characteristics in Audit Populations: Their Profile and Relationship to Environmental Factors" 62 Kreutzfeldt and Wallace (1990) Auditing: A Journal of Practice & Theory , "Control Risk Assessments: Do They Relate to Errors?" Wallace and Kreutzfeldt (1995). Journal of Accounting, Auditing and Finance , “The Relation of Inherent and Control Risks to Audit Adjustments” 6 Jamal and Tan (2001). Journal of Accounting Research . “Can Auditors Predict the Choices Made by Other Auditors?” Bedard and Graham (2002). Auditing: A Journal of Practice & Theory , “The Effects of Decision Aid Orientation on Risk Factor Identification and Audit Test Chung, Firth, Kim (2003). Accounting and Business Research , “Auditor Conservatism and Reported Earnings” Identify 31 industrial firms listed on the Brussels Stock Exchange having consolidated financial NCB-FM-B statements and a comparable matching firm with sufficient data. audit quality Not in SSCI 3 0 0 auditor judgment auditor judgment auditor judment auditor judgment auditor changes 40 Fully-match 136 firm-year observations: Match on industry (at least 2 digit NACE code), size (assets), and year, yielding 31 control firms, and then collect other years’ data when available for both case and control firm in a given year, yielding 136 firm-year observations. Mean and standard deviation of number of audit adjustments as percentage of account balances; ANCOVA of same comparing strata STRATIF Use sample of Kreutzfeldt and Wallace (1986) Note same sample also used in Wallace and Identify 260 Arthur Anderson clients, stratified in Kreutzfeldt (1995) sampling by client size, industry, and public vs. privately held STRATIF Use sample of Kreutzfeldt and Wallace (1990): select 260 Arthur Andersen audit clients by stratified sampling in groups by size, industry, and public versus private NCB-FM-W Fully matched in construction in terms of who Within an auditing firm 14 sets of 3 persons who works together (and may share common work together (a manager, a top senior, and a knowledge base). mediocre senior) are identified. OLS regression models explaining number of audit adjustments, or in other words, error rate upon control risk factors ANOVA exploring differences in accuracy of predictions about audit task choices in other sets. Matched pairs were defined prospectively in the OLS regression of auditor sample creation. In each pair, one auditor is judgments of number of client randomly assigned to the positive and one to the engagement risk factors present negative treatment. Fully-match to non-Big 6 firms: Match by year, OLS regression of industry (2 digit SIC code), and closest in size. Earnings/price ratio From all Compustat Plus firm-years in 1988-1997, This reduces sample greatly. Then, winsorize NCB-FM-B select all audited by Big 6 firms. and delete some observations. Yield is 3,860 observations in total. Fully-match to 48 firms not switching auditors: Univariate t-tests of pair-wise Identify 48 firms switching auditors during 1972- match on firm’s Beta (from a market model), and differences in cumulative market NCB-FM-B by industry where possibly (by 4-, 3- 2-digit SIC, returns 1975 and CRSP data available. or not at all ) NCB-FM-W 23 pairs of senior auditors who share a client engagement are identified. auditor changes NCB-FM-B Identify 132 public firms filing for bankruptcy during 1974-1982, and other requirements. Fully-match 132 non-bankrupt firm-year observations, retrospectively: Match by year, industry (SIC4), and size (revenues in 4th year prior to bankruptcy of case firm). Additional control for “closest” matching Reweight-ing for generalizing OLS regression explaining discretionary accruals Identify 3,316 firm-year observations in Non Big Regression of Stock returns on operating cash flow, Identify 15,342 firm-year observations in Big 6 6 audited firms that correspond in industry and Nondiscretionary Accruals, and audited firms for which corresponding Non Big 6 cash flow decile. discretionay accruals control observations in same 2 digit SIC and cash conditioned on audit quality flow decile are available. STRATIF Analyses Incorp. Matching/Stratif ication? Needed Applied Needed Applied Needed Applied ? ? ? ? ? ? Selection of Controls or Comparison Sample, If Any Identify 260 Arthur Anderson clients, stratified in sampling by client size, industry, and public vs. privately held auditor judgment auditor judment Principal Analyses Explaining Choice and/or Using Matching Identification of Cases, or Stratified Sample NCB-SM Fried and Schiff (1981). The Accounting Review , “CPA Switches and Associated Market Reactions” Schwartz and Menon (1985). The Accounting Review . “Auditor Switches by Failing Firms”. Research Design Category Choice-Based and/or Matched Sample Selection 2x2 tables, e.g. bankrupt or not versus auditor switch or not, with Chi-squared tests. Auditor switch is factor in each pairwise, 2x2 test; bankruptcy is not. Yes No Yes Yes Yes No Yes No Yes Yes Yes No No No No No Yes No No No No No Yes No No No No No Yes No Yes No No No No No Yes No No No No No Yes No Yes No Yes No Yes Yes Yes No Yes No Yes No Yes No Yes No Research Stream Author(s), Journal, Paper Title SSCI citation count 1/26/07 Eichenseher and Shields (1989). Advances Not in in Accounting, Supplement , “Corporate SSCI capital structure and auditor ‘fit’”. DeBerg, Kaplan, Pany (1991). Accounting Horizons, “An Examination of Some Relationships Between Non-Audit Services and Auditor Change”. Not in SSCI Seabright, Levinthal, Fichman (1992). Academy of Management Journal . “Role of Individual Attachments in the Dissolution of Interorganizational Relationships” 80 Dhaliwal, Schatzberg, Trombley (1993). Auditing: A Journal of Practice & Theory , “An Analysis of the Economic Factors Related to Auditor-Client Disagreements Preceding Auditor Change” 6 Schwartz and Soo (1995). Auditing: A Journal of Practice and Theory , “An Analysis of Form 8-K Disclosures of Auditor Changes by Firms Approaching Bankruptcy” 5 Krishnan, J. and Krishnan, J. (1997). The Accounting Review , “Litigation Risk and Auditor Resignations” 21 Research Design Category auditor changes NCB-FM-B auditor changes Identify 27 AMEX firms that changed auditors during 1981-1982, and did not go bankrupt or change auditors again before 198_ Fully-match 27 firm-year observations of firms not changing auditors during 1974-1984: match on size (assets), industry (3-digit SIC), and big 8 vs. non-big 8 auditor (matching to the former auditor of the case firms) OLS regression with jackknife standard errors, explaining auditor change or not (linear probability model) Univariate Wilcoxon matched pair tests of Non-Audit Service usage Match to 170 non auditor switch companies by year and size grouping (assets-based) Logit models explaining aditor switch or not auditor changes auditor changes NCB-SM auditor changes CB-FM auditor changes Identify 170 cases of auditor changes within 32 industries (2 digit SIC) having 20 or more companies Semi-match to 71 firm-years within Smith’s sample having an auditor change but without a Within a sample collected by Smith (1988) of reported disagreement: match on industry (64 of CRSP firms that changed auditors during 1973- 71 matches by 2 digit SIC). Also, create from 1982, identify 71 that issued a timely 8-K report Compustat a second comparison sample of of a disagreement with the auditor. industry averages, matched as c Univariate comparisons of Cumulative Abnormal Returns Fully-match 59 firm-year observations from Compustat firms changing auditors during 1985Identify 59 firms voluntarily making a auditor 1992 that did not go bankrupt: match by industry change within 3 years of bankruptcy and having (SIC2 in most cases), size (total assets), and filed 8-Ks, out of 307 firms filing for bankruptcy closest auditor change year??? during 1987-1992 a) Univariate Chi-squared on various b) Logit regression of Bankruptcy c) Univariate t-test on CARs around events. B. is choice-based. Semi-match firm-year observations of firms who dismissed auditors (auditors did not resign), following Stice (1991): one sample of 141 Identify 141 firms whose auditors resigned during matched on year only and then random selection, 1989-1995 and all required data available. a second set of 141 is matched on year and industry. Logistic regressions of resignation versus dismissal auditor changes NCB-FM-B Identify 88 auditor resignations Analyses Incorp. Matching/Stratif ication? Additional control for “closest” matching Reweight-ing for generalizing Needed Applied Needed Applied Needed Applied ? ? ? ? ? ? Selection of Controls or Comparison Sample, If Any Fully-match to AMEX or NYSE firms not changing auditors, retrospectively: Match on Identify 83 AMEX or NYSE listed non-bankrupt “comparable” industry, assets, net income, and NCB-FM-B firms that changed auditors from one Big 8 auditor leverage. 20 firms not matched; yield is 63 to another Big 8 auditor during 1978-1982. pairs. CB-SM Not in SSCI Principal Analyses Explaining Choice and/or Using Matching Identification of Cases, or Stratified Sample CB-SM Dunn, Hillier, Marshall (1999). Accounting and Business Research , “The Market Reaction to Auditor Resignations” Choice-Based and/or Matched Sample Selection From firms not having auditor resignation, match Event study of cumulative on industry, size of company, current status abnormal returns (CARs) for test (merged/bankrupt/continuing in operation) sample and for control sample Yes No Yes Yes Yes No Yes Yes Yes No Yes No Yes No No No No No Yes No No No Yes No Yes No Yes No a)Y b)N c)Y No Yes No No No No No Yes No Yes No Yes No Research Stream Author(s), Journal, Paper Title SSCI citation count 1/26/07 Menon and Williams (1999). Journal of Not in Accounting, Auditing and Finance , “Error SSCI Cost and Auditors’ Termination Decisions” Shu (2000). Journal of Accounting and Economics . “Auditor Resignations: Clientele Effects and Legal Liability” Woo and Koh (2001). Accounting and Business Research , “Factors Associated with Auditor Changes: A Singapore Study” Johnson, Khurana, and Reynolds (2002). Contemporary Accounting Rsearch, "Audit-Firm Tenure and the Quality of Financial Reports" Carcello and Neal (2003). The Accounting Review , “Audit Committee Characteristics and Auditor Dismissals following ‘New’ Going-Concern Reports” dup - 15 auditor changes CB-SM 8 auditor changes Not in SSCI Beasley (1996). The Accounting Review , “An Empirical Analysis of the Relation Between the Board of Director Composition and Financial Statement Fraud” Green and Choi (1997). Auditing: A Journal of Practice & Theory , “Assessing the Risk of Management Fraud Through Neural Network Technology” dup 60 cites CB-SM 269 auditor resignations over the years 1988 to 1996. CB-SM Identify 54 Stock Exchange of Singapore listed firms that changed auditors during 1986-1995. From Big 6 audited clients in 1986-1995 Compact Disclosure database, Identify 821 firms having a NCB-FM-B "short-tenure" auditor relationship (2-3 years) and having complete data. auditor changes CB-FM Identify 66 nonfinancial public firms that received a Going concern report from a big 6 auditor and then dismissed the auditor during the period 19881999, but did not go bankrupt, and which had an audit committee and information available. Also, randomly select 125 clean opinion firms that dismissed auditors (excluding those without audit committees or without financial data available). NCB-FM-B 86 firms having 10-K statements subsequently found to be fraudulent, and data available. fraud CB-SM 9 CB-FM Logit regression of auditor resignations with respect to litigation risk and clientele effects. Semi-match to firms not changing auditors: Match by year, country of incorporation, then random selection of one. Logit regression explaining auditor change Create two samples with full matching: identify OLS regression explaining 821 firms having a medium-tenure audit client unexpected accruals relationship (4-8 years) and 821 firms having a long-term audit relationship (9+ years), match by year, industry (2-4 digit SIC), and closest in size (total assets). First, fully-match each GC firm with a firm On the pooled sample of 187 receiving a going concern report but that did not pairs of two types, run logit dismiss its auditor: Match by year, industry and regression of whether dismissed auditor or not, on variables closest size. Drop four as matches not available, including interactions with yielding 62 pairs. Second, fully-match clean whether received a going opinion firms to 125 that did not dismiss concern or not auditors. Semi-match 75 firm-year observations having Identify 75 public firms having an occurrence of complete data: Match on year, stock exchange, financial statement fraud publicly reported during firm size within +/-30% in market value, industry (SIC4 or 3 or 2) 1980-1991. fraud 86 firms having 10-K statements subsequently found to be fraudulent, and data available. Logit regression explaining resigned versus continuing auditor or declined versus continuing auditor decisions. Two control groups, 1) for each auditor resignation firm ten firmsare randomly drawn from Compustat for the same year resulting in 2,690 firms ,2) from 1263 randomly selected client initiated auditor changes between 1987 and 1955, exclude nonCompustat and financial and utility firms, then randomly select 433 firms. Fully-match 86 nonfraud firm-year observations from Compustat retrospectively: match by year, size (“comparable” assets), and industry. Fully-match 86 nonfraud firm-year observations from Compustat: match by year, size, and industry (4 digit SIC). Analyses Incorp. Matching/Stratif ication? Additional control for “closest” matching Reweight-ing for generalizing Needed Applied Needed Applied Needed Applied ? ? ? ? ? ? Selection of Controls or Comparison Sample, If Any Create a control sample of 2168 firm year observations (in 2168 distinct firms ) matched by Identified 2 samples in the 1990-1996 period; 217 2 digit SIC, Big 6 auditor and year, matched to firms whose Big 6 auditor resigned, 67 firms the observations in either case samples, and retained the same auditor for the seven year whose Big 6 auditor declined reappointment period. auditor changes auditor changes Green and Calderon (1995). Accounting Enquiries , “Analytical Procedures and Auditors’ Capacity to Detect Management Fraud” Principal Analyses Explaining Choice and/or Using Matching Identification of Cases, or Stratified Sample Not in SSCI 10 Research Design Category Choice-Based and/or Matched Sample Selection Yes No No No No No Yes No No No No No Yes No No No No No Yes No Yes Yes Yes No Yes No Yes Yes No No Yes Yes Yes No Yes No Yes No No No No No Yes No Yes No Yes No Univariate Wilcoxon signed rank tests of differences Logit regression explaining fraud Neural network explaining Fraud or non-fraud Research Stream Author(s), Journal, Paper Title Bell and Carcello (2000). Auditing: A Journal of Practice & Theory , “A Decision Aid for Assessing the Likelihood of Fraudulent Financial Reporting” Total SSCI citation count 1/26/07 6 Research Design Category Principal Analyses Explaining Choice and/or Using Matching Identification of Cases, or Stratified Sample Selection of Controls or Comparison Sample, If Any Employ sample of 77 audit client engagements of an audit firm where material financial statement fraud was identified during 1960-1989, as identified in a prior study. For comparison, solicit survey responses from a stratified sample of non-fraud audit engagements of the same firm during 1990, with stratification ensuring industry representation was proportional to the audit firm’s client base (but we presume in differe fraud CB-SM 1027 Choice-Based and/or Matched Sample Selection Analyses Incorp. Matching/Stratif ication? Additional control for “closest” matching Reweight-ing for generalizing Needed Applied Needed Applied Needed Applied ? ? ? ? ? ? Logit regressions explaining fraud or not. Yes No No No No No
© Copyright 2025