Factors of the Wisconsin Card Sorting Test as Measures of Frontal

Psychiarry Research, 46:175-199
Elsevier
175
Factors of the Wisconsin Card Sorting Test as Measures
of Frontal-Lobe Function in Schizophrenia and in
Chronic Alcoholism
Edith V. Sullivan, Daniel H. Mathalon, Robert 6. Zipursky,
Zoe Kersteen-Tucker,
Robert T. Knight, and Adolf Pfefferbaum
Received June 3, 1991; revised version received April 17, 1992; accepted June 13, 1992.
Abstract. The purpose of this study was to examine the factor structure of the
Wisconsin Card Sorting Test (WCST). The scores of 22 patients with schizophrenia, 20 patients with chronic alcoholism, and 16 normal control subjects were
entered into a principal components
analysis, which yielded three factors:
Perseveration,
Inefficient
Sorting, and Nonperseverative
Errors. WCST performance
of seven patients with lesions invading the dorsolateral
prefrontal
cortex,
available
from another
study, provided
criterion
validity for the
Perseveration factor and, less strongly, for the Inefficient Sorting factor. Two
patterns of performance characterized the three patient groups: the schizophrenic
group and frontal lobe group had the highest Perseveration factor scores, whereas
the alcoholic
group
had
the highest
Inefficient
Sorting
scores;
the
Nonperseverative
Errors factor showed no significant group differences. Construct validity of these factors involved assessing, in all but the frontal group, the
degree of overlap (convergent validity) and separation (discriminant
validity) of
each WCST factor with scores from tests of other cognitive functions. The convergent and discriminant
validity of the Perseveration
factor, but not the remaining
two factors, received support only within the group of schizophrenic
patients.
Key Words. Dorsolateral
ponents analysis.
prefrontal
cortex,
neuropsychology,
principal
com-
Classical studies of brain-behavior
relationships
pose questions about impaired and
spared cognitive functions in patients with known brain lesions. Clinical assessment,
by contrast,
typically
uses neuropsychological
tests to document
patterns
of
functional
sparing and loss in an effort to determine the presence and locus of brain
Edith V. Sullivan, Ph.D., is Health Science Specialist,
Psychiatry
Service, Palo Alto Department
of
Veterans
Affairs Medical Center, and Senior Research
Associate,
Department
of Psychiatry
and
Behavioral
Sciences, Stanford University School of Medicine. Daniel H. Mathalon,
Ph.D., is a medical
student, Stanford
University School of Medicine. Robert B. Zipursky,
M.D., was Staff Psychiatrist,
Psychiatry
Service, Palo Alto Department
of Veterans Affairs Medical-Center,
and Assistant Professor,
Department
of Psvchiatrv
and Behavioral
Sciences. Stanford
University
School of Medicine. Zoe
Kersteen-Tucker,
Ph.D., is a postdoctoral
associate; School of Education,
University of California,
Berkeley, and Department
of Neurology,
University of California,
Davis. Robert T. Knight, M.D., is
Professor of Neurolcgy,
University of California, Davis. Adolf Pfefferbaum,
M.D., is Chief of Psychiatry
Research, Psychiatry Service, Palo Alto Department
of Veterans Affairs Medical Center, and Professor,
Department
of Psychiatry
and Behavioral
Sciences, Stanford
University
School of Medicine.
Dr.
Zipursky is now at the Clarke Institute of Psychiatry in Toronto. (Reprint requests to Dr. E.V. Sullivan,
Psychiatry Service, 116A3, DVA Medical Center, 3801 Miranda Ave., Palo Alto, CA 94304, USA.)
0165-1781/93/$06.00
@ 1993 Elsevier Scientific
Publishers
Ireland
Ltd.
176
pathology.
Criterion
validity
is usually inferred from the sensitivity
of a neuropsychological
test to selective functional
impairments
in patients with known focal
lesions. Patients with focal lesions in a common
location, forming the criterion
group, are likely to have deficient test performance
for a common reason-their
shared pathology.
Deficient performance
on the same test by patients without
demonstrable
brain lesions, however, may be attributable
to the dyshnction
of brain
regions other than or in addition to the region implicated
in the criterion group.
Thus, it cannot be assumed that the criterion validity of a neuropsychological
test
will generalize to populations
of patients with unknown or diffuse brain pathology.
Outside of the original criterion population,
the use of a neuropsychological
test
for the assessment of a specific brain function (or dysfunction)
requires additional
evidence of validity in the particular
population
under study. The difficulty in
establishing
this additional validity is that no single validity criterion is available in
patients with unknown or diffuse brain pathology (see also Bornstein,
1986; Cur et
al., 19900, 1990h; Heinrichs,
1990; Matarazzo.
1990; Yeo et al., 1990). Given this
criterion problem, the only recourse is to demonstrate
construct \wlidit~~ (Cronbach
and Meehl. 1955) by showing
patterns
of convergence
or correlaiions
among
neuropsychological
tests that have established sensitivity to focal lesions (i.e., tests
for which similar criterion validity has been demonstrated).
We present here an
example of an approach that addresses both criterion and construct validity of a
commonly used neuropsychological
test, the Wisconsin Card Sorting Test (WCST),
in psychiatric
diseases in which the underlying
pathology
is global or poorly
localired
in this case, schizophrenia
(Zipursky et al., 1992) and chronic alcoholism
(Pfefferbaum
et al.. 1992).
Perhaps the earliest evidence for the dorsolateral
prefrontal
cortical (I)l.Pl-(‘)
specificitv of the WCST comes from Milner’s (1963) description of patients who had
undergone cortical excisions of epileptogenic
foci. Patients with frontal-lobe
lesions
invading the Dt.PFC were impaired in the number of categories sorted and committed a large number of perseverativc errors as compared with patients with trontallobe lesions sparing the DLPFC
and patients with temporal-lobe
excisions.
An
observation
of particular importance
and relevance to psychiatric patients, who do
not typically have large focal cortical abnormalities.
is Milner’s observation
that the
WCST performance
of patients whose epileptogenic
focus included the DLPFC was
impaired even before surgery, suggesting that such physiological
derangement
is
sufficient to impair function.
Schizophrenia
and chronic alcoholism are characterized
by global cortical pathology, involving (among other structures) the frontal lobes (Weinberger
et al., 1986;
Pakkenberg,
1987; Ron, 1987; Freund and Ballinger, 1988; Shimamura
et al.. 1988:
Harper and Kril, 1990; Lishman, 1990; Zipursky et al., 1992; Pfefferbaum et al., 1992).
The WCST is widely used in psychiatric studies as a quantitative
measure of function
of the frontal lobes, in particular,
the DLPFC (e.g., in schizophrenia:
Kolb and
Whishaw, 1983; Weinberger et al., 1986, 1988; Goldberg et al., 1987; Berman et al.,
1988; Yates et al., 1990; Braff et al.. 1991; e.g., in chronic alcoholism:
Tarter, 1973;
Malmo, 1974; for reviews, see Heaton, 1981; Ryan and Butters, 1983; Parsons et al.,
1987). Its construct validity in nonlesioned groups, however, is not yet fully established.
177
Furthermore,
while it is tempting to attribute poor performance
on the WCST by
schizophrenic
and alcoholic subjects to frontal-lobe
dysfunction,
a clear attribution
is obscured for at least two reasons: (1) Several studies of patients with circumscribed
lesions do not fully support the frontal specificity of the WCST (Eslinger and
Damasio, 1985; Heck and Bryer, 1986; Kersteen, 1989; Anderson et al., 1991). (2) The
WCST is a complex problem-solving
task that probably requires multiple cognitive
processes rather than a single function (e.g., Anderson
et al., 1991; Dehaene and
Changeux,
1991). An expanded scoring system developed by Heaton (198 I), referred
to here as “conventional
scores,” offers an opportunity
to differentiate
these
processes. Although
a high degree of intercorrelation
exists among the WCST
scores, the possibility remains that the scoring methods do not simply measure a
unitary function.
Rather, several dimensions
could underlie the various WCST
scores, not all of which would necessarily reflect the cognitive function associated
with DLPFC integrity. The multiple measures of the WCST (Heaton,
1981) have
never been submitted to factor analysis, which could reduce redundancies
of the
many
scoring
systems
and, further,
suggest dissociable
cognitive
processes
contributing
to performance.
Accordingly,
the purpose of this study was three-fold:
(1) to identify multiple
dimensions
of WCST performance
by principal components
analysis; (2) to examine
the criterion
validity of the resulting factors as reflected in their sensitivity
to
impairment
in patients with lesions invading the DLPFC;
and (3) to assess the
construct validity of each WCST factor against other measures of frontal-lobe
and
nonfrontal-lobe
functions in two psychiatric populations.
In connection
with construct
validity,
we used bivariate
and multivariate
approaches to examine both the convergent and discriminant
validity of the WCST
factors in schizophrenia
and alcoholism.
To this end, we used tests previously
demonstrated
to be sensitive to DLPFC function
(i.e., tests with demonstrated
criterion validity), other than the WCST, to seek evidence supporting the convergent
validity of the WCST factors as potentially valid measures of the executive function
of the frontal lobes. Although Goldberg et al. (1988) used four tests of frontal-lobe
function (WCST, Category Test, Trail Making B, and verbal fluency) in an attempt
to characterize the relationship
among them and to determine which was the best to
detect impairment
in schizophrenia,
only one test (WCST) was known from lesion
studies to be sensitive specifically to DLPFC function. The discriminant
validity, or
specificity,
of the WCST factors as measures of executive function
would be
supported
by showing that WCST performance
was not associated with tests of
nonexecutive
function. Thus, we used memory tests known to be dependent
upon
medial temporal-lobe
integrity and dissociable from frontal-lobe
integrity (Mimer,
1958, 1971; Milner et al., 1985, 1991).
The tests used here (i.e., self-ordered
pointing and temporal ordering tests) to
assess the construct validity of the WCST factors have been shown in studies of
patients with circumscribed
lesions to be specific in double dissociation
paradigms.
In particular,
studies by Petrides and Milner (1982; Milner et al., 1985) have
shown that patients with frontal-lobe
lesions (mostly chronic epileptic patients
treated with resection) were selectively impaired on self-ordered pointing tests (but
178
see Kersteen, 1989), but not on tests of fact memory. By contrast, patients with large
excisions of the temporal lobes could show impairment
on self-ordering
tests, but
patients with small excisions of the temporal lobes were not impaired; moreover,
both temporal-lobe
groups showed significant impairment
on tests of fact memory.
Clearly, if the alcoholic and the schizophrenic
patients of the present study have
medial temporal-lobe
pathology at all, it is more likely to fall in the category of small
rather than large lesions. Thus, it is unlikely that the contribution
of the potential
medial temporal-lobe
pathology
in producing
an ordering deficit would be significant. Furthermore,
an unqualified
double dissociation
has been shown for the
temporal ordering tests, whereby patients with frontal-lobe
lesions are impaired in
making temporal-order
judgments but not,in item recognition,
whereas patients with
temporal-lobe
lesions invading the hippocampal
complex show the opposite pattern
of sparing and loss (Mimer, 1971; Milner et al., 1991).
The selection of tests for demonstrations
of construct
validity is necessarily
imperfect.
No single study can provide the definitive
answer to the question of
construct
validity. Rather, the test relationships
reported in this article can only
contribute
to the accumulating
“nomological
network”
of associations
and
dissociations
on which the ultimate answer to the construct validity question must
rest (after Cronbach and Meehl, 1955).
Methods
Subjects. All schizophrenic
patients, alcoholic patients, and normal control subjects were
right-handed
male veterans of the United States armed forces. The schizophrenicpatients
met
DSM-III-R
criteria for the diagnosis of schizophrenia
(American
Psychiatric
Association,
1987) and were recruited from the inpatient psychiatric service of the Palo Alto Department
of
Veterans Affairs Medical Center (PADVAMC)
(Table 1). All but one of the subjects were
inpatient{ on the unlocked voluntary ward of the Mental Health Clinical Research Center
(MHCRC);
the remaining patient was housed on a locked ward at the time of the study but
had previously been a patient on the MHCRC ward. Informed consent for participation
in
this study was obtained for all subjects. Patients meeting criteria for current DSM-III-R
alcohol abuse or who had ever met criteria for DSM-III-R
alcohol dependence were excluded
from the study. Other reasons for exclusion were a history of significant medical illness, head
injury resulting in loss of consciousness
for more than 30 minutes, or current DSM-III-R
diagnosis of drug dependence. DSM-III-R
diagnoses were determined in clinical interview by
a psychiatrist,
clinical psychologist,
or psychiatric research fellow. The diagnoses for all but
one of the subjects were reached by consensus of two of the above clinicians; the remaining
patient was diagnosed using an earlier system for establishing
diagnoses at the MHCRC in
which the DSM-III-R
diagnosis was determined
by a single clinician. All schizophrenic
patients met DSM-III-R
criteria for chronic schizophrenia
and were subtyped as follows:
paranoid (n = 5), disorganized
(n = 4) undifferentiated
(n = IO), and residual (n = 3). In
all patients
were diagnosed
by Research
addition
to receiving DSM-III-R
diagnoses,
Diagnostic Criteria (RDC; Spitzer et al., 1975). The Schedule for Affective Disorders and
Schizophrenia-Lifetime
Version (SADS-L; Endicott and Spitzer, 1978) was administered
to
each patient by a trained research assistant or research nurse. RDC diagnoses were established
by a process%consensus
among the same clinicians involved in the DSM-III-R diagnosis along
with the SADS-L interviewer. Of the 22 patients inthis study, 19 met RDC for chronic schizophrenia of the following subtypes: paranoid (‘n = IO), disorganized
(n = 4) undifferentiated
(n = 2) and residual (n = 3). The three remaining patients met RDC for schizoaffective
disorder with the following subtypes: chronic, depressed type, mainly schizophrenic (n = 1); chronic, depressed type, other (n = I); and subchronic, manic type, mainly schizophrenic
(n = 1).
(n = 20)
(n = 22)
SD
Mean
SD
Mean
SD
Mean
9.4
61.92
5.1
37.9
5.6
34.4
4.1
14.1
1.8
12.02
2.2
1 2.62
2.1
15.3
Education
WI
-
4.0
18.8
5.6
18.7
5.5
21.9
Handedness
score’
3.7
3.73
7.7
1 8.33
6.7
12.03
-
Disease
duration
(yr)
’
-
7.9
106.0
9.1
106.7
7.5
107.4
NART
IQ
1. Based on a quantitative measure of handedness (Crovit! and Zener, 1962). Right-handers score between 14 and 31.
2. Age was significantly different from the control group, and education was significantly different from the remaining groups (analysis of variance, p <’ 0.001).
3. All comparisons were significant:frontal < schizophrenic < alcoholic (analysis of variance, p < 0.0001).
4. ” = 4.
Note. NART = National Adult Reading Test.
Frontal lobe (n = 7)
Alcoholic
Schizophrenic
SD
6.9
36.7
Normal control (n = 16)
Mean
Group
characteristics
Age
WI
Table 1. Demographic
4.8
10.0’
2.0
9.8
2.3
9.8
3.1
11.0
Vocabulary
sc+ll
score
180
The schizophrenic
patients underwent cognitive testing when they were considered by their
treatment
team to have likely achieved the maximum benefit from their medications.
All
schizophrenic
patients were receiving treatment with standard antipsychotic
medications with
the following three exceptions: (1) three patients were participating
in a randomized
doubleblind study of a new antipsychotic,
raclopride (Astra); (2) one patient was participating
in an
open label study of the novel antipsychotic
drug, (X-943 (Parke-Davis);
and (3) one patient
reported
that he had not taken any medication
given to him during a part of his
hospitalization,
including the time of his participation
in this study. In addition, the patients
were on a broad range of additional
psychopharmacologic
medications
during testing,
including anticholinergics,
benzodiazepines,
lithium, and tricyclic antidepressants,
as well as
chloral hydrate, propranolol,
amantadine,
hydroxzine, and buspirone.
The alcoholic group included 20 patients who were enrolled in an inpatient alcoholic
rehabilitation
program at the PADVAMC;
all had been abstinent for 3 to 4 weeks before
testing. The alcoholic patients. all right-handed
men, were recruited for this study as follows:
Consecutive
admissions
to the Alcohol Rehabilitation
Program at the PADVAMC
were
reviewed for eligibility. Patients entered into this study met RDC for alcoholism, had no past
history of hospitalization
for DSM-Ill-R
schizophrenia
or major affective disorders, and had
no history of medical or neurological
illness or trauma (e.g., loss of consciousness
> 30
minutes) that would have affected the central nervous system. Subjects who met RDC for
substance abuse other than alcohol within the past year were excluded. These criteria were
applied as follows: The hospital charts of all new admissions
were reviewed for readily
determinable
exclusion factors. Then, patients who passed the initial chart review were invited
to participate in the study and, if they consented, underwent additional screening. Psychiatric
screening involved an interview by a psychiatrist or a postdoctoral
fellow in psychiatry who
used the SADS-L to ensure the absence of current or past psychiatric and nonalcoholic
drug
use problems,
and to confirm the RDC diagnosis. Screening also included a medical and
psychiatric history. physical examination,
and a panel of blood tests (complete blood count,
SMA-20) administered
to all patients on admission.
The rzormal c,ontro/ group comprised
I6 community
volunteers.
recruited by newspaper
and word of mouth.
Other than the primary
diagnoses
of
advertisement,
posters,
schizophrenia
or chronic alcohol abuse, the control subjects met the same inclusion and
exclusion criteria as the schizophrenic
and the alcoholic patients. No schizophrenic
patient or
normal control subject met DSM-III-R
criteria for current substance dependence or for past
or current alcohol dependence.
The alcoholic and the control subjects were drawn tram a larger pool of research subjects to
match the schizophrenic
group. The schizophrenic
and alcoholic groups were matched in
years of formal education and duration of illness. Although the normal control group had
more years of formal education than the two patient groups (p < O.OOl), all three groups
performed
similarly on the National
Adult Reading
Test, an estimate
of premorbid
intelligence (Nelson, 1982) and on the Vocabulary subtest of the Wechsler Adult Intelligence
Scale-Revised
(Wechsler, 1981) an estimate of current intelligence (Table I). As discussed by
Zipursky et al. (1992). matching groups on the basis of premorbid intelligence may be more
meaningful than matching on morbid intelligence or years of education. Group differences in
years of education
may result from premature
dropout
from school due to prodromal
symptoms of the psychiatric disorder (cf. Meehl, 1971) and, in turn, could influence estimates
of current intelligence based on the Wechsler scales. which are highly correlated with years of
formal education.
The patients with frontal-lobe
lesions had been tested by one author, and their WCST data
were presented previously in a study of cortical contribution
to repetition priming (Kersteen,
1989; Kersteen-Tucker
and Knight, 1989). Their characteristics
are summarized
here. This
group comprised seven right-handed
men, aged 43 to 73 years (Table I). These patients were
selected from and tested at the Neurology Service of the Martinez Department
of Veterans
Affairs Medical Center on the basis of having unilateral focal lesions invading the dorsolateral
prefrontal cortex on the left or right side as determined
by lesion reconstruction
with com-
181
puted tomography
or magnetic resonance imaging (Frey et al., 1987; Kersteen, 1989). The
average lesion volume was estimated to be 40.7 cc (SD = 13.1, range = 25.9-59.5). Six men
had experienced unilateral cerebral vascular accidents (4 left-sided and 2 right-sided) and one
a coloid cyst of the third ventricle, which was resected via the dorsolateral frontal cortex; these
events or surgery had occurred 1 to 11 years before testing. Four patients had an expressive
aphasia, but none was demented as assessed by the Mini-Mental Status Examination (Folstein
et al., 1975) (mean = 27.0, SD = 1.3, range = 25-29).
Procedure. Four measures of executive function (including the WCST) and four of memory
function were administered
to the schizophrenic,
alcoholic, and control subjects. For the
patients with frontal-lobe lesions, only WCST scores were used in this analysis, and their data
were not used to derive the WCST factors.
Tests of executive function: WCST and ordering. The ordering tests were used to
assess the convergent validity of the WCST factors, derived from the principal components
analysis, as measures of executive function.
In the WCST (Grant and Berg, 1948; Milner, 1963), the subjects sorted 128 cards, each of
which displayed 1 to 4 colored symbols, according to one of three strategies: color, form, or
number. The subject attempted to arrive at the correct sorting strategy, which was controlled
by the examiner but never mentioned explicitly. The correct strategy was switched without
warning after 10 consecutive correct responses. In all cases except for the frontal-lobe
group,
testing continued for 128 cards; thus, the maximum number of categories possible to achieve
was 12. Testing of the frontal-lobe
patients was discontinued
in two cases after six categories
had been achieved, in three cases after most of the cards had been sorted (i.e., at least 110
cards), or in two cases when patients experienced extreme frustration (one patient completed
44 cards and another 64 cards) (Kersteen, 1989). To equate all groups for the number of cards
sorted, the scores of the frontal patients were prorated to 128 cards. The responses were scored
according to Heaton’s criteria (1981) and yielded 14 scores: number of categories achieved,
total errors, total correct responses, number of correct responses excluding those achieved in
successful categories, prior category perseverations,
all perseverative responses, perseverative
errors, nonperseverative
errors, percent perseverative errors, unique responses, trial to the first
category, percent conceptual responses, failure to maintain set, and learning-to-learn.
Of the
14 scores, 11 were used in principal components
analysis. The three excluded scores (prior
category perseverations,
trial to the first category, and learning-to-learn)
were not used because of missing values for subjects who achieved fewer than three categories.
The verbal self-ordering
test (Petrides and Milner, 1982) samples working memory by
requiring subjects to create, organize, and execute a sequence of acts. In the verbal version,
test stimuli consisted of 54 abstract words taken from Paivio et al. (1968). Following the
procedure of Petrides and Milner (1982), all words were rated below 3.2 out of 7 on a scale
where the lower the rating, the less capable a word was at evoking an image. In addition, these
words had medium to high Thorndike-Lorge
(1944) frequency ratings, thereby excluding rare
words, which could be either unknown or salient to subjects. Two tests were administered,
one
consisting of six words and one of 12 words. Three trials of each test were given. The test order
was the same from subject to subject: the six-word test preceded the 12-word test.
The self-ordering test proceeded as follows. For the six-word test, the subject saw a series of
six pages shown one at a time, each displaying the same six words but in different places in a 2
X 3 matrix of squares. The subject’s task was to point to a word on successive pages not
previously selected. Thus, on the first page, the subject was correct regardless of which word
he chose; on the second page, only five words were potentially correct, and so on. For the
12-word test, 12 words and pages were used.
One component
of the nonverbal temporal ordering test assessed ordering function and
another component assessed memory function; the function tested depended upon the type of
question asked, and each component
was scored separately. Subjects were shown a series of
colored wallpaper swatches glued to 5- X I-inch index cards and were instructed to remember
them in their order of appearance. At various intervals throughout the series, subjects received
182
one of two types of two-choice
recognition
tests: recency judgment
tests contained
two
previously
presented
designs and measured
ordering function;
design recognition
tests
contained a previously presented design and a foil, never before seen on the test and measured
memory function (see below). For recency judgments,
testing ordering, patients were asked,
“Which one have you seen more recently?” For design recognition,
testing memory, patients
were asked, “Which one did you see in this test?” In both cases, patients pointed to their
choice. This test was an abbreviated version of the one described by Sullivan and Sagar (1989).
Although
recency judgments
and design-recognition
responses
were made after varying
intervals, only the two total scores were used in this analysis and were expressed as percent
correct of 50 recency-judgment
trials and percent correct of 25 item-recognition
trials.
Tests of memory function. These tests assessed verbal and nonverbal recognition
and
recall. They were used to assess the discriminant
validity of the WCST factors as measures of
executive function.
Recognition of nonverbal material was assessed within the context of the temporal ordering
test. As described above, recognition of previously seen designs (i.e., memory function) was
tested in a two-choice forced-choice
paradigm.
A final and separate part of the verbal self-ordering
test involved an unannounced wordrecognition test, assessing verbal recognition memory. For each of the 54 words used in the
six-word and 12-word self-ordering
tests combined, a three-choice,
forced-choice
recognition
question was posed. The advantage of testing recognition memory in this paradigm was that
the examiner
had some degree of assurance
that all subjects, regardless
of attentional
incapacities, attended to all stimuli because of the multiple pointing opportunities.
In the test of delayed recall of verbal memory, patients were asked to recall the stories of the
Wechsler Memory Scale (WMS; Wechsler and Stone, 1945) 1 hour after immediate recall and
without warning (Milner, 1958; Corkin et al., 1985). The score was the average number of
points achieved for the two stories.
In the test of delayed recall of nonverbal material, patients were instructed to recall the three
drawings of the WMS 1 hour after immediate recall and without warning (Milner, 1958).
After the recall tests, the patients copied the drawings from the cards so that the recall scores
could be corrected for drawing ability (Cooper et al., 1991). The copy and delayed recall scores
were derived according to Wechsler and Stone’s (1945) criteria for immediate recall. The
delayed recall score used in this analysis was the delayed recall score divided by the copy score,
following a similar correction approach used by Jones-Gotman
(1986).
Statistical Analysis. Comparisons
of the frontal-lobe
patients with the control subjects used
t tests. One-way analyses of variance (ANOVAs) compared performance
across the four
groups on the 14 conventional
scores of the WCST, and across the three groups (control
subjects, schizophrenic
patients, and alcoholic patients) on composite scores of the ordering
and the memory tests. Post hoc comparisons
for significant
F values (a = 0.05) were
performed with Scheffi tests (a = 0.05).
To reduce the 11 WCST scores to a smaller set of nonredundant
scores, the WCST data
were subjected to a principal components
analysis with a varimax rotation. The three groups
(control subjects, schizophrenic
patients, and alcoholic patients) were pooled for the principal
components
analysis to identify the major dimensions along which the groups might differ, as
well as to generate factor scores that could be compared across groups. On the basis of the
factor solution, WCST factor scores were computed by summing the standard scores from the
scores with high loadings on the factors. These WCST factor scores were then used in the
subsequent analyses.
Two composite scores were formed from the ordering and the memory test scores. The
ordering composite was a single score derived from the three ordering measures: six-word and
12-word self-ordering
tests and total nonverbal
temporal
ordering
test. The memory
composite was a single score derived from the four memory tests: recognition of verbal and
nonverbal
material and delayed recall of verbal and nonverbal
material. The ordering
composite was formed by standardizing
(i.e., calculating z scores) and then summing the
183
scores from the three ordering tests; the memory composite was formed by standardizing
(z scores) and then summing the scores from the four memory tests. A constant was added to
each composite so as to keep all scores in the positive direction. For both the ordering and the
memory composites, high scores reflected good performance.
The convergent validity and the discriminant validity of the WCST factors were assessed
within each of the diagnostic groups using bivariate and then multivariate approaches. The
bivariate approach examined simple Pearson product-moment
correlations between the
WCST factors and the ordering and memory composite scores. In addition, the regression
equations reflecting convergent validity were directly compared across groups using
hierarchical regression analyses, which included the WCST factors, the ordering composite,
and “dummy-coded” categorical variables representing group membership (Pedhazur, 1982).
The multivariate approach used hierarchical regression analysis (Cohen and Cohen, 1985) to
control for overlap of variance between the ordering and memory composites while examining
their unique relationships with the WCST factors.
Results
WCST Conventional
Scores. Table 2 presents the results of the conventional
scoring of the WCST. The performance
of the frontal-lobe
group was similar to that
reported by others (e.g., Milner, 1963; Heaton, 1981), and this sample therefore
appears to be representative
of other such samples. Of the nine instances showing
nominally
significant
group differences when all four groups were compared,
six
were significant with Bonferroni
correction @ < 0.004). The number of categories
sorted was the only WCST measure on which all three patient groups were
significantly
impaired relative to the control group. The schizophrenic
and frontallobe groups shared further impairment
on the following scores: total errors, total
correct,
all perseverative
responses,
perseverative
errors, percent perseverative
errors, and percent conceptual responses. The alcoholic, but not the schizophrenic
or
frontal lobe groups, had significantly
worse scores than did the control group on
number correct minus IO/category and failure to maintain set. In no comparison
did
the patient groups differ significantly
from each other. Five scores showed no group
differences: prior category perseverations,
nonperseverative
errors, unique responses,
trials to first category, and learning-to-learn.
Principal Components Analysis of the WCST Conventional
Scores. The 11
conventional
scores for the WCST from all 58 subjects in the control, schizophrenic,
and alcoholic groups were entered into a principal
components
analysis. Using
varimax rotation, three factors emerged with Eigenvalues > 1, accounting for 91% of
the original variance; Table 3 presents the factor loadings. In addition, the threefactor solution was readily interpretable.
The first factor, accounting for 58% of the
variance and representing
the major dimension
underlying
most of the indexes of
WCST performance,
consisted of seven high loading variables: all perseverative
responses, total perseverative errors, percent perseverative
errors, total errors, total
correct, percent conceptual
responses,
and number of categories achieved. This
factor seemed to tap WCST errors due to perseverative responses, and we labeled it
Perseveration. The second factor, accounting for 19% of the variance, had its major
contributions
from two high loading variables, failure to maintain
set and total
correct minus 10 per category achieved. These indexes seemed to reflect inefficiency
errors
% Perseverative
responses
3.9
-1.6
0.6
SD
0.3
18.4
SD
Mean
70.4
Mean
SD
0.9
1.0
SD
11.5
0.6
Mean
SD
6.6
Mean
6.2
SD
8.5
-8.6
1.2
1.0
24.3
46.0
5.0
14.0
4.4
2.4
21.6
26.2
17.4
22.7
3.8
19.4
Mean
SD
28.0
32.9
36.6
39.6
8.5
7.9
SD
SD
9.3
10.9
Mean
18.1
22.2
12.3
29.7
24.7
75.7
24.7
51.5
3.3
4.6
10.1
-4.0
1.4
1.5
19.2
53.4
12.5
17.1
3.5
2.5
13.9
17.0
8.5
22.1
16.0
20.6
19.4
23.4
14.7
17.1
20.3
34.2
16.7
85.7
17.9
42.9
3.2
5.2
(n = 20)
(n = 22)
6.3
-6.4
1.2
1.1
24.7
39.9
4.1
15.0
2.2
1.0
19.3
30.3
9.7
20.0
24.7
38.8
34.7
47.6
17.8
21.5
9.5
33.0
24.5
69.1
24.5
56.9
2.9
3.6
1. Scores prorated to 128 cards. On average. the frontal-lobe group sorted 98 cards k 32.7.
2. One-way analysts of vanance.
3. Excluded from the principal components analysis.
l
2.37
2.96’
5.11 **
1.51
1.33
5.90’**
0.29
5.99”’
5.37”
2.28
l
4.01 *
6.49”’
6.02”’
6.32”’
F2
< 0.004. Bonferroni corrections
(n = 7)
Frontal lobe’
Note. Values tn boldface type were stgntfrcantly worse than those for the normal control group (Scheffe F test, a 5 0.05). ‘p < 0.05. “p < 0.01, “‘p
a 5 0.05 for 14 comparisons, p 5 0.004.
Learning-to-learn3
Failure to maintain set
% Conceptual
Trials to 1 st category3
Unique responses
errors
errors
responses
9.1
10.9
13.4
SD
SD
17.4
Mean
Mean
11.4
SD
100.7
11.4
Mean
27.3
SD
2.4
SD
Mean
6.3
Mean
Alcoholic
Schizophrenic
Card Sortina Test
Control
(n= 16)
scores on the Wisconsin
perseverations3
Nonperseverative
Perseverative
All perseverative
Prior category
No. correct-l O/category
Total correct
Total errors
Categories
Table 2. Conventional
g
185
Table 3. Principal components analysis of the WCST conventional scores
Factors:
Variance proportion:
Eigenvalues:
Perseveration
0.58
7.24
Nonperseverative
errors
0.14
1.12
Factor loadings
WCSTscores
Perseverative
Inefficient
sorting
0.19
1.62
0.987’
0.055
0.081
All perseverations
0.983’
0.024
0.061
Perseverations
0.977’
0.102
0.069
Total errors
0.927’
0.169
0.312
Total correct
-0.918’
-0.161
0.327
-0.899*
-0.24
0.324
-0.808*
-0.522
0.250
set
-0.049
0.984’
0.027
No. correct-l O/category
0.468
0.848’
0.068
Unique
0.106
0.007
0.839.
0.203
0.027
0.703’
l
errors
% Conceptual
responses
Categories
Failure to maintain
responses
Nonperseverative
errors
Note. Principal components analysis with orthogonal
nonoverlapping loading on the factor.
(varimax) rotation (n = 58). * score with high and relatively
in sorting strategy, and hence this factor was labeled ZneSJicient Sorting. The third
factor, Nonperseverative Errors, accounted
for 14% of the variance and had its
highest loadings from two indexes reflecting WCST errors of a nonperseverative
nature: nonperseverative
errors and unique responses.
On the basis of the results of the factor analysis, factor composites were computed
for each of the three factors by summing the z scores from only those conventional
scores with high loadings on the factor. Each score contributed
to only one factor.
To form the z scores, each conventional
score was standardized
against the mean and
standard deviation for the combined group of control, schizophrenic,
and alcoholic
subjects. Because the frontal-lobe
patients were not included in the factor analysis
and were used to provide evidence of criterion validity of the factors, these means
and standard deviations were also used to standardize the conventional
scores of the
frontal-lobe
patients. High factor composites reflected poor WCST performance.
Criterion Validity of WCST Factors Based on Performance by Patients With
Frontal-Lobe
Lesions. A comparison
of the factor scores of the frontal-lobe
patients with the control subjects provided a test of criterion validity for each WCST
factor derived from the principal components
analysis. Comparisons
with t tests
revealed that the frontal-lobe
group had significantly higher scores on the Perseveration (t = -4.404, df= 21, p = 0.002) and the Inefficient Sorting factors (t = -2.774,
df = 21, p = 0.0114) than did the control group, but that these two groups did not
differ significantly
on the Nonperseverative
Errors factor (t = -0.48, df = 21, NS)
(Fig. 1). Therefore, the performance
of the frontal group supported the Perseveration and Inefficient Sorting factors as measures of frontal-lobe
dysfunction,
and so
provided criterion validity to these factors.
The purpose of the next analysis was to test whether the schizophrenic
and the
alcoholic groups showed a similar pattern of deficits as that observed in the frontal-
186
Fig. 1. Wisconsin Card Sorting Test (WCST) factor scores
3
2Q-
.
15
0
.
.
10.
I!
$5
e
co,
k
e
e
t
II
o
i
;
8
f
COIla
Schizophrenic Alcoholic
Fmntal
c0nuo1
Schizophrenic Alcoholic
Frontal
8
0
8
4*
2
‘2
3,
2,
[
6
z
0
8
0
0
-
_:..
lri
i
-2,
-3-
0
COnoOl
0
0
Schizophrenic
0
Alcoholic
T
:
.
FlUltal
Scatterplots and means of the three WCST factor scores for the four study groups. On these measures, low scores are
good. The Perseveration factw scores of the schizophrenic and frontal-lobe groups were significantly greater than that
of the normal control group (p<O.OOOl),
whereas the Inefficient Sorting score of the alcoholic group was significantly
greater than that of the normal control group (p< 0.01). The Nonperseverative Errors score showed no significant
group differences.
lobe group. To this end, the pattern of performance
based on the three WCST
factors was compared across all four groups with ANOVAs and post hoc Scheffe
tests. The Perseveration
factor scores of the schizophrenic
and frontal groups were
significantly
greater than that of the normal control group (F = 6.22; df= 3, 64;
p < O.OOl),
whereas the Inefficient
Sorting score of the alcoholic group was
significantly
greater than that of the normal control group (F = 4.08; df = 3, 64;
p = 0.01). With the use of a Scheffe post hoc test, which incorporated
a protected
p value by taking into account the number of comparisons
made in an analysis, it
was determined
that the frontal-lobe
group did not differ significantly
from the
187
control group on Inefficient
Sorting; this result conflicts with that obtained with
a simple t test. The Nonperseverative
Errors score showed no group differences
(F = 1.11; df = 3, 64; NS). The correlations
among the WCST factors were high in
the normal control subjects, moderate to low in the schizophrenic
and the alcoholic
patients, and not significant in the frontal patients (Table 4).
Construct Validity of the WCST Factors Based on Convergent and Discriminant Analysis Exclusive of the Patients With Frontal-Lobe Lesions. To assess
the validity of the WCST factors for groups other than the criterion sample of
frontal
patients,
a construct-validation
approach
was taken. Accordingly,
the
three WCST factors were analyzed in parallel in the subsequent analyses to elucidate
the differences in their meaning with respect to the data from the other tests. Only
data from the normal control, schizophrenic,
and alcoholic subjects were considered
in these analyses. Bivariate and multivariate
analytical approaches were used.
Ordering and memory composite scores. Of the total group of 58 subjects, 17
schizophrenic,
19 alcoholic, and 15 normal control subjects had data for all tests; 16
normal control subjects had data for the WCST and ordering tests. The ordering
composite scores of the schizophrenic
and alcoholic groups were inferior to those of
the normal control group, but the differences
were not statistically
significant
(F = 1.49; df = 2, 50; NS) and may reflect low power due to the small number of
subjects. The memory composite score of the schizophrenic
group was significantly
Table 4. Factor and composite
groups
score correlational
Inefficient
sorting
Group
matrix for the four subject
Nonperseverative
errors
Ordering
Memory
Normal control
0.81***
Perseveration
Inefficient
sorting
Nonperseverative
-0.07
0.84”*
0.28
0.59”
0.41
0.18
0.05
-0.28
errors
0.70”
Schizophrenic
Perseveration
Inefficient sorting
Nonperseverative
0.10
0.41*
0.11
errors
-0.70***
0.04
-0.59**
-0.70”’
0.01
-0.75***
O&i’*
Alcoholic
0.52”
Perseveration
Inefficient
sorting
Nonperseverative
Ordering
0.50
0.31
0.27
-0.07
0.29
0.15
errors
Frontal lobe
Perseveration
Inefficient
0.14
-0.18
sorting
‘p < 0.05;“p < 0.01;"'p < 0.001.
0.46’
-0.33
0.12
-
-
0.03
-
-
188
worse than the memory composite scores of the normal control and alcoholic groups
(F= 8.56; df = 2, 50;~ < 0.001) (Fig. 2).
Bivariate Convergent
Validity.
Simple (bivariate)
correlations
among the
WCST factors and ordering composite
scores were calculated within each group
(Table 4). These correlations
bear upon the convergent validity of the WCST factors
as measures of executive function. The convergent validity of a given test is demonstrated to the extent that it is correlated with another measure of the same function
(Campbell and Fiske, 1959; Wiggins, 1980). Thus, convergent validity of the WCST
factors would be supported by negative correlations
between the WCST factors (low
scores reflect good performance)
and the ordering composite (high scores reflect
good performance).
For the Perseveration
and Nonperseverative
Errors factors, the correlations
supported convergent validity in the schizophrenic
subjects only (Table 4, column 3).
In the alcoholic subjects, the correlations
were either not significant or, in the case of
the Perseveration
factor, the correlation
was in the counterintuitive
direction (i.e., a
tendency to perseverate was associated with good ordering performance).
In the
normal control subjects, no correlations
were significant.
These findings strongly suggest that the convergent validity of the WCST factors
as measures of a common cognitive function
depend on diagnostic
status (i.e.,
schizophrenic,
alcoholic, or normal control). fiierarchical
regression analyses were
conducted
to address explicitly
the dependence
of the correlations
upon group
membership.
In predicting each WCST factor, the ordering composite was entered
into a regression analysis first, followed by two dummy-coded
variables representing
group membership.
Finally, the product of the ordering composite and the dummycoded variables was entered in step 3, allowing for a test of differences among the
groups in the slopes of the regression lines. A significant predictive increment at step
3 suggested that the degree and/or direction of the relationship
between the WCST’
factors and the ordering composite interacted with diagnostic status (Table 5).
In predicting the Perseveration
factor from the ordering composite, there was a
strong group interaction
(p < 0.0001) (Table 5). There was a strong and expected
Fig. 2. Ordering and memory composite
Ordering
scores
MelOrJ
Means and standard errors of the mean of the ordenng and memory composite scores. On these measures, high
scores are good. The ordering composite scores of the schizophrenic and alcoholic groups were inferior to those of the
normal control group, but the differences were not statistically significant. The memory composite score of the
schizophrenic group was significantly worse than the memory composite scores of the normal control and alcoholic
groups (p < 0.001).
189
Table 5. Hierarchical regression of Wisconsin Card Sorting Test (WCST)
factors on ordering composite and diagnostic group
I?2
R
chanae
WCST factor
Stet4
Predictor added
Perseveration
Inefficient sorting
Nonperseverative
*p< 0.01:
errors
1
Ordering
2&3
Diagnostic
composite
0.21
0.04
groups’
0.46
4&5
Ordering
X diagnostic
0.17’
0.71
0.28*’
composite
0.10
0.01
group
0.45
0.19*
1
Ordering
2&3
Diagnostic
4&5
Ordering
X diagnostic
1
Ordering
composite
2&3
Diagnostic
4&5
Orderina
group*
group
group
X diaanostic
arouD
0.47
0.02
0.22
0.05
0.30
0.04
0.53
0.19*
‘“p < 0.001.
1. Two dummy-coded variables were used to represent the three diagnostic groups.
2. Two product variables, obtained by multiplying the ordering composite with each dummy variable, were used to
represent group differences in the slopes of the regression line.
relationship
in the schizophrenic
subjects (r = -0.70) a moderate but counterintuitive relationship
in the alcoholic subjects (r = OSO), and a weak counterintuitive
relationship
in the normal control subjects (r = 0.28) (Fig. 3).
When the Nonperseverative
Errors factor was predicted
from the ordering
composite, a significant group interaction
effect emerged (p = 0.004) (Table 5). The
slopes differed among the groups in a pattern similar to that observed with Perseveration: the schizophrenic
subjects showed the expected relationship
(r = -0.60) the
alcoholic subjects exhibited a weak but counterintuitive
relationship
(r = 0.30), and
the normal control subjects showed virtually no relationship
(r = 0.05).
In predicting Inefficient Sorting, no interaction
was found (Table 5). There was,
however, a significant group effect at step 2 of the regression equation, indicating
that the groups showed mean differences in their tendency toward Inefficient Sorting
(see ANOVA results described earlier). At step 1, with all groups combined,
the
Fig. 3. Perseveration factor scores regressed on ordering composite scores
Schizophrenics
Normal Controls
20
f
%
1s
101
Uz
I
=
e
5
m
y.1.S2.371
y=-*.26+0.51x
r=m
.
.
0
.
.
.=I_
j”
-
Alcoholics
20
.
-10’
-6
-4
ordaia#
-2
.
0
cmpaitc
2
scm
l
’ -10’
4
-6
-4
-2
OlJaiqWb=
0
2
4
-4
-4
ordate
-2
0
2
4
CompMilc scare
Convergence of the Perseveration and the ordering composite scores. Only for the schizophrenia
correlation in the expected direction. The slopes differed significantly (p < 0.0001).
group was the
190
ordering composite was not a significant predictor of Inefficient Sorting.
Bivariate discriminant
validity. Simple (bivariate)
correlations
between the
WCST factors and the memory composite bear upon the discriminant
validity of the
WCST factors as measures of executive function. Discriminant
validity is concerned
with demonstrations
of what a test does not measure and is an indication
of a test’s
specificity (Campbell and Fiske, 1959; Wiggins, 1980). Accordingly,
one indication
of the discriminant
validity of the WCST factors is their lack of correlation
with
the presumably
unrelated memory composite. Because convergent validity was not
demonstrated
for the normal control subjects or the alcoholic subjects, discriminant
validity was examined
only for the schizophrenic
subjects. Table 5 presents the
correlations
for the alcoholic and normal control subjects for descriptive purposes
only. The results revealed that Perseveration
and Nonperseverative
Errors were
significantly
correlated
with the memory composite
(Table 5, column 4). These
correlations
represent an apparent failure to support the discriminant
validity of
these two WCST factors in the schizophrenic
subjects.
Multivariate
discriminant validity. The emergence of a correlation
between the
ordering composite and the memory composite complicates the interpretation
of the
bivariate analysis of discriminant
validity (Table 4). This correlation may have arisen
from global cortical pathology present in schizophrenia
(Zipursky et al., 1992). In
other words, ordering and memory may have been correlated
because a single
pathological
process produced impaired functioning
of both the frontal-lobe
and the
medial temporal-lobe
regions in schizophrenia.
The bivariate approach to discriminant validity fails to consider this ordering-memory
correlation.
Therefore, a multivariate approach was undertaken
to examine the discriminant
validity of the WCST
factors while controlling
for the shared variance between the ordering and memory
functions in schizophrenia.
In light of the shared variance between the ordering and memory composites, the
simple correlation
between the WCST factors and the memory composite can be
explained
by at least two alternatives:
Alternative 1. The WCST factors lack
discriminant
validity. In addition to the shared variance between the ordering and
memory composite scores, aspects of memory that are unrelated to ordering contribute to WCST performance.
The left panel of Fig. 4 depicts this pattern
of
overlapping
variance among the measures. Alternative 2. The WCST factors reflect
specific measures of executive function.
The specificity,
however, is masked by
correlated
ordering and memory test performance
in schizophrenia
that produce
overlap between memory and WCST performance.
The right panel of Fig. 4 depicts
this pattern of overlapping variance among the measures.
To distinguish
between these alternatives,
we used hierarchical regression analysis
in which the ordering and memory composite scores were entered as predictors for
each WCST factor. The ordering composite score was entered into the equation first.
Then, the change in R* was examined after the memory composite score was entered
at the second step. A significant R2 change at the second step supports Alternative
1
(i.e., supports lack of discriminant
validity), whereas an insignificant
increment in R2
supports Alternative 2 (i.e., supports discriminant
validity). In effect, the analysis at
the second step examined
the predictive
utility of the components
of memory
performance
that were unrelated to ordering performance.
Thus, at the second step,
191
Fig. 4. Alternative patterns of overlapping variance among the measures
Alternative 1. The Wisconsin Card Sorting Test (WCST) factors lack discriminant validity. In addition to the shared
variance between the ordering and memory composite scores, aspects of memory that are unrelated to ordering
contribute to WCST performance. The left panel depicts this pattern of overlapping variance among the measures.
Alternative 2. The WCSTfactors are specific measures of executive function. The specificity, however, is masked by
correlated ordering and memory test performance in schizophrenia that produces overlap between memory and
WCST performance. The right panel depicts this pattern of overlapping variance among the measures.
the contribution
of memory to WCST performance
could be evaluated
while
ordering was held constant. This analysis of discriminant
validity was conducted
only in instances
where convergent
validity
was supported-namely,
for the
Perseveration
and Nonperseverative
Errors factors in the schizophrenic
subjects.
The results for the Perseveration
factor showed that memory performance
did not
produce a significant
improvement
in the R2 when ordering performance
was
controlled
for (R2 change = 0.10, NS). This result is consistent with Alternative
2.
For the Nonperseverative
Errors factor, memory performance
did contribute
a
significant
increment in prediction
when ordering performance
was controlled
for
( R2 change = 0.23, p < 0.05). This result is consistent with Alternative
1 (Table 6).
Discussion
The results of this study were as follows: (1) the schizophrenic,
alcoholic, and
frontal-lobe
groups all showed impairment
on the conventional
scores of the WCST;
(2) the conventional
WCST scores could be reduced to three factors, Perseveration,
Inefficient
Sorting,
and Nonperseverative
Errors;
(3) of these three factors,
Perseveration
and, to a lesser extent, Inefficient Sorting were sensitive to dysfunction
in the frontal-lobe
group and therefore possessed criterion validity; and (4) the
Perseveration
factor, as an index of executive function,
demonstrated
construct
validity for the schizophrenic
group.
Factors of WCST Scores and Patterns of Impairment.
The 11 conventional
scores of the WCST could be reduced
to three factors,
which we named
Perseveration,
Inefficient
Sorting,
and Nonperseverative
Errors.
This factor
structure closely resembles the sources of difficulty common on this task: “inefficient
initial conceptualization,
perseveration,
failure to maintain
set, and inefficient
learning across the several stages of the test” (Heaton, 1981, p. 7). The emergence of
the three factors suggests that different components
of cognition
contribute
to
WCST performance
and provides an attractive alternative
to the redundancy
of
‘p < 0.05; “p < 0.01
Nonperseverative
Perseveration
WCST Factor
errors
Memory composite
Ordering
Memory
1
2
composite
composite
Ordering composite
1
Predictor added
2
Step
0.76
0.59
0.77
0.70
R
0.23’
0.35
0.10
0.4Y’
R 2 change
-0.59
-0.70
Step 1
-0.63
-0.19
-0.42
-0.43
Step 2
fl at each step
Table 6. Hierarchical regression: Wisconsin Card Sorting Test (WCST) factors on ordering and memory composite
scores for the schizophrenic patients
193
analyzing all scores. Indeed, use of redundant scores could obscure the assessment of
the major dimensions underlying WCST performance. Furthermore, choosing a
subset of the scoring methods without regard to the underlying major dimensions
would be arbitrary. Examining WCST performance in terms of a reduced number of
factor scores may afford several benefits, including increased reliability and a more
manageable number of WCST scores. The results of the present study suggest one
possible factor solution for the conventional scores of the WCST, though this factor
structure requires replication. It would also be important to test the power of this
factor structure in predicting performance patterns in new samples of schizophrenic,
alcoholic, and frontal-lobe patients.
The performance of the frontal-lobe patients suggested that Perseveration and
possibly Inefficient Sorting possess criterion validity for the assessment of
impairment in patients with focal DLPFC lesions. The Perseveration factor result is
in accord with previously published observations that perseveration commonly
accounts for the WCST impairment of patients with frontal-lobe lesions (e.g.,
Milner, 1963; Drewe, 1974; Heaton, 1981). The tendency of schizophrenic patients to
perseverate was also observed by Fey (1951) even before the introduction of
psychoactive medications, indicating that this tendency is not simply a result of
medication. Regarding Inefficient Sorting, alcoholics have been shown to have a
graded impairment in maintaining set, depending upon their length of drinking (i.e.,
< 10 years vs. > 10 years; Tarter, 1973). The patients of the present study would have
been considered long-term alcoholics by Tarter’s criteria and behaved as such.
Apparently, the tendency to make nonperseverative errors does not show sensitivity
to DLPFC pathology.
Construct
Validity
of WCST
Factors as Measures
of Executive
Function.
The validity pattern, based on the frontal-lobe patients, could not be assumed to
hold for both psychiatric groups. Thus, an attempt was made to corroborate the
criterion validity results with evidence of construct validity of the three factors.
Moreover, construct validation was assessed in each group separately since the
factors might be valid measures of executive function in some populations but not
others.
The bivariate correlations of the Perseveration and Nonperseverative Errors
factors with the independent ordering composite (executive function) supported
their convergent validity within the schizophrenic group, but not within the alcoholic
or normal control groups. The Inefficient Sorting factor was not supported as a valid
measure of executive function within any of the three groups considered. When
convergent validity was demonstrated, the bivariate correlations did not support the
discriminant validity of the WCST factors.
To assess the multivariate discriminant validity of the WCST factors as measures
of executive function, we used multivariate analysis to test whether these factors did
nor measure memory function in the schizophrenic group. With this approach,
which controlled for the correlation between the ordering and memory composites,
the memory composite showed a significant association with Nonperseverative
Errors, but not Perseveration. Thus, these multivariate results, along with the
convergent validity results, suggested that (1) the Perseveration factor was a valid
194
and specific measure of executive function
in schizophrenic
subjects but not in
alcoholic or normal control subjects, and (2) the Nonperseverative
Errors factor
reflected both executive and memory function in schizophrenic
subjects, whereas in
alcoholic and normal control subjects it was associated with neither function. The
Inefficient
Sorting
factor was not significantly
associated
with the ordering
composite in any group, and it therefore could not be supported
as a measure of
executive function despite its criterion validity in patients with frontal-lobe
lesions.
In construct validation studies, the failure to show convergence between two tests
that theoretically
measure an underlying
construct
raises questions
about the
construct validity of both tests. While this caveat applies to all of the relationships
evaluated
in the present study, it is particularly
salient in the alcoholic subjects,
where the counterintuitive
relationships
of the Perseveration
and Nonperseverative
Errors factors with the ordering composite emerged: the alcoholic subjects showing
relatively poor scores on the WCST factors were the same individuals scoring well on
the ordering composite. Which of these scores should be taken as the true indication
of DLPFC
function
in alcoholics?
Our results do not permit an answer to this
question; rather, the findings markedly reduce our confidence in both the ordering
composite tests and the WCST factors as valid and selective measures of executive
function in alcoholics. We might speculate from these results that alcoholics perform
these tests qualitatively
differently from schizophrenic
patients and normal control
subjects, perhaps because the pathology
responsible
for the impairment
in the
alcoholics may involve brain regions other than the dorsolateral
frontal convexity
and its connections;
one candidate region is the orbital frontal cortex (cf. Freedman
and Oscar-Berman,
1986).
The problem of showing that a test possesses discriminant
validity-that
is, that a
test is a measure of a specific function-is
complicated
in disorders that may exhibit
multiple localizable cognitive or neurological
impairments.
To the extent that the
multiple impairments
in a disorder are correlated, perhaps due to multiple localized
lesions arising from a unitary pathogenesis,
the examination
of simple correlations
among measures designed to assess distinct functions (i.e., the bivariate approach)
can produce misleading conclusions
about the discriminant
validity of the measures.
For example,
in the schizophrenic
subjects, the pattern
of simple correlations
observed
between the two WCST factors, Perseveration
and Nonperseverative
Errors, and the memory composite could have come about for at least two reasons:
(1) the WCST factors could lack discriminant
validity (i.e., they may be tapping both
executive
and memory function);
and (2) the WCST factors could be specific
measures
of executive function,
but because schizophrenia
may produce
both
a correlation
between ordering
(executive
executive
and memory impairments,
function)
and memory
might
be expected.
Yet, even when these multiple
impairments
coexist in a disorder, the correlation
between the functions is less than
perfect. In other words, there are overlapping
and nonoverlapping
(i.e., dissociable)
components
of executive and memory function
in schizophrenia.
An important
challenge for neuropsychology,
then, is to find a method, richer than bivariate
analysis, to distinguish
between performance
on a nonspecific
test and correlated
impairments
in evaluating the discriminant
validity of a test.
195
Hierarchical multiple regression models offer one promising multivariate
approach to meeting this challenge. In our regression model, we could view the
correlation between the ordering composite and the memory composite as a
reflection of the overlapping executive and memory dysfunctions in schizophrenia.
When this correlation was removed from the memory composite, the residual
memory composite was able to reflect the component of memory function that was
dissociable from executive function. In effect, we removed the shared aspects of
memory and executive function that coexist in a disorder exhibiting both executive
and memory dysfunction. If the residual memory composite was uncorrelated with
the WCST factor, then we could conclude that the simple correlation between the
WCST factor and the nonresidualized memory composite arose from overlap between the functions within the disorder (i.e., from the covarying aspects of multiple
cognitive impairments). This was our conclusion about the Perseveration factor in
the schizophrenic patients. On the other hand, if the residual memory composite
(i.e., the aspect of memory function that was dissociuble from executive function)
were correlated with the WCST factor, then this WCST factor would be shown to be
sensitive to aspects of memory function that were dissociable from executive
function. Therefore, the factor would lack specificity, or discriminant validity, as an
executive measure. This was our conclusion about the Nonperseverative Errors
factor in schizophrenia. Thus, the contribution of multivariate hierarchical regression was to provide evidence about the validity and specificity of a test designed to
measure a specific function when applied to a disorder that causes multiple cognitive
and neurological impairments. Indeed, using tests without discriminant validity
could lead to erroneous conclusions suggesting the presence of “global,““diffuse,” or
“nonlocalizable” impairment when, in reality, such impairment comprises a limited
number of discrete deficits (cf. Sagar and Sullivan, 1988).
Depending upon the WCST score or factor that was analyzed and the group that
was considered, the WCST reflected dysfunction either relatively specific (e.g., the
Perseveration factor) or nonspecific (e.g., categories achieved). Similarly, Kersteen
(1989) reported that patients with DLPFC lesions (7 patients used in the present
study), as well as patients with temporal-parietal lesions, were impaired in categories
achieved and total errors on the WCST, whereas the former but not the latter group
showed impairment, relative to control subjects, in perseverative errors. Teuber and
Liebert (1958) also illustrated this point with a different test. Their patients with
penetrating head injuries involving the anterior aspects of the brain were selectively
impaired in setting a luminous line to the vertical when their bodies were tilted. By
contrast, patients with posterior brain injuries were selectively impaired in this task
when their field of vision was limited to a striped background set at various tilts.
Interestingly, when the results were scored taking starting position of the line to be
adjusted to the vertical into account, both head injury groups were impaired in the
body tilt condition. The conclusion drawn was that “...one and the same task could
be shown to reveal general (nonlocalizable) and specific (relatively localizable) effect
of brain injury” (p. 407), depending upon the analytic approach used. Thus, even in
studies investigating patients with localizable lesions, the selective effects of the
lesions can be elusive.
196
Construct
Validity
in Neuropsychology.
The analysis
used in this study
demonstrates
one approach that can be used to establish the construct validity of
neuropsychological
tests in populations
without
localized
brain
lesions.
In
populations
of patients with structurally
observable
localized lesions, there is an
obvious criterion against which to evaluate the validity of a test designed to measure
a localized brain function: patients with lesions in one brain region show impaired
performance
on the test relative to intact individuals
as well as to patients with
lesions in a different brain region. In patient groups without clearly delineated
structural
lesions but suspected
of having abnormal
brain function,
no single
criterion can serve to validate a neuropsychological
test. In the absence of observable
structural
or functional
brain abnormalities,
the inference
of localized
brain
pathology
from impaired performance
on neuropsychological
tests requires the
accumulation
of supporting
evidence that is necessarily
of an indirect nature. In
effect, neuropsychological
impairment
in psychiatric
populations
without demonstrable lesions takes on the status of a hypothetical
construct (Cronbach and Meehl,
1955). Inasmuch as constructs are not directly observable, their existence is inferred
from the pattern
of interrelationships
among
their hypothesized
observable
manifestations.
Since there is no set of necessary or sufficient conditions
(short of
discovering a localized brain lesion) that can establish the validity of a test measuring
a construct,
the validity must be established
by the “nomological
network”
of
associations
and nonassociations
that the test shows with other theoretically relevant
and irrelevant
variables. This is what Cronbach
and Meehl (1955) referred to as
construct validity. Demonstrating
patterns of convergent
and discriminant
validity
within and across data domains (e.g., psychometric,
behavioral,
physiological,
and
anatomical)
is a fundamental
approach
to the validation
of “neuropsychological
constructs.”
Acknowledgments.
The authors wish to thank Jody Rawles, Chung Nim Ha, Linda Davis,
Stacie DeMent, Maria Stein, Hubert Chen, and Dorothy Talbert for their careful attention to
detail in test construction,
administration,
and scoring; and Robert Heaton, Marlene OscarBerman, Margaret Rosenbloom,
and Paula K. Shear for valuable discussions and comments
on the manuscript.
Results from this study were presented at the International
Congress on
Schizophrenia
Research (Sullivan et al., 1991). This work was supported
by NIMH (MH30854), NIAAA (AA-05965), NARSAD,
NINDS (NS-21135) the Department
of Veterans
Affairs, and the Meyer Foundation.
References
American Psychiatric Association.
DSM-III-R: Diagnostic and Statistical Manualof Mental
Disorders. 3rd ed., revised. Washington,
DC: American Psychiatric Press, 1987.
Anderson,
S.W.; Damasio, H.; Jones, R.D.; and Tranel, D. Wisconsin Card Sorting Test
performance
as a measure of frontal lobe damage. Journal of Clinical and Experimental
Neuropsychology, 13:909-922, 199 1.
D.R. Physiological
dysfunction
of
Berman,
K.F.; Illowsky,
B.P.; and Weinberger,
dorsolateral
prefrontal
cortex in schizophrenia:
IV. Further evidence for regional and
behavioral specificity. Archives of General Psychiatry, 45:616-622, 1988.
Bornstein,
R.A. Contribution
of various neuropsychological
measures to detection
of
frontal lobe impairment. International Journal of Clinical Neuropsychology, 8: 18-22, 1986.
197
Braff, D.L.; Heaton, R.; Kuch, J.; Cullum, M.; Moranville, J.; Grant, I.; and Zisook, S. The
generalized pattern of neuropsychological
deficits in outpatients with chronic schizophrenia
with heterogeneous
Wisconsin Card Sorting Test results. Archives of General Psychiatry,
48:891-898, 1991.
Campbell, D.T., and Fiske, D.W. Convergent and discriminant validation by the multitraitmultimethod
matrix. Psychological Bulletin, 56:81-105, 1959.
Cohen, J., and Cohen, P. Applied Multiple Regression/Correlation
Analysis for the
Behavioral Sciences. Hillsdale, NJ: Lawrence Erlbaum, 1985.
Cooper,
J.A.; Sagar, H.J.; Jordan,
N.; Harvey, N.; and Sullivan,
E.V. Cognitive
impairment in early, untreated Parkinson’s disease and its dissociation from motor disability.
Brain, 114:2095-2122, 1991.
Corkin, S.; Cohen, N.J.; Sullivan, E.V.; Clegg, R.A.; Rosen, T.J.; and Ackerman,
R.H.
Analyses of global memory impairments
of different etiologies. Annals of the New York
Academy of Sciences, 444: 10-40, 1985.
Cronbach,
L.J., and Meehl, P.E. Construct validity in psychological
tests. Psychological
Bulletin. 52:281-302, 1955.
Crovitz, H.F., and Zener, K.A. Group test for assessing hand and eye dominance. American
Journal of Psychology, 75:271-276, 1962.
Dehaene, S., and Changeux, J.-P. The Wisconsin Card Sorting Test: Theoretical analysis
and modeling in a neuronal network. Cerebral Cortex, 1:62-79, 1991.
Drewe, E.A. The effect of type and area of brain lesion on Wisconsin Card Sorting Test
performance.
Cortex, 10:159-170, 1974.
Endicott, J., and Spitzer, R.L. A diagnostic interview: The Schedule for Affective Disorders
and Schizophrenia.
Archives of General Psychiatry, 35:837-844, 1978.
Eslinger, P.J., and Damasio, A.R. Severe disturbance
of higher cognition after bilateral
frontal lobe ablation: Patient EVR. Neurology, 35:1731-1741, 1985.
Fey, E.T. The performance
of young schizophrenics
and young normals on the Wisconsin
Card Sorting Test. Journal of Consulting Psychology, 15:311-319, 1951.
Folstein, M.F.; Folstein, S.E.; and McHugh, P.R. Mini-Mental
State: A practical method
for grading the cognitive state of patients for the clinician. Journal of Psychiatric Research,
12:189-198, 1975.
Freedman,
M., and Oscar-Berman,
M. Bilateral frontal lobe disease and selective delayed
response deficits in humans. Behavioral Neuroscience, 100:337-342, 1986.
Freund, G., and Ballinger, W.E., Jr. Loss of cholinergic muscarinic receptors in the frontal
cortex of alcohol abusers. Alcoholism, 12:630-638, 1988.
Frey, R.T.; Woods, D.L.; Knight, R.T.; Scabini, D.; and Clayworth,
CC. Defining
functional areas with averaged CT scans. Society for Neuroscience Abstracts, 1987.
Goldberg, T.E.; Kelsoe, J.R.; Weinberger, D.R.; Pliskin, N.H.; Kirwin, P.D.; and Berman,
K.F. Performance
of schizophrenic
patients on putative neuropsychological
tests of frontal
lobe function. International Journal of Neuroscience, 42151-58, 1988.
Goldberg, T.E.; Weinberger, D.R., Berman, K.F.; Pliskin, N.H.; and Podd, M.H. Further
evidence for dementia of the prefrontal type in schizophrenia?
A controlled study of teaching
the Wisconsin Card Sorting Test. Archives of General Psychiatry, 44:1008-1014, 1987.
Grant, D.A., and Berg, E.A. A behavioral analysis of degree of reinforcement
and ease of
shifting to new responses in a Weigl-type card-sorting
problem. Journal of Experimental
Psychology, 38:404-411, 1948.
Gur, R.C.; Saykin, A.J.; Benton, A.; Kaplan, E.; Levin, H.; Kester, D.B.; and Gur, R.E.
“Behavioral
imaging”-111.
Interexpert
agreement
and
reliability
of weightings.
Neuropsychiatry, Neuropsychology, and Behavioral Neurology, 3: 113-124, 1990a.
Gur, R.C.; Saykin, A.J.; Muenz, L.R.; Trivedi, S.S.; and Gur, R.E. Response to Yeo et al.‘s
critique
of behavioral
imaging.
Neuropsychiatry,
Neuropsychology,
and Behavioral
Neurology, 3:304-312, 19906.
Harper, C.G., and Kril, J.J. Neuropathology
of alcoholism. Alcohol & Alcoholism, 25:207216, 1990.
198
Heaton, R.K. Wisconsin Card Sorting Test Manual. Odessa, FL: Psychological
Assessment
Resources, Inc., 198 1.
Heck, E.T., and Bryer, J.B. Superior sorting and categorizing
ability in a case of bilateral
frontal
atrophy:
An exception
to the rule. Journal of Clinical and Experimental
Neuropsychology, 8:313-3 16, 1986
Heinrichs,
R.W. Variable associated with Wisconsin Card Sorting Test performance
in
neuropsychiatric
patients
referred
for assessment.
Neuropsychiatry,
Neuropsychology,
Behavioral Neurology, 3: 107-l 12, 1990.
Jones-Gotman,
M. Memory for designs: The hippocampal
contribution.
Neuropsychologia,
24: 193-203, 1986.
Kersteen,
Z.A. Behavioral
and neuroanatomical
aspects of priming and motor skill
learning. Unpublished
doctoral dissertation,
1989.
Kersteen-Tucker,
Z., and Knight, R.T. Cortical lesions dissociate short and long-term
components
of repetition priming. Society for Neuroscience Abstracts, 1989.
Kolb, B., and Whishaw, I.Q. Performance
of schizophrenic
patients on tests sensitive to left
or right frontal, temporal, or parietal function in neurological
patients. Journal of Nervous
and Mental Disease, 17 1:435443, 1983.
Lishman, W.A. Alcohol and the brain. British Journal of Psychiatry, 156:635-644, 1990.
Malmo, H.P. On frontal lobe functions: Psychiatric patient controls. Cortex, 10:231-237,
1974.
Matarazzo,
J.D. Psychological
assessment versus psychological
testing: Validation from
Binet to the school, clinic, and courtroom. American Psychologist, 45:999-1017, 1990.
Meehl, P.E. High school yearbooks:
A reply to Schwartz.
Journal of Abnormal
Psychology, 77:143-148, 1971.
Milner, B. Psychological deficits produced by temporal-lobe
excision. Research Publication
of the Association for Research of Nervous and Mental Diseases, 361244-257, 1958.
Milner, B. Effects of different brain lesions on card sorting. Archives of Neurology, 9:90100, 1963.
Milner, B. Interhemispheric
differences in the localisation
of psychological
processes in
man. British Medical Bulletin, 271272-277, 197 1.
Milner, B.; Corsi, P.; and Leonard, G. Frontal-lobe
contribution
to recency judgments.
Neuropsychologia, 29:60 l-6 18, 199 1.
Milner, B.; Petrides, M.; and Smith, M.L. Frontal lobes and the temporal organization
of
memory. Human Neurobiology, 4: 137-142, 1985.
Nelson, H.E. National Adult Reading Test, NART Windsor: Nelson Publishing Company,
1982.
Paivio, A.; Yuille, J.C.; and Madigan, S.A. Concreteness,
imagery, and meaningfulness
values for 925 nouns. Journal of Experimental Psychology, 76: l-24, 1968.
Pakkenberg,
B. Post-mortem
study of chronic schizophrenic
brains. British Journal of
Psychiatry, 151~744-752, 1987.
Parsons,
O.A.; Butters, N.; and Nathan, P.E., eds. Neuropsychology of Alcoholism:
Implications for Diagnosis and Treatment. New York: Guilford Press, 1987.
Pedhazur, E.J. Multiple Regression in Behavioral Research. 2nd ed. Fort Worth, TX: Holt,
Rinehart and Winston, 1982.
Petrides, M., and Milner, B. Deficits on subject-ordered
tasks after frontal- and temporallobe lesions in man. NeuropsychoIogia, 20:249-262, 1982.
Pfefferbaum,
A.; Lim, K.O.; Zipursky, R.B.; Mathalon,
D.M.; Rosenbloom,
M.J.; and
Sullivan, E.V. Brain gray and white matter volume loss accelerates with aging in chronic
alcoholics:
A quantitative
MRI study. Alcoholism: Clinical and Experimental Research,
16:1078-1089, 1992.
Ron, M.A. The brain of alcoholics:
An overview. In: Parsons, O.A.; Butters, N.; and
Nathan,
P.E., eds. Nemopsychology
of Alcoholism, Implications for Diagnosis and
Treatment. New York: Guilford Press, 1987. pp. 1 I-20.
199
Ryan, C., and Butters, N. Cognitive deficits of alcoholics. In: Kissin, B., and Begleiter, H.,
eds. l?hhePathology of Alcoholism. New York: Plenum Publishing Corp., 1983. pp. 485-538.
Sagar, H.J., and Sullivan, E.V. Patterns of cognitive impairment in dementia. In: Kennard,
C., ed. Recent Advances in Clinical Neurology. Vol. 5. Edinburgh:
Churchill Livingstone,
1988. pp. 47-86.
Shimamura,
A.P.; Jernigan, T.L.; and Squire, L.R. Korsakoff’s syndrome: Radiological
(CT) findings and neuropsychological
correlates. Journal of Neuroscience, 8:4400-4410, 1988.
Spitzer, R.L.; Endicott, J.; and Robins, E. Research Diagnostic Criteria (RDC). New York:
New York State Psychiatric Institute, 1975.
Sullivan, E.V.; Mathalon, D.H.; Zipursky, R.B.; and Pfefferbaum,
A. The validity of the
Wisconsin Card Sorting Test factors as measure of dorsolateral prefrontal cortical function in
schizophrenia
and alcoholism. Schizophrenia Research, 41394, 1991.
Sullivan, E.V., and Sagar, H.J. Nonverbal recognition and recency discrimination
deficits
in Parkinson’s disease and Alzheimer’s disease. Brain, 112: 1503-1517, 1989.
Tarter, R.E. An analysis of cognitive deficits in chronic alcoholics. Journal of Nervous and
Mental Disease, 157:138-147, 1973.
Teuber, H.-L., and Liebert, R.S. Specific and general effects of brain injury in man.
Archives of Neurology & Psychiatry, 80:403407, 1958.
Thorndike,
E.L., and Lorge, I. 7’he Teacher’s Word Book of 30,000 Words. New York:
Teachers College, Columbia University, 1944.
Wechsler, D. Wechsler Adult Intelligence Scale-Revised: Manual. Cleveland, OH: The
Psychological
Corporation,
198 1.
Wechsler,
D., and Stone, C.P. Wechsler Memory Scale. New York: Psychological
Corporation,
1945.
Weinberger,
D.R.; Berman,
K.F.; and Illowsky,
B.P. Physiological
dysfunction
of
dorsolateral
prefrontal
cortex in schizophrenia:
III. A new cohort and evidence for a
monoaminergic
mechanism. Archives of General Psychiatry, 45:609-615, 1988.
Weinberger,
D.R.; Berman, K.F.; and Zec, R.F. Physiologic dysfunction
of dorsolateral
prefrontal
cortex in schizophrenia:
I. Regional cerebral blood flow evidence. Archives of
General Psychiatry, 43~114-125, 1986.
Wiggins, J.S. Personality and Prediction: Principles of Personality Assessment. Reading,
MA: Addison-Wesley
Publishing Company, 1980.
Yates, W.R.; Swayze, V.W. II; and Andreasen, N.C. Neuropsychological
effect of global
and focal cerebral
atrophy
in schizophrenia.
Neuropsychiatry,
Neuropsychoiogy,
and
Behavioral Neurology, 3:98-106, 1990.
Yeo, R.A.; Turkheimer,
E.; and Bigler, E.D. Neuropsychological
methods of localizing
brain dysfunction:
Clinical versus empirical approaches. Neuropsychiatry,
Neuropsychology,
and Behavioral Neurology, 3:290-303, 1990.
Zipursky, R.B.; Lim, K.O.; Sullivan, E.V.; Brown, B.W.; and Pfefferbaum,
A. Widespread
cerebral gray matter volume deficits in schizophrenia.
Archives of General Psychiatry, 49: 195205, 1992.