Document 208429

 How to Write a Scientific Article:
From First Draft to Publication
Su-­‐Ling Yeh and Art Woodward Department of Psychology National Taiwan University Preface
The main purpose of this manual is to provide writing instruction for researcher
assistants and graduate students who spent much time conducting research, but have
little time or training to write a manuscript in order to submit it for publication – even
though it is well recognized that publication is the key for success in academia. A
second purpose is to help busy advisors by increasing the efficiency of their advising
activities, not only during manuscript writing, but also during revision after receiving
reviews from a journal.
By filling out the checklists as you follow the guidelines and procedures, you
will be able to write your own ideas in a logical step-by-step way, using the numerous
research articles that are included as “living examples” to imitate. Although the
example papers in this manual are mainly perception research, we hope they will be
useful in other related fields as well.
In the fall semester of 2009, we taught a course named “Scientific Writing in
Perception Research” in the Psychology Department at the National Taiwan
University. It was meant to be a PI-centered lab writing course, after the students have
taken two previous courses that emphasize more general-level basic English writing
skills. In this course, we developed the basic structure of this manual. The course
continued for three semesters under the direction of Professor Yeh, with the second
semester emphasizing writing a new article and the third semester, revision. Thus we
developed two separate checklists and instructions for these two important
components of getting an article accepted in a peer-review journal. We used both
checklists for instructions in the fourth semester before finalizing this version of the
manual.
We express our gratitude to Kuan-Ming Chen and San-Yuan Lin who were the
teaching assistants for these courses. Without their technical support, this manual
would not be possible. We also thank the professors and students in the writing
courses who provided feedback useful for improvement of the manual.
Enjoy, and we hope you have a fruitful writing experience.
Su-Ling Yeh and Art Woodward
2011, 7, 27
Please use ‘Bookmarks’ and red underlined links to navigate. Part I: Checklists Checklist for New Article
Please print out this checklist and check each box when you have begun each step
Title of Article ________________________________________________________
Authors _____________________________________________________________
Date begun ___________________________________________________________
☐ Target journal 1 ______________________________________________
☐ Review instructions to authors (insert the link)
___________________________________________________________
☐ Target journal 2 ______________________________________________
☐ Review instructions to authors (insert the link)
___________________________________________________________
☐ Select the category of importance of the paper.
It belongs to the category of __________
0. Overall structure
☐ Prepare figures (procedure and data) and tables for writing and communication with peers
(be precise on correctness, ignore artistic aspects).
☐ Write the 4 Ws for the paper and the title
(W1 = Why was the research done; W2 = What was done; W3 = What was discovered; W4 = Why is the discovery
important?)
Discuss with advisor. Date: ___________________
A. First draft (follow the first draft writing tips)
☐ For each experiment separately and repeat for multiple experiments:
☐ Write the W1 for the introduction of experiment.
☐ Write the method section. Do not edit except to possibly insert some details
not available during writing.
☐ Write the results section.
☐ Write the discussion section.
Date completed________________
☐ Write the General Discussion.
Suggestion: write only the topic sentences and check with advisor before writing the
paragraphs. Add headings if the GD is long.
☐ Write the Introduction – make the introduction mainly introduce what you wrote in the
discussion section.
Suggestion: write only the topic sentences and check with advisor before writing the
paragraphs.
☐ Write the general conclusion section and cut, paste, and revise to make the abstract.
☐ Highlight the 4 Ws and give the first draft to one peer.
(Peer name____________________)
Date completed ____________________
B. Second draft
☐ Follow the second draft tips and revise the entire paper.
Date completed ____________________
C. Third draft
☐ Do (imagined) audience assessment (highlight all terminologies and ask yourself if the
audience would understand).
☐ Check completeness and decoding time for all tables and figures.
☐ Revise and complete the title.
☐ Check accuracy of references and cross check with citation in the text.
☐ Verify APA style is correct.
☐ Verify the consistency of the terms used (name of effect, factor or the proposed
hypothesis)
Note: These details, though not directly related to the research, they will create an impression of completeness of this
article for the editor or the reviewers.
Date completed _____________________
D. Give the third draft to your advisor
☐ Highlight the 4 Ws and with your advisor compare (explicitly) to the four examples.
(W1 = Why was the research done; W2 = What was done; W3 = What was discovered; W4 = Why is the discovery
important?)
☐ Submit to Papercheck.com (including statement requesting excellent editor)
☐ Check the files with the advisors that all of them are ready to submit.
☐ Submit to target journal __________________________________
Action editor: _________________
ID No.: _________________
Date of submission: _______________
Checklist for Revision
The date for receiving the decision letter: ______________________________
Title: ___________________________________________________________
Journal: _________________________________________________________
Editor: __________________________________________________________
Reviewer: _______________________________________________________
1. Read Editor’s message:
□ Accept
Congratulations! Go celebrate!
□ Minor revision
Follow the suggestion without spending too much time on considering
what to do
□ Major revision
□ Reject but can be resubmitted
□ Reject
2. Read the opinions of reviewers:
□ Highlight the key points in the decision letter. Save it to
‘JournalName_decision_marked.doc’
□ Discuss with other authors about the next step (link to manuscript triage)
□ Do the “divide and conquer” procedure
3. Divide and Conquer:
□ Save the results of ‘Divide and Conquer’ to another file, named as
‘JournalName_ Divide and Conquer.doc’
□ Using review mode to edit the text
□ Delete sentences that are comments or descriptions
□ Keep sentences with question marks, and is an obvious demand
□ Deal with major comments
□ Deal with minor comments (statistics, method descriptions…)
□ Do as reviewer suggested
□ Appropriately use footnote
□ Not do as reviewer suggested
□ Provide counter evidence
4. Write the cover letter and responses to reviewers,
□ Edit the ‘JournalName_decision_marked.doc’ to become cover letter and
responses to reviewers
□ Mark the text you changed in the revision (mark the changes in red as is
shown here)
□ Check the consistency between the letter/responses and the revision
5. Before submission, check How to respond appropriately):
□ Responded completely (back and forth between text marked in red and the
responses to reviewers)
□ Responded politely
□ Responded with evidence
□ Send the changed text for papercheck.com (only the changed part): specify to the
editor that this is a re-submission (e.g., “This is a major revision of a previous
manuscript that was submitted to a top-tier journal: XXX. I would like to have
the same editor with a Doc ID XXX to review this paper.”)
□ Write the cover letter to specify what was answered, and what was not (rebuttal),
according to the divide and conquer procedure.
Submitted to __________________________________
Action editor: _________________
ID No.: _________________
Date of submission: _________________
Part II: Guidelines for Writing Inside-out Procedure for Writing an Article
Su-Ling Yeh and J. Arthur Woodward
National Taiwan University
Department of Psychology
November 2009
Revised: June 2010
Note 1: Consult with your advisor at any time the need arises.
Note 2: Cross-referenced articles are attached as examples of each section of an article,
along with papers illustrating where and how the 4 Ws are written into an article.



Select target journal 1 and target journal 2.
Download the journal policy and instructions to authors. Read them.
Check the example of APA style APA style example
Follow Inside out procedure (do not edit until the entire first draft is completed)
In writing the first draft of each section, open one of the articles provided in this
document (e.g., the most similar to your paper). If necessary, find another article
you are familiar with that is suitably similar to your paper. Then use it as a
model for what to say first, second, etc, at what level, and in what amount. To
the extent it is useful, transfer the schema to your paper.
A. First draft (Follow the first draft writing tips.) Steps First draft writing tips
1) For each experiment separately, write (a) through (c):
a) Write a first draft of the method section. Do not edit except to possibly
insert some details not available during writing.
b) Write a first draft of the results section (including tables and figures).
c) Write a first draft of the discussion section.
2) Now stop and write the 4 Ws for the paper and the title. 4W1 4W2 W4 Titles
3) Write general discussion.
4) Review with at least one peer.
5) Write the general introduction – make the introduction mainly introduce what you
wrote in the discussion section.
6) Write the general conclusion section and (mainly) cut and paste the abstract.
B. Second draft Second draft writing tips
Follow the second draft tips and revise the entire paper.
C. Third draft
1) Do the (imagined) audience assessment (For example, highlight all technical
terms and decide if your audience would understand them as written.)
2) Do audience assessment to check completeness and decoding time for all tables
and figures.
7
3) Finish the title. Titles
D. Give the third draft to your advisor (the English should be sloppy)
1) Highlight the 4 Ws; author and advisor compare (explicitly) with the four
examples. 4Ws1 4Ws2
2) Submit to Papercheck.com.
(Be sure to include the statement to the editors of paper check.)
For example, modify the following example:
(This paper will be submitted to a top-tier journal: “Attention Perception &
Psychophysics” for brief report. In a previous review by a journal, the editor
requested more proof reading to get the English as c orrect as possible. He wrote “If
possible, the authors should seek the editing help of a professional English language
editor to correct and smooth usage. Please assign a senior editor familiar with
editing for top tier journals.)
3) Submit to journal.
~ End of Guidelines ~
8
Appendices
Steps for Writing the First Draft
Note 1: Use these guidelines and cross references to the attached articles:
(1) (C-Y): Chen & Yeh;
(2) (Th): Theeuwes, et al;
(3) (Ca): Cavanagh, et al;
(4) (M-E): Moore and Egeth; and
(5) (R):Rowe
to follow the steps from Part A above.
Note 2: Use the highlighted sections from attached articles to see examples of
how to write the 4 Ws into your article.
4W1 4W2
Experiment 1
Introduction to Experiment 1
1) Write the main purpose of this particular experiment (w1).
Th1 Th2 M-E
2) Critical manipulation needed to answer w1 (w2).
M-E
3) Do not list details of method or procedure unless it is necessary.
Ca
Method for Experiment 1
1) Write the details of experiment 1 so that others could replicate the
experiment (participants, stimuli, apparatus, design, and experimental
procedure including practice trials).
See for examples: Th M-E C-Y
2) Complete the figure to show the procedure from the point of view of the
participant.
M-E C-Y POV
Results for Experiment 1
1) Statistics (e.g., ANOVA) should be stated as supplements to main findings.
Th
2) Do not list unnecessary statistics (e.g., irrelevant high-level interactions); if
possible, condense them into one sentence, uses Fs > xx, ps < xx for
summary.
Th
3) Do not list the non-significant results unless it is critical.
M-E
4) Complete the tables for Experiment 1.
Discussion for Experiment 1
1) State the main finding first (w3).
Th1 Th2 M-E
2) Talk about possible confounding before concluding this experiment, and
provide a transition to the next experiment (not necessarily placed here; it
could be in the introduction of the next experiment too).
M-E Th
9
Return to Inside-out Procedure
Return to Checklist for New Article
3) Mention the other findings but do not stress too much if irrelevant to W1.
4) Note the difference between small w and big W. Ideally each experiment
should have their own 4 small ws.
Repeat the above for each experiment
General Discussion
Opening paragraph of the general discussion
1) State the research purpose and summarize the main findings (short paper
may go directly to the main findings).
C-Y Th
2) Combine the purpose and finding of each experiment in short sentences.
Th
3) This is the 3rd Ws, but not the 4th one (which should appear later).
4)
5)
6)
7)
Middle paragraph of the general discussion
Optional discussion of limitations and scope.
R
Relate to literature.
R
Final paragraph of general discussion
Write mainly the 4th W.
M-E
If possible, end with a conclusion that fits the topic sentence of the first
paragraph in the introduction.
R C-Y
General Introduction
Opening two paragraphs of the Introduction
1) Write the first paragraph in a general tone, better come with a daily example.
(general and familiar).
C-Y Ca R
2) If possible, write the research purpose in the first paragraph, and then
elaborate it in the following paragraphs.
C-Y
Additional topics to be covered in the introduction
3) Cover those that mentioned in the general discussion.
10
Return to Inside-out Procedure
Return to Checklist for New Article
4) The scope should be from general to specific, in a reverse order as in the
general discussion.
5) Write 1st, 2nd Ws about here.
Final paragraph of the introduction
8) Write an oververview of the experiments.
Ca
9) Write predictions, if applicable.
Ca
General Conclusion (conclusion section is optional)
The general conclusion section should include
1) Cover the 1st, 3rd, and 4th Ws here in a short paragraph.
Ca
2) If possible, end with a sentence that is related to the first paragraph and/or
the title.
Ca
Return to Inside-out Procedure
Return to Checklist for New Article
11
Describing the experimental procedure and creating figures
The most important principle in describing the experimental procedure, including tables and figures is this:
Describe the procedure from the point of view of the participant
Participant POV
1) Describe in words what the participants actually saw, in the order they saw it. Different
experimental conditions should be included so the critical differences in what the participants
experienced in the different conditions will be clear.
2) Describe in an accompanying figure a representation of what the participant actually saw and
when they saw it.
a) The figure should show the actual stimuli if possible, but usually they will be schematic
representations of what they saw. For example, when the stimuli are white on a black
background this consumes too much journal ink, so the background can be made white
and the figure black, with an accompanying explanation;
b) The figure should show the sequence and time line;
c) All symbols in the table should be defined (there should be no undefined symbols)
d) The labels and figure title should contain the units (inches, seconds, average milliseconds etc);
e) Different experiment conditions should be represented in the figure using the same names
of the independent variables as used in the text;
f) The dependent variables should be referred to by the same names as in the text;
g) The figure caption should closely match the words in the text section that refers to the
figure;
h) In figures and tables with several independent variables see the following examples form
published article (adopted from Fortin et al., 2010);
Figure 7. Experiment 4. (a) Mean reaction times to the
second stimulus as a function of memory set size in
switch and no-switch trials. Error bars represent the
SEM computed with the MSE.
Return to Inside-out Procedure
12
The 4 Ws
Art Woodward
Fall 2009
The four Ws are:
W1: Why did you do the research?
W2: What did you do? (Note this is a verbal explanation of what was done that
appears in the introduction)
W3: What did you discover?
W4: Why is the discovery important? (Note in answering this question, do not
simply restate the discovery. Instead,
explain why the discovery is important.)
Five articles are used to illustrate the 4 Ws:
(1) (S): Shams, L., Kamitani, Y., & Shimojo, S. (2000). What you see is what you
hear. Nature, Vol. 408, pp.788.
W1 W2 W3 W4(1) W4(2)
(2) (T-E): Tong, F. H., & Engel, S. A. (2001). Interocular rivalry revealed in the
human cortical blind-spot representation. Nature, 411, 195-199.
W1 W2 W3(1) W3(2) W4(1) W4(2) W4(3) W4(4)
(3) (S-K): Saxe, R. & Kanwisher, N. (2003). People thinking about thinking people:
The role of the temporo-parietal junction in theory of mind. NeuroImage. 19
1835-1842
W1 W2(1) W2(2) W2(3) W2(4) W3(1) W3(2) W4(1) W4(2)
(4) (W-K): Wojciulik, E., & Kanwisher, N. (1999). The Generality of Parietal
Involvement in Visual Attention. Neuron. 23 747-764
W1(1) W1(2) W2 W3 W4(1) W4(2) W4(3)
(5) (R): Rowe, G., Hirsh, J.B., & Anderson, A.K., (2006). Positive affect increases
the breadth of attentional selection. Proceedings of the National Academy of
Science, 104(1): 383-8.
W1 W2(1) W2(2) W2(3) W3(1) W3(2) W3(3) W4
Return to Inside-out Procedure
Return to Steps for Writing 1st Draft
Return to Checklist for New Article
1
13
First draft Writing Tips
Please do NOT do these while writing the first draft:
Do not insert any un-needed details like names, dates, etc, unless automatic.
1) Do not try to improve the English as you write – write in Chinese-English.
2) If you don’t know the English word, insert Chinese characters and move on.
3) Do not edit your first draft as you write. Complete the entire first draft before editing.
Instead, please DO these things as much as you can (even if it slows you down):
Before you write, spend 5 minutes planning what you will write.
Think about:
(1) An attractive title;
G
(2) Intro: Write General
Specific
and Familiar Not Fam.
Intro
Discussion: Do the reverse and
Step 1
Cross reference
Step 2
(Do not forget the 4 Ws)
F
Not F
Step 3
Attempt to take two steps
Gen.
Discussion
S
(3) Pause and think hard about the first sentence in each paragraph (called the topic
sentence). The topic sentence should name the topic of the paragraph and state the
controlling idea (see example on the next page);
(4) Concentrate on conciseness (use no unnecessary words and do not repeat);
(5) Concentrate on cohesion (connect the sentences logically);
(6) Concentrate on coherence (make all sentences in a paragraph be about the same topic
and obey the controlling idea of the topic sentence);
(7) Use headings to help the reader identify the different main topics in your document;
(8) Take the perspective of the reader (do not use words the reader may not understand; do
not expect readers to remember what they read on previous pages.)
14
Return to Inside-out Procedure
Practice for topic sentences and controlling idea
Use bubbl.us (http://bubbl.us/)
Write clear and well defined topic sentences. See the following examples.
1. There are three reason NTU is the best university in Taiwan.
Topic: _NTU is the best university in Taiwan_;
Controlling idea: _three reasons it’s the best__;
Give some specific examples of the controlling idea:
2. The food in Taiwan is unique in East Asia because of the special spices used.
Topic: __________________________________;
Controlling idea: __________________________;
Give some specific examples of the controlling idea:
3. Taiwanese food is unique in East Asia because of the many fresh ingredients.
Topic: __________________________________;
Controlling idea: __________________________;
Give some specific examples of the controlling idea:
Return to Inside-out Procedure
15
Get off to a good start with W1
Make sure sentences similar to these examples appear in the first several paragraphs of
your paper. Before submitting to a journal, review explicitly your 4 Ws in the paper by
comparing them to the examples below.
Example 1: Shams, Nature (ISI; 12/3/09;144)
“Vision is believed to dominate our multi-sensory perception of the world. Here we overturn this
established view by showing that auditory information can qualitatively alter the perception of an
unambiguous visual stimulus to create a striking visual illusion.”
Example 2: Tong & Engel. Nature (ISI; 12/3/09;155)
“Despite extensive research, the neural basis of binocular rivalry has remained highly
controversial. Specifically, it is debated whether discrepant monocular patterns rival because of
interocular competition or pattern competition. To resolve this issue, we used functional magnetic
resonance imaging (fMRI) to monitor rivalry-related activity in a monocular region of human V1
corresponding to the blind spot.”
Example 3: Saxes and Kanwisher, NeuroImage (ISI; 12/3/09; 213)
“The remarkable human facility with social cognition depends on a fundamental ability to reason
about other people. Specifically, we predict and interpret the behavior of people based on an
understanding of their minds: that is, we use a “theory of mind.” In this study we show that a
region of human temporo-parietal junction is selectively involved in reasoning about the contents
of other people’s minds.”
Example 4: Wojciulik and Kanwisher, Neuron (ISI; 12/3/09; 273)
“However, very little work has been directed to the crucial question of whether visual attention in
fact consists of a single general-purpose mechanism or whether it instead consists of a
heterogeneous set of different mechanisms, each involved in a different kind of selection. In the
present study, we used functional magnetic resonance imaging (fMRI) to address this question,
asking whether there is any region of the human brain that is activated by each of three very
different attention-requiring tasks yet not activated by a language task that is difficult but does not
place heavy demands on visual attention.”
Return to Inside-out Procedure
Return to Steps for Writing the 1st Draft
16
Categories of importance for documenting W4
Category of Importance
Develop a new theory or model
Example
Modify a famous theory or model


Rummelhart D.E. and McClelland J. L.
(Distributed Processing Models)
David Marr. (Level of analysis; computational
model of visual processing)
Young, T., and Helmholtz, H. (Trichromatic
Theory of Color Vision)
Gibson, J. J. (Direct perception)
Wolfe J.M. (Guided Search)
Overturn a widely held belief

Shams, L (Double Click Illusion)
Describe or explain a puzzling mystery

Develop a new method or paradigm



Ramachandran V.S. (e.g., explains phantom
pain)
“In this paper we present a new model for…”



“Here we overturn the widely held belief that…”
Discover a new phenomenon and rule out
alternative explanations

Pearl J. (Bayes Net Models)
Lauterbur, P., and Mansfield, P. (Magnetic
Resonance Imaging): they applied MR physics to
develop MRI scanner for human body; they won
a Nobel prize in Physiology/ Medicine, 2003.
Egly et al., 1994 (The double rectangle cueing
paradigm that can probe location- and objectbased attention in a single experimental
framework)
Tsuchiya and Koch (Continuous Flash
Suppression)
Kanwisher, N. (Repetition Blindness)
Resolve an existing debate or controversy

Tong, F.H. and Engle S.A. (Binocular Rivalry)
Discover the underlying connection
between two existing phenomena

Theory that influences other disciplines

Maxwell’s equations in Electronic and Magnetic.
[There may be no integral theories like this in
psychology]
Kahnmann & Tversky (Prospect Theory): Two
psychologists won the Nobel prize in economics
Useful for practical applications

“This study presents a new paradigm for..”


“This is the first study to …”
“Thus our study contributes to resolving the
controversy about…”
Karremans, Stroebe, & Claus, 2006 (Subliminal
priming that can be used in movies or commercial
advertisements)
Return to Inside-out Procedure
Return to Checklist for New Article
17
Some Ideas for Good Titles
A good title will attract the attention of potential readers, arouse their curiosity, and
correctly prime them for the content of the article.
1) Some guidelines and suggested characteristics are:
a) should contain familiar (older) information;
b) may contain a clear statement of the main finding “Conditions where implicit and
explicit cross-modal information are processed independently”
c) may contain a question that will be answered by the research, so curiosity is
aroused “Can implicit and explicit cross-modal sensory information be processed
independently”
d) may be a “play on words” or a metaphor, “What you hear is what you see”
e) The title could be some reflection of the first W, third W, or fourth W
2) Examples: Which of the above characteristics apply to the following real article titles?
“What you see is what you hear”
“Interocular rivalry revealed in the human cortical blind-spot representation”
“People thinking about thinking people: The role of the temporo-parietal junction in
theory of mind.”
“The Generality of Parietal Involvement in Visual Attention”
Return to Inside-out Procedure
18
Top-Down Second Draft Writing Tips
Creating your second draft
1) Revise your title if you can make it more attractive and interesting to your
audience.
2) Look at all your heading; add and remove headings as necessary;
3) Paragraph coherence:
a) look at each topic sentence and decide if it needs to be made more clear
for the reader;
b) Make sure all the sentences in the paragraph match the topic sentence. If
they don’t remove them, or create another paragraph.
4) Sentences
a) Re write any run on sentences;
b) Combine any sets of too short sentences;
c) Insert any missing details, such as dates, names, or English words you did
not know.
Note: You do not need to improve the English grammar very much. Let the editor show
you how to do that so you can practice revising according to professional edits. However
you do need to be sure the writing is concise, cohesive, and coherent. The editors at
Papercheck.com cannot improve the quality your thinking.
Here are some useful sites for practice concise, cohesive, and coherent:
Concise
http://grammar.ccc.commnet.edu/grammar/quizzes/wordy_quiz.htm
http://grammar.ccc.commnet.edu/grammar/quizzes/nova/nova8.htm
http://grammar.ccc.commnet.edu/grammar/quizzes/nova/nova11.htm
Cohesion
http://www.uefap.com/writing/exercise/parag/refer.htm#top
Coherence
http://lrs.ed.uiuc.edu/students/fwalters/cohex3d.html
Return to Inside-out Procedure
Return to Checklist for New Article
Some Useful Information When Having the Paper Edited
1.
Have the paper edited (sent to papercheck.com) after everything is ready (references,
figures, tables included)
2.
If you have a preferable editor, add the following: “Please have the editor who
edited document XXXXXX (ID) work on this document".
3.
Examples: “This paper will be submitted to a top-tier journal: xxx.” If applicable,
add: “In a previous review by a journal, it was suggested that we seek the editing
help of a native English speaker to correct and smooth usage.” Or “This is the final
version and we will submit the manuscript after revising according to your English
edits.”
Return to Inside-out Procedure
20
Figure 2.1. Sample One-Experiment Paper (The numbers refer to numbered
sections in the Publication Manual.)
Running head: EFFECTS OF AGE ON DETECTION OF EMOTION
1
Establishing a title, 2.01; Preparing the
manuscript for submission, 8.03
Effects of Age on Detection of Emotional Information
Christina M. Leclerc and Elizabeth A. Kensinger
Boston College
Formatting the author name (byline) and
institutional affiliation, 2.02, Table 2.1
Elements of an author note, 2.03 Author Note
Christina M. Leclerc and Elizabeth A. Kensinger, Department of Psychology,
2
EFFECTS OF AGE ON DETECTION OF EMOTION
Boston College.
Abstract
Writing the abstract, 2.04
This research
arch was supported by National Science Foundation Grant BCS 0542694
Age differences were examined in affective processing, in the context of a visual search task.
awarded to Elizabeth
beth A. Kensinger.
Young and older adults were faster to detect high arousal images compared with low arousal and
Correspondence
ndence concerning this article should be addressed to Christina M. Leclerc,
neutral items. Younger adults were faster to detect positive high arousal targets compared with
Department of Psychology,
sychology, Boston College, McGuinn Hall, Room 512, 140 Commonwealth
other categories. In contrast, older adults exhibited an overall detection advantage for emotional
Avenue, Chestnut
ut Hill, MA 02467. Email: [email protected]
images compared with neutral images. Together, these findings suggest that older adults do not
display valence-based effects on affective processing at relatively automatic stages.
Keywords: aging, attention, information processing, emotion, visual search
Double-spaced manuscript,
Times Roman typeface,
1-inch margins, 8.03
Paper adapted from “Effects of Age on Detection of Emotional Information,” by C. M. Leclerc and E. A. Kensinger,
2008, Psychology and Aging, 23, pp. 209–215. Copyright 2008 by the American Psychological Association.
21
Return to Inside-out procedure
Figure 2.1. Sample One-Experiment Paper (continued)
EFFECTS OF AGE ON DETECTION OF EMOTION
3
Writing the introduction, 2.05
Effects of Age on Detection of Emotional Information
Frequently, people encounter situations in their environment in which it is impossible to
attend to all available stimuli. It is therefore of great importance for one’s attentional processes to
select only the most salient information in the environment to which one should attend. Previous
research has suggested that emotional information is privy to attentional selection in young
adults (e.g., Anderson, 2005; Calvo & Lang, 2004; Carretie, Hinojosa, Marin-Loeches, Mecado,
& Tapia, 2004; Nummenmaa, Hyona, & Calvo, 2006), an obvious service to evolutionary drives
Selecting to approach rewarding situations and to avoid threat and danger (Davis & Whalen, 2001; Dolan
the correct
tense, 3.18 & Vuilleumier, 2003; Lang, Bradley, & Cuthbert, 1997; LeDoux, 1995).
For example, Ohman, Flykt, and Esteves (2001) presented participants with 3 × 3 visual
Numbers
arrays with images representing four categories (snakes, spiders, flowers, mushrooms). In half
expressed
in words, the arrays, all nine images were from the same category, whereas in the remaining half of the
4.32
Ordering citations within
the same parentheses, 6.16
Numbers that represent
statistical or mathematical
functions, 4.31
arrays, eight images were from one category and one image was from a different category (e.g.,
Use of hyphenation for
compound words, 4.13,
discrepant
ant stimulus. Results indicated that fear
fear-relevant
r- relevant images were more quickly detected than Table 4.1
eight flowers and one snake). Participants were asked to indicate whether the matrix included a
elevant items, aand larger search facilitation effects were observed for participants who
fear-irrelevant
arful of the stimuli. A similar pattern of results has been observed when examining the
were fearful
EFFECTS OF AGE ON DETECTION OF EMOTION
4
n-grabbing
(includ ing those
attention-grabbing
nature of negative facial expressions, with threatening faces (including
Calvo & Lang, 2004; Carretie et al., 2004; Juth, Lundqvist, Karlsson, & Ohman, 2005;
nded to) identified more quickly than positive or neutral faces (Eastwood, Smilek, &
not attended
Nummenmaa
et 1988).
al., 2006).
e, 2001; Hansen
Merikle,
& Hansen,
The enhanced detection of emotional information is
From this
research,
it seems
that younger adults
show
benefits for
ited to threatening stimuli;
there
is evidence
thatclear
any high-arousing
stimulus
candetection
be
not limited
arousing of
information
environment.
It is lessvalenced
clear whether
these 2005;
effects
d rapidly, regardless
whether itinis the
positively
or negatively
((Anderson,
5 are preserved
detected
across the adult life span. The focus of the current research is on determining the extent to which
Continuity in presentation aging influences the early, relatively automatic detection of emotional information.
of ideas, 3.05
Regions of the brain thought to be important for emotional detection remain relatively
intact with aging (reviewed by Chow & Cummings, 2000). Thus, it is plausible that the detection
of emotional information remains relatively stable as adults age. However, despite the
preservation of emotion-processing regions with age (or perhaps because of the contrast between
the preservation of these regions and age-related declines in cognitive-processing regions; Good
et al., 2001; Hedden & Gabrieli, 2004; Ohnishi, Matsuda, Tabira, Asada, & Uno, 2001; Raz,
No capitalization in
naming theories, 4.16
2000; West, 1996), recent behavioral research has revealed changes that occur with aging in the
regulation and processing of emotion. According to the socioemotional selectivity theory
Citing one
work by six
or more
authors, 6.12
(Carstensen, 1992), with aging, time is perceived as increasingly limited, and as a result, emotion
regulation becomes a primary goal (Carstensen, Isaacowitz, & Charles, 1999). According to
socioemotional selectivity theory, age is associated with an increased motivation to derive
emotional meaning from life and a simultaneous decreasing motivation to expand one’s
knowledge base. As a consequence of these motivational shifts, emotional aspects of the
22
Return to Inside-out procedure
Figure 2.1. Sample One-Experiment Paper (continued)
EFFECTS OF AGE ON DETECTION OF EMOTION
5
Using the colon between
two grammatically
complete clauses, 4.05
To maintain positive affect in the face of negative age-related change (e.g., limited time
remaining, physical and cognitive decline), older adults may adopt new cognitive strategies. One
such strategy, discussed recently, is the positivity effect (Carstensen & Mikels, 2005), in which
older adults spend proportionately more time processing positive emotional material and less
time processing negative emotional material. Studies examining the influence of emotion on
memory (Charles, Mather, & Carstensen, 2003; Kennedy, Mather, & Carstensen, 2004) have
found that compared with younger adults, older adults recall proportionally more positive
information and proportionally less negative information. Similar results have been found when
examining eye-tracking patterns: Older adults looked at positive images longer than younger
adults did, even when no age differences were observed in looking time for negative stimuli
Capitalization of words
beginning a sentence after
a colon, 4.14
(Isaacowitz, Wadlinger, Goren, & Wilson, 2006). However, this positivity effect has not gone
uncontested; some researchers have found evidence inconsistent with the positivity effect (e.g.,
Grühn, Smith, & Baltes, 2005; Kensinger, Brierley, Medford, Growdon, & Corkin, 2002).
Hypotheses and their
correspondence to research
design, Introduction, 2.05
Based on this previously discussed research, three competing hypotheses exist to explain
motional processing associated with the normal aging process. First,
age differences in emotional
emotional informationn may remain important throughout the life span, leading to similarly
OFin
AGE
ON DETECTION
OF Second,
EMOTION
younger
and older adults.
with aging,
facilitated detection of emotionalEFFECTS
information
Using the semicolon to
separate two independent
6
clauses not joined
by
a conjunction, 4.04
emotional informationn may take on additional importance, resulting in older adults’ enhanced
rapidly detect emotional information. We hypothesized that on the whole, older adults would be
al information in their environment. Third, older adults may focus
detection of emotional
slower to detect information than young adults would be (consistent with Hahn, Carlson, Singer,
principally on positivee emotional information and may show facilitated detection of positive, but
& Gronlund, 2006; Mather & Knight, 2006); the critical question was whether the two age
nal information.
not negative, emotional
groups would show similar or divergent facilitation effects with regard to the effects of emotion
The primary goal in the present experiment was to adjudicate among these alternatives.
on item detection. On the basis of the existing literature, the first two previously discussed
ed a visual search paradigm to assess young and older adults’ abilities to
To do so, we employed
hypotheses seemed to be more plausible than the third alternative. This is because there is reason
Using the comma between
elements in a series, 4.03
Punctuation with citations
in parenthetical material,
6.21
to think that the positivity effect may be operating only at later stages of processing (e.g.,
strategic, elaborative, and emotion regulation processes) rather than at the earlier stages of
processing involved in the rapid detection of information (see Mather & Knight, 2005, for
discussion). Thus, the first two hypotheses, that emotional information maintains its importance
across the life span or that emotional information in general takes on greater importance with
age, seemed particularly applicable to early stages of emotional processing.
Indeed, a couple of prior studies have provided evidence for intact early processing of
emotional facial expressions with aging. Mather and Knight (2006) examined young and older
Citing references in text,
inclusion of year within
paragraph, 6.11, 6.12
adults’ abilities to detect happy, sad, angry, or neutral faces presented in a complex visual array.
Mather and Knight found that like younger adults, older adults detected threatening faces more
quickly than they detected other types of emotional stimuli. Similarly, Hahn et al. (2006) also
found no age differences in efficiency of search time when angry faces were presented in an
array of neutral faces, compared with happy faces in neutral face displays. When angry faces,
compared with positive and neutral faces, served as nontarget distractors in the visual search
Prefixes and
suffixes that
do not require
hyphens,
Table 4.2
arrays, however, older adults were more efficient in searching, compared with younger adults,
23
Return to Inside-out procedure
Figure 2.1. Sample One-Experiment Paper (continued)
EFFECTS OF AGE ON DETECTION OF EMOTION
7
negative stimuli were not of equivalent arousal levels (fearful faces typically are more arousing
than happy faces; Hansen & Hansen, 1988). Given that arousal is thought to be a key factor in
modulating the attentional focus effect (Hansen & Hansen, 1988; Pratto & John, 1991; Reimann
& McNally, 1995), to more clearly understand emotional processing in the context of aging, it is
necessary to include both positive and negative emotional items with equal levels of arousal.
In the current research, therefore, we compared young and older adults’ detection of four
categories of emotional information (positive high arousal, positive low arousal, negative high
arousal, and negative low arousal) with their detection of neutral information. The positive and
Prefixed words that
require hyphens,
Table 4.3
negative stimuli were carefully matched on arousal level, and the categories of high and low
arousal were closely matched on valence to assure that the factors of valence (positive, negative)
and arousal (high, low) could be investigated independently of one another. Participants were
presented with a visual search task including images from these different categories (e.g., snakes,
cars, teapots). For half of the multi-image
g arrays,
y , all of the images
g were of the same item,, and for
the remaining half of the arrays, a single target image of a different type from the remaining
Using abbreviations, 4.22; Explanation
of abbreviations, 4.23; Abbreviations
used often in APA journals, 4.25;
Plurals of abbreviations, 4.29
items was included. Participants were asked to decide whether a different item was included in
EFFECTS OF AGE ON DETECTION OF EMOTION
8
the array, and their reaction times were recorded for each decision. Of primary interest were
for the arousing items than shown by the young adults (resulting in an interaction between age
differences in response times (RTs)) based on the valence and arousal levels of the target
and arousal).
ung and older adults were equally focused on emotional
categories. We reasoned that if young
Method
information, then we would expectt similar degrees of facilitation in the detection of emotional
older adults were more affectively focused than
stimuli for the two age groups. By contrast, ifParticipants
Elements of the Method
section, 2.06; Organizing
a manuscript with levels
of heading, 3.03
adults (14 women, 10 men, Mage = 19.5 years, age range: 18–22 years) were
were younger adults, older adults should show eitherYounger
faster detection
speeds for all of the
recruited with flyers posted on the Boston College campus. Older adults (15 women, nine men,
utral items) than shown by young adults or greater facilitation
emotional items (relative to the neutral
Mage = 76.1 years, age range: 68–84 years) were recruited through the Harvard Cooperative on
Identifying
subsections
within the
Method
section, 2.06
Aging (see Table 1, for demographics and test scores).1 Participants were compensated $10 per
hour for their participation. There were 30 additional participants, recruited in the same way as
described above, who provided pilot rating values: five young and five old participants for the
assignment of items within individual categories (i.e., images depicting cats), and 10 young and
10 old participants for the assignment of images within valence and arousal categories. All
Using numerals to express
numbers representing age, 4.31
participants were asked to bring corrective eyewear if needed, resulting in normal or corrected
to normal vision for all participants.
Materials and Procedure
Participant (subject)
characteristics,
Method, 2.06
The visual search task was adapted from Ohman et al. (2001). There were 10 different
types of items (two each of five Valence × Arousal categories: positive high arousal, positive low
arousal, neutral, negative low arousal, negative high arousal), each containing nine individual
exemplars that were used to construct 3 × 3 stimulus matrices. A total of 90 images were used,
each appearing as a target and as a member of a distracting array. A total of 360 matrices were
presented to each participant; half contained a target item (i.e., eight items of one type and one
target item of another type) and half did not (i.e., all nine images of the same type). Within the
24
Return to Inside-out procedure
Figure 2.1. Sample One-Experiment Paper (continued)
EFFECTS OF AGE ON DETECTION OF EMOTION
9
matrix. Within the 180 target trials, each of the five emotion categories (e.g., positive high
arousal, neutral, etc.) was represented in 36 trials. Further, within each of the 36 trials for each
emotion category, nine trials were created for each of the combinations with the remaining four
other emotion categories (e.g., nine trials with eight positive high arousal items and one neutral
item). Location of the target was randomly varied such that no target within an emotion category
was presented in the same location in arrays of more than one other emotion category (i.e., a
negative high arousal target appeared in a different location when presented with positive high
arousal array images than when presented with neutral array images).
The items within each category of grayscale images shared the same verbal label (e.g.,
mushroom, snake), and the items were selected from online databases and photo clipart
Latin abbreviations, 4.26
Numbers expressed in words
at beginning of sentence, 4.32
packages. Each image depicted a photo of the actual object. Ten pilot participants were asked to
write down the name corresponding
to eachhead:
object;
any object
didON
notDETECTION
consistently generate
Running
R
EFFECTS
OFthat
AGE
OF EMOTION
10
the intended response was eliminated from the set. For the remaining images, an additional 20
selected such that the arousal difference between positive low arousal and positi
positive high arousal
pilot participants rated the emotional valence and arousal of the objects and assessed the degree
was equal to the difference between negative low arousal and negative high arou
arousal.
of visual similarity among objects within a set (i.e., how similar the mushrooms were to one
between
Similarity ratings. Each item was rated for within-category and between-categories
another) and between objects across sets (i.e., how similar the mushrooms were to the snakes).
similarity. For within-category similarity, participants were shown a set of exemplars
exem
(e.g., a set
Valence and arousal ratings. Valence and arousal were judged on 7-point scales (1 =
of mushrooms) and were asked to rate how similar each mushroom was to the re
rest of the
negative valence or low arousal and 7 = positive valence or high arousal). Negative objects
mushrooms, on a 1 (entirely dissimilar) to 7 ((nearly identical
identical)) scale. Participants made these
received mean valence ratings of 2.5 or lower, neutral objects received mean valence ratings of
ratings on the basis of overall similarity and on the basis of the specific visual di
dimensions in
3.5 to 4.5, and positive objects received mean valence ratings of 5.5 or higher. High arousal
which the objects could differ (size, shape, orientation). Participants also rated hhow similar
objects received mean arousal ratings greater than 5, and low arousal objects (including all
objects of one category were to objects of another category (e.g., how similar the mushrooms
neutral stimuli) received mean arousal ratings of less than 4. We selected categories for which
were to the snakes). Items were selected to assure that the categories were equate
equated on withinboth young and older adults agreed on the valence and arousal classifications, and stimuli were
category and between-categories similarity of specific visual dimensions as well as for the
Italicization of anchors
of a scale, 4.21
overall similarity of the object categories ((p
(pss > .20). For example, we selected pa
particular
h
ti l cats
t so that
th t the
th mushrooms
h
i il to
t one another as were the
mushrooms
andd particular
were as similar
cats (i.e., within-group similarity was held constant across the categories). Our object selection
also assured that the categories differed from one another to a similar degree (e.g., that the
mushrooms were as similar to the snakes as the cats were similar to the snakes).
Procedure
Each trial began with a white fixation cross presented on a black screen for 1,000 ms; the
matrix was then presented, and it remained on the screen until a participant response was
recorded. Participants were instructed to respond as quickly as possible with a button marked yes
if there was a target present, or a button marked no if no target was present. Response latencies
and accuracy for each trial were automatically recorded with E-Prime (Version 1.2) experimental
25
Return to Inside-out procedure
Figure 2.1. Sample One-Experiment Paper (continued)
EFFECTS OF AGE ON DETECTION OF EMOTION
11
software. Before beginning the actual task, participants performed 20 practice trials to assure
compliance with the task instructions.
Results
Elements of the
Results section, 2.07
Analyses focus on participants’ RTs to the 120 trials in which a target was present and
was from a different emotional category from the distractor (e.g., RTs were not included for
Abbreviations
accepted as
words, 4.24
arrays containing eight images of a cat and one image of a butterfly because cats and butterflies
are both positive low arousal items). RTs were analyzed for 24 trials of each target emotion
category. RTs for error trials were excluded (less than 5% of all responses) as were RTs that
Symbols, 4.45;
Numbers, 4.31
were ±3 SD from each participant ’s mean (approximately 1.5% of responses). Median RTs were
then calculated for each of the five emotional target categories, collapsing across array type (see
Table 2 for raw RT values for each of the two age groups). This allowed us to examine, for
example, whether participants were faster to detect images of snakes than images of mushrooms,
Nouns followed
by numerals or
letters, 4.17
regardless of the type of array in which they were presented. Because our main interest was in
examining the effects of valence and arousal on participants’ target detection times, we created
scores for each emotional target category that controlled for the participant’s RTs to detect
neutral targets (e.g., subtracting the RT to detect neutral targets from the RT to detect positive
high arousal targets). These difference scores were then examined with a 2 × 2 × 2 (Age [young,
older] × Valence [positive, negative] × Arousal [high, low]) analysis of variance (ANOVA). This
ANOVA revealed only a significant main effect of arousal, F(1, 46) = 8.41, p = .006, ηp2 = .16,
with larger differences between neutral and high arousal images (M = 137) than between neutral
and low arousal images (M = 93; i.e., high arousal items processed more quickly across both age
Reporting
p values,
decimal
fractions,
4.35
Statistical symbols,
4.46, Table 4.5
groups compared with low arousal items; see Figure 1). There was no significant main effect for
valence, nor was there an interaction between valence and arousal. It is critical that the analysis
Numbering and discussing
figures in text, 5.05
26
Return to Inside-out procedure
Figure 2.1. Sample One-Experiment Paper (continued)
EFFECTS OF AGE ON DETECTION OF EMOTION
12
revealed only a main effect of age but no interactions with age. Thus, the arousal-mediated
effects on detection time appeared stable in young and older adults.
The results described above suggested that there was no influence of age on the
influences of emotion. To further test the validity of this hypothesis, we submitted the RTs to the
five categories of targets to a 2 × 5 (Age [young, old] × Target Category [positive high arousal,
Statistics
in text, 4.44
positive low arousal, neutral, negative low arousal, negative high arousal]) repeated measures
2
ANOVA. Both the age group, F(1, 46) = 540.32, p < .001,
ηp2
= .92, and the ta rget category,
Spacing, alignment,
and punctuation of
mathematical copy, 4.46
F(4, 184) = 8.98, p < .001, η p2 = .16, main effects were significant, as well as the Age Group ×
Target Category interaction, F(4, 184) = 3.59, p = .008, η p2 = .07. This interaction appeared to
reflect the fact that for the younger adults, positive high arousal targets were detected faster than
targets from all other categories, ts(23) < –1.90,p < .001, with no other target categories
differing significantly from one another (although there were trends for negative high arousal
Capitalize effects
or variables when
they appear with
multiplication
signs, 4.20
and negative low arousal targets to be detected more rapidly than neutral targets (p < .12). For
older adults, all emotional categories of targets were detected more rapidly than were neutral
targets, ts(23) > 2.56, p < .017, and RTs to the different emotion categories of targets did not
differ significantly from one another. Thus, these results provided some evidence that older
adults may show a broader advantage for detection of any type of emotional information,
whereas young adults’ benefit may be more narrowly restricted to only certain categories of
Elements of the
Discussion section, 2.08
emotional information.
Discussion
As outlined previously, there were three plausible alternatives for young and older adults’
performance on the visual search task: The two age groups could show a similar pattern of
enhanced detection of emotional information, older adults could show a greater advantage for
27
Return to Inside-out procedure
Figure 2.1. Sample One-Experiment Paper (continued)
EFFECTS OF AGE ON DETECTION OF EMOTION
13
emotional detection than young adults, or older adults could show a greater facilitation than
young adults only for the detection of positive information. The results lent some support to the
first two alternatives, but no evidence was found to support the third alternative.
Clear statement of support or
nonsupport of hypotheses,
Discussion, 2.08
In line with the first alternative, no effects of age were found when the influence of
valence and arousal on target detection times was examined; both age groups showed only an
arousal effect. This result is consistent with prior studies that indicated that arousing information
can be detected rapidly and automatically by young adults (Anderson, Christoff, Panitz, De
Rosa, & Gabrieli, 2003; Ohman & Mineka, 2001) and that older adults, like younger adults,
continue to display a threat detection advantage when searching for negative facial targets in
nd neutral distractors (Hahn et al., 2006; Mather & Knight, 2006).
6 Given the
arrays of positive and
relative preservationn of automatic processing with aging (Fleischman, Wilson, Gabrieli, Bienias,
EFFECTS OF AGE ON DETECTION OF EMOTION
nnings & Jacoby, 1993),
3 it makes sense that older adults would remain able
& Bennett, 2004; Jennings
14
to take advantage off these automatic
alerting
systems
foreffects
detecting
high arousal
information.
processing,
given
that no
of valence
were observed
in older adults’ detection speed. In the
espite the similarity
in arousal-mediated
on detection
between
the two
However, despite
effects
present study,
older adults were
equally
fast to detect
positive
and negative information,
ent study did provide some evidence for age-related change (specifically,
age groups, the present
consistent with prior research that indicated that older adults often attend equally to positive and
ment) in the detection of emotional information. When examining RTs for
age-related enhancement)
negative stimuli (Rosler et al., 2005). Although the pattern of results for the young adults has
in detecting
positive
the five categories of emotionaldiffered
targets,across
younger
adults were
efficient
studies—in
the more
present
study and
in some past
research, young adults have shown
2 whereas
high arousal images (as presented
in Table
2),
older adults
displayed
an Anderson,
overall 2005; Calvo & Lang, 2004; Carretie
facilitated
detection
of positive
information
(e.g.,
ting all emotional
This2006),
pattern
advantage for detecting
images
compared
withNummenmaa
neutral images.
et al., 2004;
Juth
et al., 2005;
et al.,
whereas in other studies, young adults
nfluence ofhave
suggests a broader influence
emotion
onan
older
adults’ for
detection
of information
stimuli, providing
support & Dolan, 2002; Hansen &
shown
advantage
negative
(e.g., Armony
hat as individuals
age,
emotional
becomes
more salient.
for the hypothesis that
Hansen,
1988;
Mogg, information
Bradley, de Bono,
& Painter,
1997; Pratto & John, 1991; Reimann &
ng that thisMcNally,
second set
of findings
It is interesting
is clearly
inconsistent
with 1996)—what
the hypothesis
1995;
Williams,
Mathews,
& MacLeod,
is important to note is that the
ffect in older
information
that the positivity effect
adults
operates
at relatively
automatic
stages of
older
adults
detected
both positive
and negative
stimuli
at equal rates. This equivalent detection
of positive and negative information provides evidence that older adults display an advantage for
Use of an em dash to
indicate an interruption the detection of emotional information that is not valence-specific.
in the continuity of a
Thus, although younger and older adults exhibited somewhat divergent patterns of
sentence, 4.06;
emotional detection on a task reliant on early, relatively automatic stages of processing, we
Description of an
em dash, 4.13
found no evidence of an age-related positivity effect. The lack of a positivity focus in the older
adults is in keeping with the proposal (e.g., Mather & Knight, 2006) that the positivity effect
does not arise through automatic attentional influences. Rather, when this effect is observed in
older adults, it is likely due to age-related changes in emotion regulation goals that operate at
later stages of processing (i.e., during consciously controlled processing), once information has
been attended to and once the emotional nature of the stimulus has been discerned.
Although we cannot conclusively say that the current task relies strictly on automatic
processes, there are two lines of evidence suggesting that the construct examined in the current
28
Return to Inside-out procedure
Figure 2.1. Sample One-Experiment Paper (continued)
EFFECTS OF AGE ON DETECTION OF EMOTION
15
Use of parallel construction
with coordinating conjunctions
used in pairs, 3.23
research examines relatively automatic processing. First, in their previous work, Ohman et al.
(2001) compared RTs with both 2 × 2 and 3 × 3 arrays. No significant RT differences based on
the number of images presented in the arrays were found. Second, in both Ohman et al.’s (2001)
study and the present study, analyses were performed to examine the influence of target location
on RT. Across both studies, and across both age groups in the current work, emotional targets
were detected more quickly than were neutral targets, regardless of their location. Together,
these findings suggest that task performance is dependent on relatively automatic detection
Discussion section ending
with comments on
importance of findings, 2.08
processes rather than on controlled search processes.
Although further work is required to gain a more complete understanding of the agehe early processing of emotional information, our findings indicate that
related changes in the
ults are similar in their early detection of emotional images. The current
young and older adults
EFFECTS OF AGE ON DETECTION OF EMOTION
her evidence that mechanisms associated with relatively automatic processing
study provides further
the life span
of emotional imagess are well maintained throughout the latter portion of References
16
Construction of an accurate and
complete reference list, 6.22;
General desciption of references, 2.11
3 Leclerc
(Fleischman et al., 2004; Jennings
& Jacoby,
& Hess,
2005). It
critical
that, dynamics supporting awareness.
Anderson,
A. K.1993;
(2005).
Affective
influences
onisthe
attentional
idence for a positive
focusofinExperimental
older adults’ controlled
processing
emotional
although there is evidence
Journal
Psychology:
General,of
154,
258– 281. doi:10.1037/00962005; Charles et al., 2003; Mather & Knight, 2005), the
information (e.g., Carstensen & Mikels,
3445.134.2.258
est that theAnderson,
tendency to
on the positive
doesD.,
notDe
always
present results suggest
A. focus
K., Christoff,
K., Panitz,
Rosa ,arise
E., &when
Gabrieli, J. D. E. (2003). Neural
ely automatic and correlates
rapid detection
information
in the environment.
tasks require relatively
of theof
automatic
processing
of threat facial signals. Journal of Neuroscience,
23, 5627– 5633.
Armony, J. L., & Dolan, R. J. (2002). Modulation of spatial attention by fear-conditioned
stimuli: An event-related fMRI study. Neuropsychologia, 40, 817–826.
doi:10.1016/S0028-3932%2801%2900178-6
Beck, A. T., Epstein, N., Brown, G., & Steer, R. A. (1988). An inventory for measuring clinical
anxiety: Psychometric properties. Journal of Consulting and Clinical Psychology, 56,
893– 897. doi:10.1037/0022-006X.56.6.893
Calvo, M. G., & Lang, P. J. (2004). Gaze patterns when looking at emotional pictures:
Motivationally biased attention. Motivation and Emotion, 28, 221–243. doi:
10.1023/B%3AMOEM.0000040153.26156.ed
Carretie, L., Hinojosa, J. A., Martin-Loeches, M., Mecado, F., & Tapia, M. (2004). Automatic
attention to emotional stimuli: Neural correlates. Human Brain Mapping , 22, 290–299.
doi:10.1002/hbm.20037
Carstensen, L. L. (1992). Social and emotional patterns in adulthood: Support for socioemotional
selectivity theory. Psychology and Aging, 7, 331–338. doi:10.1037/0882-7974.7.3.331
Carstensen, L. L., Fung, H., & Charles, S. (2003). Socioemotional selectivity theory and the
regulation of emotion in the second half of life. Motivation and Emotion , 27, 103– 123.
29
Return to Inside-out procedure
Figure 2.1. Sample One-Experiment Paper (continued)
EFFECTS OF AGE ON DETECTION OF EMOTION
17
Carstensen, L. L., & Mikels, J. A. (2005). At the intersection of emotion and cognition: Aging
and the positivity effect. Current Directions in Psychological Science, 14, 117–121. doi:
10.1111/j.0963-7214.2005.00348.x
ather, M., & Carstensen, L. L.. (2003). Aging and emotional memory: The
Charles, S. T., Mather,
forgettablee nat
nature of negative images for older adults. Journal of Experimental
DETECTION OF EMOTION
y: G
Psychology:
General,EFFECTS
132, 310–OF
324.AGE
doi: ON
10.1037/0096-3445.132.2.310
18
Chow, T. W., & Cum
Cummings, J. L. (2000). The amygdala and Alzheimer’s disease. In J. P.
Grühn, D., Smith, J., & Baltes, P. B. (2005). No aging bias favoring memory for positive
Aggleton (Ed.
(Ed.), The amygdala: A functional analysis (pp. 656– 680). Oxford, England:
material: Evidence from a heterogeneity-homogeneity list paradigm using emotionally
niver
Oxford University
Press.
toned words. Psychology and Aging, 20, 579–588. doi:10.1037/0882-7974.20.4.579
Davis, M., & Wha
alen P. J. (2001). The amygdala: Vigilance and emotion. Molecular Psychiatry,
Whalen,
Hahn, S., Carlson, C., Singer, S., & Gronlund, S. D. (2006). Aging and visual search: Automatic
Automa
6, 13–
3 34. doi:
d 10.1038/sj.mp.4000812
13–34.
and controlled attentional bias to threat faces. Acta Psychologica, 123 , 312–
2 336. doi:
312–336.
Dolan, R.
J., & Vu
uille
Vuilleumier,
P. (2003).
Amygdala
EFFECTS
OF AGE ON
DETECTION
OFautomaticity
EMOTION in emotional processing. Annals
10.1016/j.actpsy.2006.01.008
19
of the New
w Yo
York Academy of Sciences, 985, 348–355.
Hansen, C. H., & Hansen , R. D. (1988). Finding the face in the crowd: An anger superiority
Kensinger, E. A., Brierley, B., Medford , N., Growdon, J. H., & Corkin, S. (2002). Effects of
Digital object identifier as
Eastwood, J. D., Smilek,
S
Smil
D., & Merikle, P. M. (2001). Negative facial expression captures
effect. Journal of Personality and Social Psychology , 54, 917–924.
917–
7 924. doi: 10.1037/0022normal aging and Alzheimer’s disease on emotional memory. Emotion, 2, 118– 134. doi: article identifier, 6.31;
attention an
nd ddisrupts performance. Perceptual Physiology, 65, 352–358.
and
Example of reference to a
3514.54.6.917
10.1037/1528-3542.2.2.118
periodical, 7.01
Fleischman, D. A.,
A. , Wilson,
W
R. S., Gabrieli, J. D. E., Bienias, J. L., & Bennett , D. A. (2004). A
Hedden, T., & Gabrieli,i J. D. E. (2004). Insights into the ageing mind: A view from cognitive
Lang, P. J., Bradley, M. M., & Cuthbert, B. N. (1997). Motivated attention: Affect, activation,
longitudina
al st
longitudinal
study of implicit and explicit memory in old persons. Psychology and Aging,
87–
7 96. doi:10.1038/nrn1323
neuroscience. Nature Reviews Neuroscience, 5, 87–96.
and action. In P. J. Lang, R. F. Simons, & M. Balaban (Eds.), Attention and orienting:
19, 617–
7 6225. ddoi: 10.1037/0882-7974.19.4.617
617–625.
Example
of reference
to a
Isaacowitz, D. M., Wadlinger, H. A., Goren, D., & Wilson , H. R.. (2006). Selective
preference
in
Sensory and motivational processes (pp. 97– 135). Mahwah, NJ: Erlbaum.
book chapter, print verison,
Good, C. D., John
srud I. S., Ashburner, J., Henson, R. N. A., Firston, K. J., & Frackowiak, R.
Johnsrude,
eyee trackingno
Psycholo
visual fixation away from negative images in old age? An eye-tracking
study.
DOI,Psychology
7.02, Example 25
Leclerc, C. M., & Hess, T. M. (2005, August). Age differences in processing of affectively
S. J.. (2001)
). A voxel-based morphometric study of ageing in 465 normal adult human
(2001).
0 48. doi:10.1037/08822 7974.21.1.40
of Aging, 21, 40–
doi:10.1037/0882-7974.21.1.40
primed information. Poster session presented at the 113th Annual Convention of the
brains. Neu
uroI
NeuroImage,
14, 21–36. doi:10.1006/nimg.2001.0786
Jennings, J. M., & Jacoby, L. L.. (1993). Automatic versus intentional uses of memory: Aging,
American Psychological Association, Washington, DC.
3 293. doi:10.1037/0882attention, and control. Psychology and Aging, 8, 283–
283–293.
LeDoux, J. E. (1995). Emotion: Clues from the brain. Annual Review of Psychology, 46, 209–
7974.8.2.283
235. doi:10.1146/annurev.ps.46.020195.001233
Juth, P., Lundqvist, D., Karlsson, A., & Ohman, A. (2005). Looking for foes and friends:
Mather, M ., & Knight, M. (2005). Goal-directed memory: The role of cognitive control in older
9 39
Perceptual and emotional factors when finding a face in the crowd. Emotion, 5, 379–
395.
adults’ emotional memory. Psychology and Aging, 20, 554–570. doi:10.1037/0882-
7974.20.4.554
doi:10.1037/1528-3542.5.4.379
Kennedy, Q., Mather, M., & Carstensen, L. L.. (2004). The role of motivation in the age-related
age -relate
Mather, M., & Knight, M. R. (2006). Angry faces get noticed quickly: Threat detection is not
positive bias in autobiographical memory. Psychological Science, 15, 208–
8 214.
208–214.
impaired among older adults. Journals of Gerontology, Series B: Psychological Sciences,
61B, P54–P57.
doi:10.1111/j.09566 7976.2004.01503011.x
doi:10.1111/j.0956-7976.2004.01503011.x
Mogg, K., Bradley, B. P., de Bono , J., & Painter, M. (1997). Time course of attentional bias for
threat information in non-clinical anxiety. Behavioral Research Therapy, 35, 297– 303.
Nelson, H. E. (1976). A modified Wisconsin card sorting test sensitive to frontal lobe defects.
Cortex, 12, 313–324.
30
Return to Inside-out procedure
Figure 2.1. Sample One-Experiment Paper (continued)
EFFECTS OF AGE ON DETECTION OF EMOTION
20
Nummenmaa, L., Hyona , J., & Calvo, M. G. (2006). Eye movement assessment of selective
attentional capture by emotional pictures. Emotion, 6, 257–268. doi:10.1037/1528-
Article with more than
3542.6.2.257
seven authors, 7.01,
Ohman, A., Flykt, A., & Esteves, F. (2001). Emotion drives attention: Detecting the snake in the
Example 2
7/0096
grass. Journal of Experimental Psychology: General, 130, 466–478. doi:10.1037/0096-
EFFECTS OF AGE ON DETECTION OF EMOTION
3445.130.3.466
21
modu
Ohman, A., & Mineka, S. (2001). Fears, phobias, and preparedness: Toward an evolvedd module
Rosler, A., Ulrich, C., Billino, J., Sterzer, P., Weidauer , S., Bernhardt, T., …Kleinschmidt, A.
3of fear and fear learning. Psychological Review, 108, 483–522. doi:10.1037/0033(2005). Effects of arousing emotional scenes on the distribution of visuospatial attention:
295X.108.3.483
Changes with aging and early subcortical vascular dementia. Journal of the Neurological
orphol
Ohnishi, T., Matsuda, H., Tabira, T., Asada, T., & Uno, M. (2001). Changes in brain morphology
Sciences,229, 109–116. doi:10.1016/j.jns.2004.11.007
in Alzheimer’s disease and exaggerated aging process? American Journal of
Shipley, W. C. (1986).Shipley Institute of Living Scale. Los Angeles, CA: Western Psychological
Services.
Neuroradiology, 22, 1680–1685.
emporaalPetrides, M., & Milner, B. (1982). Deficits on subject-ordered tasks after frontal- and temporalSpielberger, C. D., Gorsuch, I., & Lushene, R. E. (1970). Manual for the State–Trait Inventory.
lobe lesions in man. Neuropsychologia, 20, 249–262. doi: 10.1016/0028Palo Alto, CA: Consulting Psychologists Press.
EFFECTS OF AGE ON DETECTION OF EMOTION
3932%2882%2990100-2
). Wechslerr Memory Scale
— Revised. San Antonio, TX: Psychological
Wechsler, D. (1987).
Scale—
Corporation..
22
Placement and format
negati
ve
Pratto, F., & John, O. P. (1991). Automatic vigilance:Footnotes
The attention-grabbing power of of
negative
footnotes,
2.12
1
Analyses Journal
of covariance
were conducted
with
these covariates,
with nodoi:
resulting
social information.
of Personality
and Social
Psychology,
61, 380–391.
). Technical manual for the Wechsler Adult Intelligence and Memory Scale–
Wechsler, D. (1997).
10.1037/0022-3514.61.3.380
influences of these variables on the pattern or magnitude of the results.
rk, NY: The Psychological Corporation.
III. New York,
2 These
of of target
Raz, N. (2000). Aging
ofdata
the brain
and its
impactwith
on cognitive
performance:
Integration
were also
analyzed
a 2 × 5 ANOVA
to examine
the effect
West, R. L. (1996). An application of prefrontal cortex function theory to cognitive aging.
andboo
ok
structural
functional
findings.
F. I. containing
M. Craik &neutral
T. A. Salthouse
(Eds.),
Handbook
categoryand
when
presented
only in In
arrays
images, with
the results
remaining
cal Bulletin, 120, 272–
2 292. doi: 10.1037/0033-2909.120.2.272
Psychological
272–292.
ofqualitatively
aging and cognition
ed.,
pp. 1–90).
Mahwah,
NJ: Erlbaum.
the same.(2nd
More
broadly,
the effects
of emotion
on target detection were not
athews , A., & MacLeod, C., (1996). The emotional Stroop task and
Williams, J. M., Mathews
mation..
Reimann,qualitatively
B., & McNally,
R. (1995).
impacted
by theCognitive
distractorprocessing
category. of personally relevant information.
ology. Psychological Bulletin, 120, 3–
3 24. doi: 10.1037/0033-2909.120.1.3
psychopathology.
Cognition and Emotion, 9, 324–340.
erman, N., Burgess, P. W., Emslie,
e H. C., & Evans, J. J. (1996). The
Wilson, B. A., Alderman,
Behaviourall Assessment of the Dysexecutive Syndrome. Flempton, Bury St. Edmunds,
hames Valley Test Company.
England: Thames
31
Return to Inside-out procedure
Figure 2.1. Sample One-Experiment Paper (continued)
EFFECTS OF AGE ON DETECTION OF EMOTION
23
Table 1
Participant Characteristics
Younger group
Measure
M
SD
Years of education
13.92
1.28
Beck Anxiety Inventory
9.39
5.34
BADS– DEX
20.79
7.58
Selecting effective
STAI–State
45.79
4.44
STAI–Trait
45.64
4.50
presentation, 4.41;
Digit Symbol Substitution
49.62
7.18
Logical and effective
Generative naming
46.95
9.70
table layout, 5.08
Vocabulary
33.00
3.52
Digit Span– Backward
8.81
2.09
Arithmetic
16.14
2.75
Mental Control
32.32
3.82
Self-Ordered Pointing
1.73
2.53
CTS OF AGE
ON
DETECTION
OF
EMOTION
EFFECTS
WCST perseverative errors
0.36
0.66
Table 2
Older group
M
SD
16.33
2.43
6.25
6.06
13.38
8.29
47.08
3.48
45.58
3.15
31.58
6.56
47.17
12.98
35.25
3.70
8.25
2.15
14.96
3.11
23.75
5.13
9.25
9.40
1.83
3.23
F (1, 46)
18.62
3.54
10.46
1.07
0.02
77.52
.004
4.33
0.78
1.84
40.60
13.18
4.39
24
p
<.001
.066
.002
.306
.963
<.001
.951
.043
.383
.182
<.001
.001
.042
Note. The Beck Anxiety Inventory is from Beck et al. (1988); the Behavioral Assessment of the
Dysexecutive
Questionnaire (BADS–DEX) is from Wilson et al.
esponse Time
for Young and Older Adults
Raw Response
(RT) ScoresSyndrome—Dysexecutive
(1996); the State–Trait Anxiety Inventory (STAI) measures are from Spielberger et al. (1970);
ory
y
Young
gg
group
p
Older group
g p
Category
and the Digit Symbol
ve high arousal
825Substitution, Digit Span–Backward,
1,580 and Arithmetic Wechsler Adult
Positive
ve low arousal
899and Wechsler Memory Scale—III
1,636 measures are from Wechsler (1997).
Positive
Intelligence Scale—III
al
912
1,797
Neutral
ive high arousal
1,578
Negative
Generative naming 885
scores represent the total number
of words produced in 60 s each for letter
ive low arousal
896
1,625
Negative
F, A, and S. The Vocabulary measure is from Shipley (1986); the Mental Control measure is
Note. Values represent median response times, collapsing across array type and excluding arrays
from Wechsler (1987); the Self-Ordered Pointing measure was adapted from Petrides and Milner
of the same category as targets (i.e., positive high arousal represents the median RT to respond to
(1982); and the Wisconsin Card Sorting Task (WCST) measure is from Nelson (1976).
positivee high arousal targets, collapsing across positive low arousal, neutral, negative high
All values represent raw, nonstandardized scores.
arousal,, and negative low arousal array categories). The median response time values were
Elements of
table notes, 5.16
ed in milliseconds.
recorded
32
Return to Inside-out procedure
Figure 2.1. Sample One-Experiment Paper (continued)
EFFECTS OF AGE ON DETECTION OF EMOTION
25
Principles of figure use and
construction, types of figures;
standards, planning, and
preparation of figures, 5.20–5.25
.
Figure 1. Mean difference values (ms) representing detection speed for each target category
subtracted from the mean detection speed for neutral targets. No age differences were found in the
arousal-mediated effects on detection speed. Standard errors are represented in the figure by the
Figure legends
and captions, 5.23
error bars attached to each column.
33
Return to Inside-out procedure
Figure 2.2. Sample Two-Experiment Paper (The numbers refer to numbered sections in the Publication Manual. This abridged manuscript illustrates the organizational structure characteristic of
multiple-experiment papers. Of course, a complete multipleexperiment paper would include a title page, an abstract page,
and so forth.)
INHIBITORY INFLUENCES ON ASYCHRONY
1
Inhibitory Influences on Asychrony as a Cue for Auditory Segregation
Auditory grouping involves the formation of auditory objects from the sound mixture
reaching the ears. The cues used to integrate or segregate these sounds and so form auditory
objects have been defined by several authors (e.g., Bregman, 1990; Darwin, 1997; Darwin &
Carlyon, 1995). The key acoustic cues for segregating concurrent acoustic elements are
differences in onset time (e.g., Dannenbring & Bregman, 1978; Rasch, 1978) and harmonic
relations (e.g., Brunstrom & Roberts, 1998; Moore, Glasberg, & Peters, 1986). In an example of
the importance of onset time, Darwin (1984a, 1984b) showed that increasing the level of a
harmonic near the first formant (F1) frequency by adding a synchronous pure tone changes the
phonetic quality of a vowel. However, when the added tone began a few hundred milliseconds
before the vowel, it was essentially removed from the vowel percept.… [section continues].
General Method
Elements of empirical studies, 1.01
Overview
In the experiments reported here, we used a paradigm developed by Darwin to assess the
perceptual integration of additional energy in the F1 region of a vowel through its effect on
phonetic quality (Darwin, 1984a, 1984b; Darwin & Sutherland, 1984).…[section continues].
Stimuli
Amplitude and phase values for the vowel harmonics were obtained from the vocal-tract
transfer function using cascaded formant resonators (Klatt, 1980). F1 values varied in 10-Hz
steps from 360–550 Hz—except in Experiment 3, which used values from 350– 540 Hz—to
produce a continuum of 20 tokens.…[section continues].
Listeners
Paper adapted from “Inhibitory Influences on Asychrony as a Cue for Auditory Segregation,” by S. D.
Holmes and B. Roberts, 2006, Journal of Experimental Psychology: Human Perception and Performance, 32,
pp. 1231–1242. Copyright 2006 by the American Psychological Association.
34
Return to Inside-out procedure
Figure 2.2. Sample Two-Experiment Paper (continued)
INHIBITORY INFLUENCES ON ASYCHRONY
2
Listeners were volunteers recruited from the student population of the University of
Birmingham and were paid for their participation. All listeners were native speakers of British
English who reported normal hearing and had successfully completed a screening procedure
(described below). For each experiment, the data for 12 listeners are presented.…[section
Plural forms of nouns
of foreign origin, 3.19
continues].
Procedure
At the start of each session, listeners took part in a warm-up block. Depending on the
number of conditions in a particular experiment, the warm-up block consisted of one block of all
the experimental stimuli or every second or fourth F1 step in that block. This gave between 85
and 100 randomized trials. … [section continues].
Data Analysis
The data for each listener consisted of the number of /I/ responses out of 10 repetitions
for each nominal F1 value in each condition. An estimate of the F1 frequency at the phoneme
boundary was obtained by fitting a probit function (Finney, 1971) to a listener ’s identification
data for each condition. The phoneme boundary was defined as the mean of the probit function
(the 50% point).…[section continues].
Multiple Experiments, 2.09
Experiment
pe
e t1
iment, we used noise-band captors and compared their efficacy with that of a
In this experiment,
ASYCHRONY
ach noise-band
- INHIBITORY
captor had INFLUENCES
the same energyON
as that
of the corresponding pure
pure-tone captor. Each
pure--
3
nter frequency equal to the frequency of this tonal captor…[section
tone captor and a center
There were nine conditions: the three standard ones (vowel alone, incremented fourth,
continues].
Method
and leading fourth) plus three captor conditions and their controls. A lead time of 240 ms was
Abbreviating units
of measurement,
4.27, Table 4.4
used for the added 500-Hz tone.… [section continues].
Results and Discussion
Figure 4 shows the mean phoneme boundaries for all conditions and the restoration effect
for each captor type. The restoration effects are shown above the histogram bars both as a
boundary shift in hertz and as a percentage of the difference in boundary position between the
incremented-fourth and leading-fourth conditions.… [section continues].
Experiment 2
This experiment considers the case where the added 500-Hz tone begins at the same time
as the vowel but continues after the vowel ends.… [section continues].
Method
There were five conditions: two of the standard ones (vowel alone and incremented
Policy on metrication, 4.39; fourth), a lagging-fourth condition (analogous to the leading-fourth condition used elsewhere),
Style for metric units, 4.40 and a captor condition and its control. A lag time of 240 ms was used for the added 500-Hz
tone.… [section continues]
Results and Discussion
35
Return to Inside-out procedure
Figure 2.2. Sample Two-Experiment Paper (continued)
INHIBITORY INFLUENCES ON ASYCHRONY
4
1984; Roberts & Holmes, 2006). This experiment used a gap between captor offset and vowel
onset to measure the decay time of the captor effect …[section continues].
Method
There were 17 conditions: the three standard ones (vowel alone, incremented fourth, and
leading fourth), five captor conditions and their controls, and four additional conditions
(described separately below). A lead time of 320 ms was used for the added 500-Hz tone. The
captor conditions were created by adding a 1.1-kHz pure-tone captor, of various durations, to
each member of the leading-fourth continuum.…[section continues].
Results
Figure 6 shows the mean phoneme boundaries for all conditions. There was a highly
Use of statistical term rather
than symbol in text, 4.45
significant effect of condition on the phoneme boundary values, F(16, 176) = 39.10, p < .001.
Incrementing the level of the fourth harmonic lowered the phoneme boundary relative to the
vowel-alone condition (by 58 Hz, p < .001), which indicates that the extra energy was integrated
into the vowel percept.…[section continues].
Discussion
The results of this experiment show that the effect of the captor disappears somewhere
between 80 and 160 ms after captor offset. This indicates that the captor effect takes quite a long
time to decay away relative to the time constants typically found for cells in the CN using
physiological measures (e.g., Needham & Paolini, 2003).…[section continues].
Summary and Concluding Discussion
Darwin and Sutherland (1984) first demonstrated that accompanying the leading portion
of additional energy in the F1 region of a vowel with a captor tone partly reversed the effect of
the onset asynchrony on perceived vowel quality. This finding was attributed to the formation of
Running head: INHIBITORY INFLUENCES ON ASYCHRONY
5
a perceptual group between the leading portion and the captor tone, on the basis oof their common
onset time and harmonic relationship, leaving the remainder of the extra energy to integrate into
the vowel percept… .[section continues].
[Follow the form of the one-experiment sample paper to type references, the author note,
footnotes, tables, and figure captions.]
36
Return to Inside-out procedure
Figure 2.3. Sample Meta-Analysis (The numbers refer to numbered sections in the Publication Manual. This abridged manuscript illustrates the organizational structure characteristic of reports of
meta-analyses. Of course, a complete meta-analysis would
include a title page, an abstract page, and so forth.)
THE SLEEPER EFFECT IN PERSUASION
1
The Sleeper Effect in Persuasion:
A Meta-Analytic Review
Persuasive messages are often accompanied by information that induces suspicions of
invalidity. For instance, recipients of communications about a political candidate may discount a
message coming from a representative of the opponent party because they do not perceive the
source of the message as credible (e.g., Lariscy & Tinkham, 1999). Because the source of the
political message serves as a discounting cue and temporarily decreases the impact of the
message, recipients may not be persuaded by the advocacy immediately after they receive the
communication. Over time, however, recipients of an otherwise influential message may recall
the message but not the noncredible source and thus become more persuaded by the message at
Italicize key terms, 4.21
that time than they were immediately following the communication. The term sleeper effect was
used to denote such a delayed increase in persuasion observed when the discounting cue (e.g.,
or “dissociated”
the communication in the
noncredible source) becomes unavailable
THE SLEEPER
EFFECT INfrom
PERSUASION
2
sage recipients (Hovland, Lumsdaine, & Sheffield, 1949).…[section
memory of the message
retention, attitude and decay, and persuasion and decay . Because researchers often use the terms
opinion and belief, instead of attitude, we conducted searches using these substitute terms as
Method
well.
Description of meta-analysis, 1.02;
Sample of Studies
Guidelines for reporting meta-analysis,
Second, … [section continues].
2.10;
d reports related to the sleeper effect that were available by March
2003see
by also Appendix
We retrieved
Selection Criteria
rocedures. First, we searched computerized databases, including PsycINFO
means of multiple procedures.
We used the following criteria to select studies for inclusion in the meta-analysis.
rtation Abstracts International (1861– 2003), ERIC (1967–
7 2003), and the
(1887–2003), Dissertation
(1967–2003),
1. We only included studies that involved the presentation of a communication containing
tion-Index (1956–
6 2003), using the keywords sleeper effect,t delayed-action,
Social-Science-Citation-Index
(1956–2003),
persuasive arguments. Thus, we excluded studies in which the participants played a role or were
redibility,
source
expertise, attitude change, discounting cue, attitude
credibility, source credibility,
asked to make a speech that contradicted their opinions. We also excluded developmental studies
r , attitude and
persistence, attitude maintenance, persuasion, propaganda, attitude and memory
memory,
involving delayed effects of an early event (e.g., child abuse), which sometimes are also referred
to as sleeper effects.…[section continues] .
Identification of elements in a
series within a sentence, 3.04
Moderators
For descriptive purposes, we recorded (a) the year and (b) source (i.e., journal article,
unpublished dissertations and theses, or other unpublished document) of each report as well as
(c) the sample composition (i.e., high-school students, university students, or other) and (d) the
country in which the study was conducted.
We also coded each experiment in terms of .…[section continues].
Studies were coded independently by the first author and another graduate student.
Paper adapted from “The Sleeper Effect in Persuasion: A Meta-Analytic Review,” by G. Kumkale and D.
Albarracin, 2004, Psychological Bulletin, 130, pp. 143–172. Copyright 2004 by the American Psychological
Association.
37
Return to Inside-out procedure
Figure 2.3. Sample Meta-Analysis (continued)
THE SLEEPER EFFECT IN PERSUASION
3
was satisfactory (Orwin, 1994). We resolved disagreements by discussion and consultation with
colleagues. Characteristics of the individual studies included in this review are presented in
Table 1. The studies often contained several independent datasets such as different messages and
different experiments. The characteristics that distinguish different datasets within a report
appear on the second column of the table.
Dependent Measures and Computation of Effect Sizes
We calculated effect sizes for (a) persuasion and (b) recall–recognition of the message
content. Calculations were based on the data described in the primary reports as well as available
responses of the authors to requests of further information.…[section continues].
Analyses of Effect Sizes
wo major models used in meta-analysis:
metaa analysis: fixed-effectss and randomThere are two
ontinues]. THE SLEEPER EFFECT IN PERSUASION
effects.…[section continues].
4
Use at least
two subheadings
in a section, 3.02
To benefit from thee strengths place
of both
models,
we chose to
aggregate the effect sizes and to
over
time.…[section
continues].
ing both approaches.…[section
conduct analyses using
continues].
In light of these requirements,
we first examined whether discounting cues led to a decrease in
agreement withResults
the communication (boomerang effect). Next,.…[section continues].
alysis included a description
The data analysis
experimentsboomerang
we summarized,
an To determine whether or not a delayed
Ruling outofathe
nonpersisting
effect.
estimation of overalll effects, moderator
andrepresents
tests of mediation.
increase inanalyses,
persuasion
an absolute sleeper effect, one needs to rule out a nonpersisting
Sample of Studies and Datasets
boomerang effect, which takes place when a message initially backfires but later loses this
metaa analysis
appear in
Descriptive characteristics
of the
datasets
included
the present
meta-analysis
reverse
effect
(see panel
A ofinFigure
1).…[section
continues].
Table 2.…[section continues].
Average sleeper effect. Relevant statistics corresponding to average changes in
verage Effect
Overview of the Average
Sizes from the immediate to the delayed posttest appear in Table 4, organized by the
persuasion
A thorough understanding
of theconditions
sleeper effect
requires examining
(a) the betweendifferent
we considered
(i.e., acceptance-cue,
discounting-cue, no-message control,
es at each time
condition differences
point as well as
(b) the within-condition
changes
and message-only
control).
In Table 4, positive
effect that
sizestake
indicate increases in persuasion over
time, negative effect sizes indicate decay in persuasion, and zero effects denote stability in
persuasion. Confidence intervals that do not include zero indicate significant changes over time.
The first row of Table 4 shows that recipients of acceptance cues agreed with the message less as
time went by (fixed-effects, d + = –0.21; random-effects, d + = –0.23). In contrast to the decay in
persuasion for recipients of acceptance cues, there was a slight increase in persuasion for
recipients of discounting cues over time (d + = 0.08). It is important to note that change in
discounting-cue conditions significantly differed from change in acceptance-cue conditions,
(fixed-effects; B = –0.29, SE = 0.04), QB (1) = 58.15, p < .0001; QE(123) = 193.82, p <
.0001.…[section continues].
Summary and variability of the overall effect. The overall analyses identified a relative
sleeper effect in persuasion, but no absolute sleeper effect. The latter was not surprising, because
the sleeper effect was expected to emerge under specific conditions.…[section continues].
38
Return to Inside-out procedure
Figure 2.3. Sample Meta-Analysis (continued)
THE SLEEPER EFFECT IN PERSUASION
5
Moderator Analyses
Although overall effects have descriptive value, the variability in the change observed in
discounting-cue conditions makes it unlikely that the same effect was present under all
conditions. Therefore, we tested the hypotheses that the sleeper effect would be more likely (e.g.,
more consistent with the absolute pattern in Panel B1 of Figure 1) when…[section continues].
Format for references included
in a meta-analysis with less
than 50 references, 6.26
THE SLEEPER EFFECT IN PERSUASION
6
References
References marked with an asterisk indicate studies included in the meta-analysis.
Albarracín, D. (2002). Cognition in persuasion: An analysis of information processing in
response to persuasive communications. In M. P. Zanna (Ed.), Advances in experimental
social psychology (Vol. 34, pp. 61–130). doi:10.1016/S0065-2601(02)80004-1
… [references continue]
Johnson, B. T., & Eagly, A. H. (1989). Effects of involvement in persuasion: A meta-analysis.
Psychological Bulletin, 106, 290–314. doi:10.1037/0033-2909.106.2.290
*Johnson, H. H., Torcivia, J. M., & Poprick, M. A. (1968). Effects of source credibility on the
relationship between authoritarianism and attitude change. Journal of Personality and
Social Psychology, 9, 179–183. doi:10.1037/h0021250
*Johnson, H. H., & Watkins, T. A. (1971). The effects of message repetitions on immediate and
delayed attitude change. Psychonomic Science, 22, 101–103.
Jonas, K., Diehl, M., & Bromer, P. (1997). Effects of attitudinal ambivalence on information
processing and attitude-intention consistency. Journal of Experimental Social
Psychology, 33, 190–210. doi:10.1006/jesp.1996.1317
. . . [references continue]
[Follow the form of the one-experiment sample paper to type the author note, footnotes,
tables, and figure captions.]
39
Return to Inside-out Procedure
Part III: Examples Psychonomic Bulletin & Review
2008, 15 (2), 404-408
doi: 10.3758/PBR.15.2.404
Visual events modulated by sound
in repetition blindness
Yi-Chuan Chen and Su-Ling Yeh
National Taiwan University, Taipei, Taiwan
Repetition blindness (RB; Kanwisher, 1987) is the term used to describe people’s failure to detect or report an
item that is repeated in a rapid serial visual presentation (RSVP) stream. Although RB is, by definition, a visual deficit, whether it is affected by an auditory signal remains unknown. In the present study, we added two sounds before,
simultaneous with, or after the onset of the two critical visual items during RSVP to examine the effect of sound on
RB. The results show that the addition of the sounds effectively reduced RB when they appeared at, or around, the
critical items. These results indicate that it is easier to perceive an event containing multisensory information than
unisensory ones. Possible mechanisms of how visual and auditory information interact are discussed.
Open Gen Intro (1) General and Familiar opening Also connect with last paragraph
If a wind shear is encountered by an airplane, the captain will turn on the “fasten seatbelt” sign, the onset of
which is usually accompanied by a sound to avoid this
sign’s being ignored. Is an event containing multisensory
information easier to perceive than a unisensory one? In
this study, we examined whether a sound can ameliorate
Open Gen
the unobserved visual event by adopting the repetition
Intro (2):blindness (RB) paradigm.i.e., W1
Write the RB is the failure to report the second occurrence of a
repeated item in a rapid serial visual presentation (RSVP).
researchFor example, the sentence “I prefer green tea for tea time”
goal in may be misreported as “I prefer green tea for time,” even
though such an error leads to an ungrammatical sentence.
first
RB is considered a general phenomenon in visual perception, since it has been demonstrated with many different
paragraph
visual stimuli, including English words (Harris & Morris,
2001), Chinese characters (Yeh & Li, 2004), colors and
forms (Kanwisher, Driver, & Machado, 1995), and pictures (Bavelier, 1994).
By definition, RB is a visual deficit. Nevertheless, it is
still unknown whether the factors affecting RB are confined to the visual modality or whether, instead, stimuli
from other modalities can also influence RB. To date,
there have been no studies in which this issue has been
directly examined, although some previous studies have
provided a few insights. For example, Kanwisher and
Potter (1989) had their participants either see or hear
sentences and found RB but not repetition deafness.
They concluded that RB is a modality-specific phenomenon that is confined to vision (but see Miller & MacKay, 1994). Soto-Faraco and Spence (2002) found no
evidence for a cross-modal repetition deficit when the
two critical items, C1 and C2, were presented in different
modalities (e.g., visual C1 and auditory C2). This implies that the source of modality could be taken as a part
of episodic representations, so that the factors affecting
RB might be restricted to vision only.
Despite the foregoing, other studies have suggested that
the perception of visual events can be affected by auditory
input. For example, two visual streams crossing each other
can be perceived as streaming or bouncing, depending on
whether a sound is added (Sekuler, Sekuler, & Lau, 1997;
Shimojo, Watanabe, & Scheier, 2001). Another example
is the attentional blink (AB) for the visual target when it
is preceded by an auditory target (Arnell & Larson, 2002).
The proposed mechanism is that the auditory stimulus may
occupy attentional resources or distract attention away
from the visual stimuli (Arnell & Larson, 2002; Shimojo
et al., 2001; but see Duncan, Martens, & Ward, 1997). By
inference from these studies, it seems that adding sounds
to an RSVP stream should suppress the visual process,
due to lack of attention.
Recalling the seatbelt example mentioned earlier, however, makes us wonder whether there is a third possibility:
Sound enhances the visibility of the originally unobserved
visual event (i.e., enhances the visibility of C2 when it suffers from RB). To examine this, we added two different
beep sounds (S1 and S2) to the visual stream.1 If the added
sounds can enhance the visibility of C1 and C2 in RSVP,
they should be perceived more accurately, thus lessening
RB. The onset time of S2 was manipulated in order to probe
the effective time window of the sound facilitatory effect.
Method
Participants Method for Exp (1): Typical method section
Five groups of 20 undergraduates at National Taiwan University
participated in each of the five sound conditions in exchange for
course credit. They were all native speakers of Mandarin and were
naive concerning the purpose of the experiment. All of the participants had normal or corrected-to-normal vision and hearing.
S.-L. Yeh, [email protected]
Copyright 2008 Psychonomic Society, Inc.
404
40
Repetition Blindness Reduced by Sound 405
A
Repeated trial
Unrepeated trial
+
C1
ƌ ᄁ
F
Ҽ $
ჩ Ҽ
Ք
= ᄁ % +
ᄁ
F
Control trial
C2
C2
Ĺ
༮
43 msec/item
B
No sound
–86 msec
Onset
Offset
+86 msec
–86
0
+86
: S1
Method for Exp (2): Complete the figure
msec
: S2
Figure 1. (A) Examples of the rapid serial visual presentation sequence. In
the repeated trial, C1 and C2 were the same character (銘). In the unrepeated
trial, all the displays were the same as those in the repeated trial, except that
another character (輔) was substituted for C1. The control trial contained
only two characters. (B) The five sound conditions. The time scales of panels A
and B are matched. C1, Critical Item 1; C2, Critical Item 2; F, filler character; S1, Sound 1, 500 Hz; S2, Sound 2, 800 Hz.
Stimuli
The stimuli were presented on a 15-in. calibrated EIZO color
monitor and were controlled by a personal computer. The refresh
rate of the monitor was set at 70 Hz. The participants sat at a viewing
distance of 60 cm from the monitor in a dimly lit chamber.
Each trial consisted of seven items in an RSVP sequence, with
three Chinese characters and four symbols all in white and presented
one at a time for 43 msec in the center of a black background. All
the Chinese characters that were selected were ones commonly used.
The three characters included one filler character (F) and two critical
characters (C1 and C2). To avoid any unwanted priming effects, the
characters in each trial were not orthographically similar, except for
repeated items, and did not have any obvious semantic relationship.
Furthermore, the pronunciations did not sound as meaningful twoor three-character words, nor did they rhyme.
The F and C2 were in the Chia font and subtended a visual angle
of 1.15º  1.24º (width  height). C1 was in the Fong font (1.53º 
1.43º). C1 and C2 were chosen to be different in both font and size
in order to avoid the possible confounding of visual masking caused
by physically identical characters appearing at the same position.
A fixation cross (1.15º  1.24º) was shown at the beginning and
41
end of each trial. Symbols (*) in each trial were randomly selected
from a set of 30 signs, such as &, ▽, ≒, and $, without repeated
symbols occurring in the same trial. The sizes of the 30 symbols
ranged from 1.21º  0.80º to 1.43º  1.24º. In all the trials, the
presented order in RSVP was*, C1, F,*,*, C2,*(see Figure 1A).
The auditory stimuli (S1 and S2; 500 and 800 Hz, respectively) appeared simultaneously with the onsets of two visual items and lasted
for 10 msec each.
Design
The two factors of sound condition (five conditions of beep
sounds accompanying C1 and C2) and repetition (repeated or unrepeated) were manipulated. The five sound conditions were as
follows. The no-sound condition was solely a visual presentation
and served as the baseline for comparison. In the remaining four
conditions, S1 was always presented at the onset of C1, and S2 was
presented before C2 (86 msec), at the onset of C2, at the offset of
C2, or after C2 (186 msec). Except for the onset time of S2, other
items were the same in the four conditions, except for the no-sound
condition (Figure 1B). Different participants were assigned to these
five conditions.
Repetition Blindness Reduced by Sound 407
Discussion
Open Gen Dis (1): State main findings in short paper
In this study, we found that RB can be reduced when
two critical items are accompanied by different sounds.
We replicated the conventional RB effect in the no-sound
condition, while observing a reduced RB effect in the
86-msec, onset, and offset conditions. The 186-msec
condition revealed a magnitude of RB similar to that in
the no-sound condition.
Regarding the three possibilities mentioned earlier of
how sound would affect RB (independent, suppressed, or
enhanced), our results favor the enhanced account, since
the addition of the sounds reduced the magnitude of the
RB effect by facilitating character identification. Even
though processing the RSVP stream is demanding enough
to prevent influences from peripheral sounds (Santangelo,
Belardinelli, & Spence, 2007), in this study, the task­irrelevant sounds were not filtered out but, rather, still affected the demanding processing of the central stream.
However, unlike the cross-modal AB effect (Arnell &
Larson, 2002), we demonstrated that the presence of the
sound enhanced, rather than suppressed, visual perception. Indeed, there is mounting evidence showing sound
facilitatory effects in visual detection (e.g., Bolognini,
Frassinetti, Serino, & Làdavas, 2005; McDonald, TederSälejärvi, & Hillyard, 2000) and visual filling in (Sheth
& Shimojo, 2004). Nevertheless, these studies adopted
a single LED light or a color patch as the visual target.
Our results thus extend the sound facilitatory effect shown
in the literature in that it can be obtained not only in the
simple display, but also in a rapid and complex visual presentation (i.e., RSVP), which is more similar to our daily
visual world.
Bolognini et al. (2005) reported that sound could enhance the sensitivity of visual perception, whereas Park
and Kanwisher (1994) found that C2 sensitivity was reduced when it suffered from RB. Combining these two
findings, we suggest that the effect of added sounds in the
present study was to boost the lowered sensitivity to the
C2, making it more likely to be immune from RB. These
results imply that multisensory stimuli have a better opportunity to gain visual awareness (Kanwisher, 2001) than
do unisensory ones. Our results may thus have important
implications for the improvement of deficits in the perceiving of sequential events for right-parietal-damaged
patients (Battelli, Pascual-Leone, & Cavanagh, 2007).
The processing of visual items in an RSVP stream can
be understood within the type–token binding framework
(e.g., Bowman & Wyble, 2007; Chun, 1997; Kanwisher,
1987), which suggests that each visual event activates
two kinds of representations: type and token. A type is a
long-term representation used for recognition, whereas a
token is an episodic representation used to specify spatial
and temporal information. To successfully report a visual
event, the activated type needs to link to its token, a process called token individuation. Kanwisher (1987) suggested that RB results from the failure to individuate the
second token of the same activated type, due to the difficulty of binding one type with two tokens within a short
interval. This hypothesis has gained much support from
43
behavioral (e.g., Chun, 1997; Morris & Harris, 1999; Park
& Kanwisher, 1994) and event-related potential (Schendan, Kanwisher, & Kutas, 1997) studies. Attention is required either to facilitate the activation of the type node
(Bowman & Wyble, 2007) or to serve as a supplementary
component for type–token binding (Chun, 1997).
The cross-modal facilitatory effect in RB we found
here can be explained as follows. The presentation of the
sounds may covertly orient participants’ attention to focus
on the upcoming visual items (McDonald et al., 2000),
or it may boost the alertness of participants (Robertson,
Mattingley, Rorden, & Driver, 1998); namely, sounds can
modulate visual attention in the temporal domain. Another
possibility is that sounds help temporal segregation of the
RSVP stream, since they are dominant in the temporal
domain; not only does audition have its advantage in the
representations of temporal structure (Guttman, Gilroy, &
Blake, 2005; Welch, DuttonHurt, & Warren, 1986), but
also the discontinuous nature of sounds makes them more
dominant than visual streams (Shimojo & Shams, 2001).
C1 and C2 can be more distinguishable and, thus, suffer
less RB if they are segregated into two groups, rather than
appearing within the same group in the spatial domain
(Chun & Cavanagh, 1997; Kanwisher & Potter, 1989), so
that they are segregated in the temporal domain (MoreinZamir, Soto-Faraco, & Kingstone, 2003). In this case, the
temporal structure of the visual stream and the sounds
should be coordinated, which may occur after the stimuli
within each modality have been fully processed. A further
possibility is visual–auditory integration’s occurring at an
early perceptual stage. The different sounds adopted in
this study may have integrated with the repeated characters and modified their similarity, which is analogous to
the demonstration of the reduction of RB by making C1
and C2 more visually distinct (e.g., Bavelier, 1994; Chun,
1997). Although the temporal coincident rule with a particular time window of cross-modal integration (Stein &
Meredith, 1993) may account not only for the onset condition, but also for the 86-msec and offset conditions, this
integration account, however, presupposes a close connection between S2 and C2. Yet there is no a priori reason why
this should be the case.
Previous studies have shown that sound can improve
attentional deficits for spatial neglect patients through a
nonspecific alertness effect and a spatially specific crossmodal integration effect (Frassinetti, Pavani, & Làdavas,
2002; Van Vleet & Robertson, 2006). Similarly, all of the
above-mentioned mechanisms may be involved in the
cross-modal facilitatory effect in RB, but each may favor
a certain time window. For example, orienting requires
time to develop (Nakayama & Mackeben, 1989), so the
attentional boost account may best explain the 86-msec
condition. On the other hand, temporal coincidence is critical for the cross-modal integration account (Bolognini
et al., 2005). To tease these possible mechanisms apart,
however, manipulations of factors other than the temporal
relationship between sounds and visual items are required,
especially for the cross-modal integration account. For
example, the effect we observed here is enhancement,
rather than a new representation, as in the McGurk ef-
408 Chen and Yeh
fect in speech perception (McGurk & MacDonald, 1976).
Systematic investigations about whether pitch difference
of sounds is critical may help to clarify these issues.
Nevertheless, regardless of mechanism, we have shown
in this study that RB can be reduced by adding task­irrelevant sounds in RSVP. Multisensory stimuli, as compared with unisensory stimuli, have a better chance of
breaking through visual awareness and making invisible
stimuli visible. Hearing a sound accompanying the “fasten
seatbelt” sign may, indeed, serve as a better warning signal
than does seeing the visual notice alone.
Final Gen Dis (7)
Compare with first paragraph
.
Author Note
This research was supported by Grants NSC93-2413-H-002-017 and
NSC94-2752-H-002-008-PAE from the National Science Council of
Taiwan. We thank Catherine Caldwell-Harris, Anne Hillstrom, Steven
Most, Shinsuke Shimojo, and Charles Spence for their helpful comments on earlier drafts of the manuscript. Correspondence concerning
this article should be addressed to S.-L. Yeh, Department of Psychology,
National Taiwan University, No. 1, Sec. 4, Roosevelt Rd., Taipei, Taiwan,
106 (e-mail: [email protected]).
References
Arnell, K. M., & Larson, J. M. (2002). Cross-modality attentional
blinks without preparatory task-set switching. Psychonomic Bulletin
& Review, 9, 497-506.
Battelli, L., Pascual-Leone, A., & Cavanagh, P. (2007). The “when”
pathway of the right parietal lobe. Trends in Cognitive Sciences, 11,
204-210.
Bavelier, D. (1994). Repetition blindness between visually different
items: The case of pictures and words. Cognition, 51, 199-236.
Bolognini, N., Frassinetti, F., Serino, A., & Làdavas, E.
(2005). “Acoustical vision” of below threshold stimuli: Interaction
among spatially converging audiovisual inputs. Experimental Brain
Research, 160, 273-282.
Bowman, H., & Wyble, B. (2007). The simultaneous type, serial token
model of temporal attention and working memory. Psychological Review, 114, 38-70.
Chun, M. M. (1997). Types and tokens in visual processing: A double
dissociation between the attentional blink and repetition blindness.
Journal of Experimental Psychology: Human Perception & Performance, 23, 738-755.
Chun, M. M., & Cavanagh, P. (1997). Seeing two as one: Linking
apparent motion and repetition blindness. Psychological Science, 8,
74-79.
Duncan, J., Martens, S., & Ward, R. (1997). Restricted attentional
capacity within but not between sensory modalities. Nature, 387,
808-810.
Frassinetti, F., Pavani, F., & Làdavas, E. (2002). Acoustical vision
of neglected stimuli: Interaction among spatially converging audiovisual inputs in neglect patients. Journal of Cognitive Neuroscience,
14, 62-69.
Guttman, S. E., Gilroy, L. A., & Blake, R. (2005). Hearing what the
eyes see. Psychological Science, 16, 229-235.
Harris, C. L., & Morris, A. L. (2001). Identity and similarity in repetition blindness: No cross-over interaction. Cognition, 81, 1-40.
Kanwisher, N. (1987). Repetition blindness: Type recognition without
token individuation. Cognition, 27, 117-143.
Kanwisher, N. (2001). Neural events and perceptual awareness. Cognition, 79, 89-113.
Kanwisher, N., Driver, J., & Machado, L. (1995). Spatial repetition
blindness is modulated by selective attention to color or shape. Cognitive Psychology, 29, 303-337.
Kanwisher, N., & Potter, M. C. (1989). Repetition blindness: The
effects of stimulus modality and spatial displacement. Memory &
Cognition, 17, 117-124.
McDonald, J. J., Teder-Sälejärvi, W. A., & Hillyard, S. A. (2000).
Involuntary orienting to sound improves visual perception. Nature,
407, 906-908.
McGurk, H., & MacDonald, J. (1976). Hearing lips and seeing voices.
Nature, 264, 746-748.
Miller, M. D., & MacKay, D. G. (1994). Repetition deafness: Repeated words in computer-compressed speech are difficult to encode
and recall. Psychological Science, 5, 47-51.
Morein-Zamir, S., Soto-Faraco, S., & Kingstone, A. (2003). Auditory capture of vision: Examining temporal ventriloquism. Cognitive
Brain Research, 17, 154-163.
Morris, A. L., & Harris, C. L. (1999). A sublexical locus for repetition
blindness: Evidence from illusory words. Journal of Experimental
Psychology: Human Perception & Performance, 25, 1060-1075.
Nakayama, K., & Mackeben, M. (1989). Sustained and transient components of focal attention. Vision Research, 29, 1631-1647.
Park, J., & Kanwisher, N. (1994). Determinants of repetition blindness. Journal of Experimental Psychology: Human Perception & Performance, 20, 500-519.
Robertson, I. H., Mattingley, J. B., Rorden, C., & Driver, J. (1998).
Phasic alerting of neglect patients overcomes their spatial deficit in
visual awareness. Nature, 395, 169-172.
Santangelo, V., Belardinelli, M. O., & Spence, C. (2007). The
suppression of reflexive visual and auditory orienting when attention
is otherwise engaged. Journal of Experimental Psychology: Human
Perception & Performance, 33, 137-148.
Schendan, H. E., Kanwisher, N. G., & Kutas, M. (1997). Early brain
potentials link repetition blindness, priming and novelty detection.
NeuroReport, 8, 1943-1948.
Sekuler, R., Sekuler, A. B., & Lau, R. (1997). Sound alters visual
motion perception. Nature, 385, 308.
Sheth, B. R., & Shimojo, S. (2004). Sound-aided recovery from and
persistence against visual filling-in. Vision Research, 44, 1907-1917.
Shimojo, S., & Shams, L. (2001). Sensory modalities are not separate
modalities: Plasticity and interactions. Current Opinion in Neurobiology, 11, 505-509.
Shimojo, S., Watanabe, K., & Scheier, C. (2001). The resolution
of ambiguous motion: Attentional modulation and development. In
J. Braun, C. Koch, & J. L. Davis (Eds.), Visual attention and control
circuits (pp. 243-264). Cambridge, MA: MIT Press.
Soto-Faraco, S. (2000). An auditory repetition deficit under low memory load. Journal of Experimental Psychology: Human Perception &
Performance, 26, 264-278.
Soto-Faraco, S., & Spence, C. (2002). Modality-specific auditory and
visual temporal processing of deficits. Quarterly Journal of Experimental Psychology, 55A, 23-40.
Stein, B. E., & Meredith, M. A. (1993). The merging of the senses.
Cambridge, MA: MIT Press.
Thalheimer, W., & Cook, S. (2002). How to calculate effect sizes from
published research articles: A simplified methodology. Retrieved
May 22, 2006 from work-learning.com/effect_sizes.htm.
Tsai, C. H. (1996). Frequency and stroke counts of Chinese characters.
Retrieved August 31, 1999, from technology.chtsai.org/charfreq/.
Van Vleet, T. M., & Robertson, L. C. (2006). Cross-modal interactions
in time and space: Auditory influence on visual attention in hemispatial neglect. Journal of Cognitive Neuroscience, 18, 1368-1379.
Welch, R. B., DuttonHurt, L. D., & Warren, D. H. (1986). Contributions of audition and vision to temporal rate perception. Perception
& Psychophysics, 39, 294-300.
Yeh, S. L., & Li, J. L. (2004). Sublexical processing in visual recognition of Chinese characters: Evidence from repetition blindness for
subcharacter components. Brain & Language, 88, 47-53.
Notes
1. When two repeated sounds were presented within a short interval,
the performance of the second sound either was facilitated (Soto-Faraco &
Spence, 2002) or deteriorated (Soto-Faraco, 2000), as compared with unrepeated sounds. Hence, we avoided using repeated sounds in our design.
2. The effect size was calculated by using Cohen’s d (Thalheimer &
Cook, 2002).
(Manuscript received June 14, 2006;
revision accepted for publication October 20, 2007.)
44
PSYCHOLOGICAL SCIENCE
Research Article
OUR EYES DO NOT ALWAYS GO WHERE
WE WANT THEM TO GO:
Capture of the Eyes by New Objects
Jan Theeuwes,1 Arthur F. Kramer,2 Sowon Hahn,2 and David E. Irwin2
1
TNO Human Factors Research Institute, Soesterberg, The Netherlands, and 2Beckman Institute and Department of Psychology,
University of Illinois at Urbana-Champaign
Abstract—Observers make rapid eye movements to examine the
world around them. Before an eye movement is made, attention is
covertly shifted to the location of the object of interest. The eyes typically will land at the position at which attention is directed. Here we
report that a goal-directed eye movement toward a uniquely colored
object is disrupted by the appearance of a new but task-irrelevant
object, unless subjects have a sufficient amount of time to focus their
attention on the location of the target prior to the appearance of the
new object. In many instances, the eyes started moving toward the new
object before gaze started to shift to the color-singleton target. The
eyes often landed for a very short period of time (25–150 ms) near the
new object. The results suggest parallel programming of two saccades: one voluntary, goal-directed eye movement toward the colorsingleton target and one stimulus-driven eye movement reflexively
elicited by the appearance of the new object. Neuroanatomical structures responsible for parallel programming of saccades are discussed.
It is well-known that the visual system is sensitive to events that
exhibit sudden change (Breitmeyer & Ganz, 1976). When a person
is engaged in a particular task (e.g., reading a book), something
might happen in the environment (e.g., someone switches on a desk
lamp at a nearby library table) that immediately captures the person’s attention even when he or she had no particular intention to
look for such events. Directing attention to these events may merely seem to distract the person from the intended task. It has been
argued, though, that detecting these sudden changes in the environment is significant for behaving organisms because these events
may require immediate identification and action (Todd & Van
Gelder, 1979; Yantis, in press).
A number of studies have demonstrated that an abrupt-onset object
(an object presented with a transient luminance change) captures
attention automatically (Jonides, 1981; Müller & Rabbitt, 1989;
Theeuwes, 1990; Yantis & Jonides, 1984). These studies show that
even when observers have no intention to look for an onset, an abrupt
onset among other nononset elements is processed first. On the basis
of these findings, it has been argued that sudden luminance changes,
such as the appearance of a new object in a scene, capture attention in
a purely stimulus-driven fashion (note that for attentional capture to
occur, it is not necessary that the new object has a luminance increment; see Yantis & Hillstrom, 1994).
Address correspondence to Jan Theeuwes, TNO Human Factors Research
Institute, P.O. Box 23, 3769 ZG Soesterberg, The Netherlands, e-mail:
[email protected], or Art Kramer, Beckman Institute, University of Illinois,
405 N. Mathews Ave., Urbana, IL 61801, e-mail: [email protected].
Even though there is a large body of evidence which suggests that
the appearance of a new object may capture attention, it is largely
unknown whether such an event also triggers a subsequent eye movement. There is, however, ample evidence to suggest that there is a
close relationship between the oculomotor and attentional systems.
Studies have shown that before a voluntary eye movement is made,
attention is covertly shifted to the location of the object of interest
(e.g., Hoffman & Subramaniam, 1995; Kowler, Anderson, Dosher, &
Blaser, 1995; Shephard, Findlay, & Hockey, 1986). The eye typically
will land at the position at which attention is directed (Deubel &
Schneider, 1996).
EXPERIMENT 1
In the present study, we used a visual search task in which
observers were required to make a voluntary, goal-directed saccade
to a color-singleton target. In half of the trials, simultaneously with
the presentation of the color-singleton target, a new object presented
with an abrupt onset appeared somewhere in the display. In contrast
to previous studies investigating the effect of onsets on attentional
deployment (e.g., Yantis & Jonides, 1984), in the current study, the
onset was never relevant for the task. The question addressed was
whether the appearance of a new yet irrelevant object would disrupt
the planning and execution of the goal-directed saccade toward the
singleton target. Intro Exp (1): Write main purpose
Method
Method for Exp (1): Write the details so others can replicate
Observers viewed displays containing six gray circles (3.7° in
diameter), each containing a small gray figure-eight premask (0.4° ×
0.2°), equally spaced around an imaginary circle with a radius of
12.6°. After 1,000 ms, all circles except one changed to red, and at
the same time, the small premasks inside the circles changed to letters. Observers were instructed that as soon as the colors of the circles changed, they were to quickly and accurately make a saccade
directly to the only gray circle left (a color singleton) and to determine whether the letter inside the gray circle was a c (to which they
responded by pressing a button with their left hand) or a reverse c (to
which they responded by pressing a button with their right hand).
The letters inside the red circles were distractor letters randomly
sampled without replacement from the set S, E, H, P, F, and U. A
pilot study was conducted to ensure that accurate target identification could be achieved only when the letter was fixated. Eye movements were recorded by means of an Eye Link tracker (250-Hz
temporal resolution and 0.2° spatial resolution) from 8 naive
observers each performing 64 practice and 256 experimental trials.
Copyright © 1998 American Psychological Society
VOL. 9, NO. 5, SEPTEMBER 1998
45
379
PSYCHOLOGICAL SCIENCE
Capture of the Eyes by New Objects
Fig. 1. Graphic illustration of the displays and the temporal sequence of an experimental trial (from left to right). Note that the target is defined
simultaneously with the appearance of the onset distractor. The gray circles are indicated by the dashed lines. The red circles are indicated by
the solid lines.
Res Exp (1): Statistics should be stated as supplement
On half of the trials, at one of four possible locations in the visual
field, an additional red circle was added to the display at the same
time that the color singleton was revealed (see Fig. 1 for examples of
the stimulus materials).1
Results and Discussion
Approximately 7.4% of the trials were discarded because
observers were not fixating the fixation dot at the beginning of the
trial (anticipation saccades). Those trials on which observers made
errors (i.e., pressed the wrong response key) were also excluded
(4.4%). Because the error rate was so low, error scores were not analyzed further.
Figure 2 presents the eye movement behavior of one representative
observer. The results displayed are sample points (every 4 ms) of the
path of the first saccade (additional saccades are not shown). When no
new object was added to the display (control condition), saccades generally went directly to the color-singleton target (Fig. 2a). In those trials in which a new object was added to the display, however, the eyes
1. It might be argued that the initial capture of attention by the new object
occurred not because of the sudden appearance of the new object but instead
because the new object appeared at the center of a cluster, or group, of three
closely spaced objects (see, e.g., the location of the new object in Figs. 2b, 2c,
and 2d). We tested this hypothesis in a control study by including an additional circle in the display in the same positions in which the new object appeared
in this study. However, in this control study, the extra circle appeared at the
beginning of each trial along with all of the other circles. In this case, observers
did not start moving their eyes in the direction of this additional circle. Therefore, it was the appearance of a new object and not a cluster of three closely
spaced objects that disrupted saccades to the singleton target.
findings
often went in the direction of the new object, stopped briefly, and then
went on to the target (Figs. 2b, 2c, and 2d).
The appearance of the new object (i.e., the abrupt onset) affected
not only eye movement behavior, but also reaction time (RT) to identify the letter inside the color singleton (RT of 785 ms without the new
object vs. 840 ms with the new object, F[1, 7] = 22.08, p < .01). The
sudden appearance of the new object did not affect the time it took the
eye to start moving from the centrally located fixation point (saccade
latency of 238 ms without the new object vs. 230 ms with the new
object); rather, the increase in manual RT was caused by the need for
an additional saccade to be made from the irrelevant onset stimulus to
the location of the color-singleton target on many trials.
Figure 3 summarizes the effect of the appearance of a new but
task-irrelevant object on the scan path of the eye. The maximum
angular deviation of the eye from a linear path between the fixation
and the target object was calculated over observers. As is clear from
Figure 3a, in the control condition, in which no new object was
added to the display, the eyes tended to go directly to the colorsingleton target. In contrast, Figures 3b, 3c, and 3d show that when
an irrelevant new object was presented, in many instances the eye
went in the direction of the new object. Even when the new object
appeared at a location opposite the singleton target (at 150°), the eye
tended to start moving in the direction of the new object. Overall, the
results indicate that in about half of the trials, the appearance of a
new object disrupted the planning and execution of the goal-directed
saccade toward the singleton target.2 Irrespective of where the new
2. Two different 90° target-distractor separations were used in the study:
The onset distractor could appear 90° away from the target in the same or in the
different hemifield. Given that there was no difference in the eye movement
and performance patterns for these two 90°-separation conditions, they were
collapsed for the purpose of presentation.
380
VOL. 9, NO. 5, SEPTEMBER 1998
46
to main
PSYCHOLOGICAL SCIENCE
J. Theeuwes et al.
A
control (no new object)
B
new object appearing at 30° from the target
C
new object appearing at 90° from the target
object appeared, the eyes tended to move in a direction that corresponded to the location of the new object.
When the eye started to move in the direction of the new object,
in most cases the eye stopped for a brief period before it went on to
the singleton target. Figure 4 shows the fixation durations after the
first saccade for those saccades that went in the direction of the new
object. For this analysis, we used only the 90° and 150° targetdistractor separations because these separations made it easy to discriminate saccades toward the new object from those toward the
target. Note that about 87% of the fixation durations were less than
150 ms, even though a complete change in the direction of the eye
movement was required to redirect the eyes in the direction of the
target.
We also explicitly asked observers after the experiment whether
they were aware that the appearance of the new object affected their
eye movements. Observers indicated that they were sure that their eye
movements were not affected by the appearance of the new object.
On the basis of these results, we conclude that the appearance of a
new object interferes with the planning and execution of a goaldirected eye movement to the color singleton. Earlier research has
demonstrated that the appearance of a new object involuntarily captures attention (Theeuwes, 1994; Yantis & Jonides, 1984). The current
results indicate that a new object not only captures attention, it also
captures the eyes. Dis Exp (1): State main finding
EXPERIMENT 2
Dis Exp (2): Mention posssible confounding in Exp 1...
D
new object appearing at 150° from the target
Fig. 2. Initial tracks that the eyes took as they left the fixation point
until the first fixation near one of the colored circles. Eye position was
digitized at 250 Hz. Thus, the points in the figure represent data points
acquired every 4 ms during the initial eye movement. Eye movement
behavior of an observer is shown for the control condition (a), in
which no new object was presented, and for three onset conditions:
when a new object was presented close to the singleton target (at 30°
of arc, corresponding to a distance of 6.4° of visual angle) (b), when a
new object was presented farther away from the singleton target (at
90° of arc, corresponding to a distance of 19.4° of visual angle) (c),
and when a new object was presented at the opposite side of the visual field (at 150° of arc, corresponding to a distance of 25.4° of visual
angle) (d). The results are collapsed and normalized with respect to the
position of the target singleton (marked here with a double circle) and
the position of the new object.
The results of the first study suggest that both the goal-directed
allocation of attention and the movement of the eyes to a clearly
defined target can be disrupted by the appearance of a new but taskirrelevant object in the visual field, even when this object appears quite
distant from the target. However, the data that we have reported thus
far do not define the boundary conditions for this capture effect on
attention and eye movements. ... to motivate Exp 2
Previous studies of attentional capture have reported that the
appearance of a new object does not influence performance when
observers have a sufficient amount of time to attend to the location of
a target prior to the appearance of the new object. For example, Yantis
and Jonides (1990; see also Theeuwes, 1991) found that as long as a
central arrow cue indicated the position of a target at least 200 ms
before the appearance of an onset elsewhere in the visual field, the
onset had no influence on the time it took subjects to identify the target letter.
In the present study, we examined whether precuing the location of
the color-singleton target would eliminate or reduce the influence of
the task-irrelevant onset on performance and eye movements. Given
the previous literature, which suggested that attention is often shifted
to an area of interest in advance of an eye movement (Deubel &
Schneider, 1996; Hoffman & Subramaniam, 1995), it was our expectation that providing subjects with sufficient time to attend to the location of the subsequent target and to program a goal-directed eye
movement to this location would preclude the misdirection of the eyes
to the task-irrelevant object.
Intro Exp (1): Write main purpose of this Exp &
Method provide prediction
The stimuli and procedures were equivalent to those of Experiment
1 with the following exceptions. A centrally located arrow cue, which
381
VOL. 9, NO. 5, SEPTEMBER 1998
47
PSYCHOLOGICAL SCIENCE
J. Theeuwes et al.
Open Gen Dis (1): State research goal & summarize main
GENERAL DISCUSSION
findings
Fig. 4. Fixation durations after the first saccade for those saccades
that went in the direction of the new object in Experiment 1. The
results displayed here are from the conditions in which the new object
appeared at 90° and 150° from the target and the eye started to move
in the direction of the new object.
Open Gen Dis (2): Combine the
purpose & findings of each Exp
Results and Discussion
The goal of the observers in our task was to make a saccade toward
the color-singleton target. In order to execute such a saccade, attention
first needs to be shifted to the location of the color singleton (Deubel
& Schneider, 1996; Kowler et al., 1995). Furthermore, we assume that
the shift of attention to the location initiated the programming of a saccade to the attended location (e.g., Deubel & Schneider, 1996; Morrison, 1984).3 In Experiment 1, simultaneously with the preparation of
a goal-directed saccade to the color singleton, a new object appeared
in the display, causing attention to be reflexively drawn to the location
of the new object. We assume that the reflexive shift of attention to the
new object also initiated the programming of a saccade, but to the
location of the new object. In fact, the results suggest that there is parallel programming of two saccades, one to the singleton target and one
to the new onset object. Depending on which eye movement program
is ready first, the eyes will start moving in the direction of the onset or
in the direction of the color singleton. If the eyes start moving in the
direction of the onset, as soon as the program for a saccade to the
color-singleton target is ready, inhibition of the reflexive saccade may
occur, causing the eyes to stop somewhere along the trajectory
between the initial center fixation point and the onset. Because the eye
movement program toward the color-singleton target has been prepared, after the eyes stop for a brief period of time, they will move to
the color singleton.4 This interpretation is in line with the very short
fixation durations that we found after the first saccade (see Fig. 4). In
fact, the majority of the fixations were too short to allow the programming of a completely new eye movement, which typically takes 150 to
200 ms (e.g., Becker, 1991; Findlay, 1997; Salthouse & Ellis, 1980).
Overall, these results suggest parallel programming of two saccades (see also McPeek, Skavenski, & Nakayama, 1996): one voluntary, goal-directed eye movement toward the color-singleton target and
one stimulus-driven eye movement reflexively elicited by the appearance of the new object. The parallel programming of two saccades has
been reported before in reading (Henderson & Ferreira, 1990; Morrison, 1984; Reichle, Pollatsek, Fisher, & Rayner, 1998): A reader may
fixate a particular word for a very brief period and immediately move
on to the next word. Parallel programming of two saccades has also
been reported in the double-step paradigm (Becker & Jurgens, 1979),
in which the position of a target changes just prior to a saccade.
Interestingly, shifting attention to the location of the target in
advance of the appearance of the task-irrelevant onset effectively eliminated the misdirection of the eyes to the onset and the performance disruption observed in Experiment 1. Within the theoretical framework
just described, the elimination of the effect of the onset would likely
have resulted from the temporal advantage provided to the programming of the goal-directed saccade by the location precue. However, it
is also conceivable that performance and eye movements (to the target)
The main question addressed in this study was whether cuing the
location of the color-singleton target would eliminate the influence of
the task-irrelevant distractor on target identification and eye movements. Because there were no significant differences between results
for the three SOAs, these results were combined for the analyses
reported here. Figure 5 summarizes the effect of the appearance of the
new but task-irrelevant object on the scan path of the eyes. The maximum angular deviation of the eyes from a linear path between fixation
and the target object was calculated and collapsed across the 7
observers.
As can be seen in the figure, the distribution of trials is quite similar regardless of whether an onset distractor was present (bottom two
panels) or absent (top two panels). These data suggest that onset distractors have little influence on the trajectory of eye movements when
subjects are able to shift their attention to the target location in
advance of the appearance of the onset. This observation was quantified by calculating the number of trials on which the eyes initially
went in the direction of the onset (onset trials) or additional circle
(control trials) and submitting these data to a two-way analysis of variance with target-distractor separation (90° vs. 150°) and condition
(onset vs. control trials) as factors. Neither the main effects nor the
interaction was statistically significant (ps > .40). The average proportion of onset trials on which the eyes initially went toward the onset
circle was 2.3%; in comparison, on 1.7% of the control trials, the eyes
initially went toward the nononset circle that appeared at a comparable spatial position. Res Exp (2): Condense statistics into one sentence
An analysis of the RT data presents a similar picture. The RTs on
the onset and control trials, collapsed across the three SOAs, were 617
3. We are not suggesting that the execution of the saccade necessarily foland 607 ms, respectively. These RTs were not significantly different lowed saccade programming, only that the allocation of attention initiated saccade programming (Hoffman, in press; Klein & Pontefract, 1994). Indeed, on
(p > .30).
In summary, the results of the present study establish an important those trials on which the preparation of the goal-directed saccade was comboundary condition on the attentional and oculomotor capture effects pleted first, the reflexive saccade to the onset was prepared but not executed.
4. On many trials on which the eyes initially moved toward the onset, they
observed in Experiment 1. Capture effects by the appearance of a taskdid not reach its location. Indeed, on average, the saccades toward the onset
irrelevant onset can be overcome when observers have sufficient time traversed only 70% of the distance between fixation and the onset before stopto attend and program an eye movement to the location of a subse- ping for a brief period of time and then moving to the location of the colorquent target stimulus. Dis Exp (1): State main finding
singleton target.
383
VOL. 9, NO. 5, SEPTEMBER 1998
49
P. Cavanagh et al. / Cognition 80 (2001) 47±60
47
COGNITION
Cognition 80 (2001) 47±60
www.elsevier.com/locate/cognit
Attention-based visual routines: sprites
Patrick Cavanagh a,*, Angela T. Labianca a, Ian M. Thornton b,1
a
Vision Sciences Laboratory, Department of Psychology, Harvard University, 33 Kirkland Street,
Cambridge, MA 02138, USA
b
Cambridge Basic Research, Nissan Research and Development Inc., Cambridge, MA 02142, USA
Received 12 October 1999; accepted 17 November 2000
Abstract
A central role of visual attention is to generate object descriptions that are not available
from early vision. Simple examples are counting elements in a display or deciding whether a
dot is inside or outside a closed contour (Ullman, Cognition 18 (1984) 97). We are interested
in the high-level descriptions of dynamic patterns ± the motions that characterize familiar
objects undergoing stereotypical action ± such as a pencil bouncing on a table top, a butter¯y
in ¯ight, or a closing door. We examine whether the perception of these action patterns is
mediated by attention as a high-level animation or `sprite'. We have studied the discrimination of displays made up of simple, rigidly linked sets of points in motion: either pairs of
points in orbiting motion or 11 points in biological motion mimicking human walking. We
®nd that discrimination of even the simplest dynamic patterns demands attention. q 2001
Elsevier Science B.V. All rights reserved.
Keywords: Attention; Tracking; Motion; Visual search
1. Introduction
General & familiar opening and connect with last paragraph
Open Gen Intro (1) Recognizing an item can call on much more than just an analysis of its static form.
Something that moves on a street and makes motor sounds is probably a car or a
truck (or maybe a 3-year-old boy). But in addition to characteristic sounds or properties, many objects have characteristic patterns of movement, revealed only over
* Corresponding author. Fax: 11-617-495-3764.
E-mail address: [email protected] (P. Cavanagh).
1
Present address: Max-Planck-Institut fuer Biologische Kybernetik, Spemannstrasse 38, D-72076
Tuebingen, Germany.
0010-0277/01/$ - see front matter q 2001 Elsevier Science B.V. All rights reserved.
PII: S 0010-027 7(00)00153-0
52
48
P. Cavanagh et al. / Cognition 80 (2001) 47±60
some duration of time. Rubber balls bounce on ¯oors, billiard balls bounce less,
butter¯ies dance up and down, Frisbees ¯y straight, a pencil bouncing off the ¯oor
takes an end over end tumble, and doors swing slowly through an arc when opening
and bang up against their endpoints. Many of these characteristic patterns of motion
are so familiar as to be suf®cient for recognition of the object. The case of biological
motion is perhaps the strongest evidence for this. A human form is easily recognized
from the motions of a set of lights attached to a person ®lmed while walking in the
dark (Johansson, 1973; Neri, Morrone, & Burr, 1998).
How do we accomplish this seemingly effortless recognition of motion patterns?
We are not aware of analyzing components of the motion and coming to intermediate decisions. The human walking just seems to pop out of the display. Johansson
(1973) proposed that the analysis relied on an automatic and spontaneous extraction
of mathematically lawful spatiotemporal relations. But is this act of recognition
really effortless? And what about the continued perception of the motion, an analysis
which continually adapts our impression of the walker's posture and progress to the
moving points in the display. Can that also be as effortless as it seems?
W1(1) We propose that these characteristic motions are analyzed and interpreted by a
special set of operators that we will call `sprites'. In this paper, we will only address
the attentional demands of these operators but we will nevertheless sketch our view
of their properties and the role they play. We consider a sprite to be the set of
routines that is responsible for detecting the presence of a speci®c characteristic
motion in the input array, for modeling or animating the object's changing con®guration as it makes this stereotypical motion, and for ®lling in the predictable details
of the motion over time and in the face of noisy or absent image details. Each
different characteristic motion pattern would have its own `sprite' that would be
built up over many exposures to the pattern. These stored recognition and animation
routines then allow sparse inputs to support rich dynamic percepts. Many others
have stressed that regularities in the world can be captured by ef®cient, higher-order
data structures such as chunks (Miller, 1956), schemata (Bartlett, 1932; Neisser,
1967), frames or scripts (Minsky, 1975; Schank & Abelson, 1977). Characteristic
patterns of motion ought to lend themselves quite well to similarly ef®cient representations.
Separate instances of a characteristic motion are seldom exact repetitions,
however. The path of a bouncing pencil can be quite chaotic, depending strongly
on many factors (starting position, rotation, surface properties, etc.). The regularities
of a bouncing pencil, or a butter¯y's ¯ight, or a walking human, lie at a higher level
of description of the motion. Signi®cant analysis of the motion pattern must precede
any recognition of the regularity and signi®cant computation is then required to use
the knowledge of the regularity to predict or animate subsequent motions.
This procedural aspect of a sprite is closely related to the concept of `visual
routines' addressed by Ullman (1984). These routines act on the representations
emerging from the initial stage of visual analysis to establish properties and relations
that are not explicitly represented in the ®rst stage. Ullman identi®ed elemental
processes such as counting, indexing, tracking, and region-®lling which could be
53
P. Cavanagh et al. / Cognition 80 (2001) 47±60
49
organized (compiled) into visual routines to perform a high-level task such as, for
example, judging whether a point is inside or outside a complex closed curve.
Importantly, in the original work by Ullman (1984), he assigned the critical
operation of shifting the focus of analysis to attentional processes so that his visual
routines were exclusively attention-based. In our case, abstracting the high-level
description of, say, a bouncing pencil or a point-light walker certainly calls on an
analysis of similar or greater complexity than the spatial tasks that Ullman (1984)
W1(2) described. Nevertheless, whether or not these high-level motion descriptors ± sprites
± require attention remains an open question. It is the central question that we
address in this paper.
In the two experiments reported below, we use visual search tasks to examine the
attentional load required to perceive a dynamic motion pattern. The displays present
one to four motion patterns and the subjects report the presence or absence of a
target motion. The relation between reaction time and number of items in the display
allows us to evaluate any increase in attentional load with each additional item. If,
for example, point-light walkers are recognized effortlessly, then there should be no
increase in reaction time as the number of walkers in the display increases from one
to four. Final Gen Intro (8,9): Write overview & prediction of both experiments
Our ®rst experiment examines simple con®gurations of two moving dots.
Although these patterns of motion are relatively simple they are not highly familiar.
If attention is required to discriminate between con®gurations, it may be because we
do not have highly ef®cient routines, sprites, to handle them. Our second experiment
examines highly familiar con®gurations involving human motion. The movements
in these stimuli are more complex but extremely familiar. If any dynamic patterns
can be discriminated without attentional load, we believe it should be these patterns.
2. Experiment 1
This is a special case where the method for an experiment is given
in detail in Intro.
Intro Exp (3) In the ®rst experiment, observers had to discriminate between two different orbital
Give only when
necessary
motions. In each stimulus, two lights rotate around each other while moving around
a central ®xation point. We will describe the two stimuli brie¯y before examining
the similarities between our displays and the classic wheel-generated motions
studied in many previous articles (Duncker, 1937; Johansson, 1973; Prof®tt, Cutting,
& Stier, 1979; Wallach, 1965).
In our ®rst stimulus (Fig. 1a), the motion is like that of a moon orbiting a planet
where the planet itself orbits a central `star' (the ®xation point). The moon traces out
a complex curve (of the cycloid family) around the central point whereas the planet
traces a circle. If the smoothly moving light (the `planet') is turned off, the complex
nature of the moon's motion is immediately evident as a looping path of wildly
varying velocity. With the smoothly moving `planet' turned on, however, the erratic
motion is no longer apparent as the ®rst light is now seen to rotate smoothly at
constant velocity around the `planet'.
In the second stimulus (Fig. 1b), the two lights rotate around a common center and
this central point rotates around the ®xation point. In this case, both lights trace
54
52
End of special case
P. Cavanagh et al. / Cognition 80 (2001) 47±60
one, two, three, or four of the dot pairs moving in the same direction at the same rate
around the ®xation. On a target present trial, one of the pairs is following the target
motion, say, tumbling, whereas the others are following the alternative motion,
orbiting in this case. We recorded the reaction time to respond that the target was
present or absent and analyzed the function relating the reaction time and the
number of dot pairs present. If the reaction time did not increase with the number
of dot pairs present, we would conclude that the extraction of the characteristic
motions calls only minimally on central, attentional resources.
As a control, we also measured the reaction time with only one of the dots present
in each pair. The target would then be the smoothly moving `planet' among singleton tumbling dots or one singleton tumbling dot among smoothly moving planets.
This gives a measure of the distinctiveness of the single dot motions that make up
the characteristic motions of the pairs.
2.1. Methods
2.1.1. Observers
Twelve paid volunteers, with an average age of about 21 years and with normal or
corrected-to-normal vision, participated in this experiment. All participants were
naive to the purpose of the experiment and gave informed written consent before the
experiments, which were approved by the F.A.S. Human Subjects Committee,
Harvard University.
2.1.2. Stimuli
The display was presented on a 14 inch 67 Hz Macintosh display driven by a
Macintosh 7500/100 programmed in Vision Shell. The two motion patterns we used
are shown in Fig. 1. Between one and four of these orbiting or tumbling pairs were
presented rotating around the central ®xation. The distance from the ®xation to the
center of rotation of the pair was 4.38 of visual angle at a ®xed viewing distance of 57
cm. The center±center separation of the two dots was 18 of visual angle. The dots
themselves had a diameter of 0.48 of visual angle. The two dots made one full cycle
around each other every 1.5 s while the pair made a full circuit around ®xation every
7.5 s. The local and global rotations were always in opposite directions and the
direction was set randomly on each trial. The stimuli, when there was more than one,
were spaced evenly around the circular path. The initial position of each local
rotation was set randomly at the beginning of each trial as was the starting location
of the rotation around the circular path. The dots had a luminance of 70 cd/m 2, and
were presented on a 20 cd/m 2 background. The temporal onset and offset of the
motion pairs on each trial was a step function. The ®xation mark was a single static
dot at the center of the display. It was identical in size to the moving dots (0.48 of
visual angle in diameter).
2.1.3. Procedure
In the control conditions, only one dot of each pair was shown. It was always the
smoothly moving dot (the planet) in the orbiting con®guration and either of the dots
57
P. Cavanagh et al. / Cognition 80 (2001) 47±60
Gen Con (1,2): Cover
1st, 3rd, & 4th Ws here
in a short paragraph
59
dif®cult. Motion takes place over time and considerable time appears to be required
to establish the trajectory of each pair. The static versions of the trajectories of our
two motion patterns are much easier to distinguish. The orbiting pattern traces out
something like a dollar sign and the tumbling pair something like a ®gure 8. When
we simply presented these short trajectories as spatial patterns all at once, the
discrimination was much more rapid (slope of less than 100 ms/item, averaged
over four subjects). What makes the real motion case so dif®cult is that the con®guration of dot motions needs to be made explicit, linking each dot to the next and
then to the central dot. This is the only way to discover if one of the dots is actually
maintaining a constant distance from the central dot.
These articulated links of constant length in tumbling and orbiting motion also
form the basis of the structure for the point-light walkers where the light at each joint
is separated from the next by a ®xed distance. However, for the walkers in Experiment 2, the processing time was much faster at about 200 ms per walker (compared
to about 1 s per dot pair). This rate is comparable to some estimates of the dwell time
of attention (Duncan, 1984). This suggests that attention was required to notice the
gait of each walker in turn but that little processing was required once each walker
had been selected. This is evidence that the analysis of a very familiar motion
pattern, despite its complexity, can be very rapid. We suggest that this rapid extraction of the motion pattern is the signature of the `sprite' responsible for recognizing
and animating the percept of a walking human form. Despite this rapid extraction,
our data also show that only one walker at a time can be analyzed. The search rate
was still substantial indicating that the operation of at least the `walker' sprite
requires attention.
The dual task results of Thornton et al. (1999) suggested that, in some cases, the
perception of biological motion can be automatic. However, a search task is perhaps
a more sensitive measure of attentional load because dual task interference only
reveals an interaction between the two tasks if the combined load exceeds the
available capacity. It cannot differentiate between no attentional demands and any
combination that is less than the limit available.
If even familiar patterns of motion require attention to be discriminated, what can
be the advantage of the routines that support the perception of the pattern? Clearly, it
is the same advantage that is offered by any recognition of a familiar pattern. Once
enough of the pattern is acquired to recognize it, the rest can be ®lled in from
memory. Sparse inputs can support rich percepts and in the case of a moving object,
®lling in implies a prediction of likely motions and tracking them with less data than
would be otherwise necessary. These advantages have formed the basis of many
theories of perception from schemata and schema theory (Bartlett, 1932; Neisser,
1967) to frames and scripts (Minsky, 1975; Schank & Abelson, 1977).
To conclude, we suggest that the visual system acquires and uses stored motion
patterns, sprites, which are characteristic of familiar events or objects: the motion of
a wheel, the jump of a ®sh out of water, the way a pencil bounces on the ¯oor when
dropped, and the way a fresh egg does not. We use these stored patterns to recognize
and then animate our perception of familiar events. Our experience of these animation routines might suggest that they are effortless but our study here shows that they
This final paragraph connects with the Introduction
64
60
P. Cavanagh et al. / Cognition 80 (2001) 47±60
are not. We claim that the animations are played out by attentive processes in the
same way that we can animate a mental image. In the visual case, the input image
data act like set points in the progress of the animation but the animation still
requires the support of attentive processes.
Acknowledgements
Thanks to Pawan Sinha for helpful discussions of this work and to Jane Raymond
for sharing her observations on Frisbees and butter¯ies. This research was supported
by grant EY09258 to P.C.
References
Bartlett, S. F. (1932). Remembering: a study in experimental and social psychology. Cambridge:
Cambridge University Press.
Cavanagh, P. (1992). Attention-based motion perception. Science, 257, 1563±1565.
Cutting, J. E. (1978). A program to generate synthetic walkers as dynamic point-light displays. Behavioral
Research Methods and Instrumentation, 10, 91±94.
Duncan, J. (1984). Selective attention and the organization of visual information. Journal of Experimental
Psychology: General, 113, 501±517.
Duncker, K. (1937). Induced motion. In W. D. Ellis (Ed.), A sourcebook of Gestalt psychology. London:
Routledge and Kegan Paul.
Inman, V. T., Ralston, H., & Todd, J. T. (1981). Human walking. Baltimore, MD: Williams & Wilkins.
Johansson, G. (1973). Visual perception of biological motion and a model for its analysis. Perception and
Psychophysics, 14, 201±211.
Miller, G. A. (1956). The magical number seven plus or minus two: some limits on our capacity for
processing information. Psychological Review, 63, 81±97.
Minsky, M. (1975). A framework for representing knowledge. In P. H. Winston (Ed.). The psychology of
computer vision (pp. 211±280). New York: McGraw-Hill.
Neisser, U. (1967). Cognitive psychology. New York: Appleton.
Neri, P., Morrone, M. C., & Burr, D. C. (1998). Seeing biological motion. Nature, 395, 894±896.
Prof®tt, D. R., & Cutting, J. E. (1979). Perceiving the centroid of con®gurations on a rolling wheel.
Perception and Psychophysics, 25, 389±398.
Prof®tt, D. R., Cutting, J. E., & Stier, D. M. (1979). Perception of wheel-generated motions. Journal of
Experimental Psychology: Human Perception and Performance, 5, 289±302.
Schank, R., & Abelson, R. (1977). Scripts, plans, goals and understanding. Hillsdale, NJ: Lawrence
Erlbaum Associates.
Thornton, I. M., Rensink, R. A., & Shiffrar, M. (1999). Biological motion processing without attention.
Perception, 28 (Suppl), 51.
Ullman, S. (1984). Visual routines. Cognition, 18, 97±159.
Wallach, H. (1965). Visual perception of motion. In G. Keyes (Ed.), The nature and the art of motion.
New York: George Braziller.
65
Journal of Experimental Psychology:
Human Perception and Performance
1997, Vol. 23, No. 2, 339-352
Copyright 1997 by the American Psychological Association, Inc.
0096-1523/97/$3.00
Perception Without Attention:
Evidence of Grouping Under Conditions of Inattention
Cathleen M. M o o r e and H o w a r d Egeth
Johns Hopkins University
Many theories of visual perception assume that before attention is allocated within a scene,
visual information is parsed according to the Gestalt principles of organization. This assumption has been challenged by experiments in which participants were unable to identify ~¢laat
Gestalt grouping patterns had occurred in the background of ptimary-t~k displays (A. Mack,
B. Tang, R. Tuma, S. Kahn, & I. Rock, 1992). In the present study, participants reported
which of 2 horizontal lines was longer. Dots in the background, if grouped, formed displays
similar to the Ponzo illusion (Experiments 1 and 2) or the Mtlller-Lyer illusion (Experiment
3). Despite inaccurate reports of what the patterns were, participants' responses on the
line-length discrimination task were clearly affected by the 2 illusions. These results suggest
that Gestalt grouping does occur without attention but that the patterns thus formed may not
be encoded in memory without attention.
An assumption of many theories of visual perception is
that a rich array of perceptual primitives is extracted from
visual scenes without attention, that is, preattentively (e.g.,
Duncan & Humphreys, 1989; Julesz, 1981; Kahneman &
Treisman, 1984; Neisser, 1967; Treisman, 1982, 1988;
Wolfe, 1994). This assumption is tenable on logical
grounds: The visual field must be at least partially organized
at preattentive levels of processing for there to be something
on which to base later--attentive--processing. In addition
to this logical justification, empirical evidence has suggested that perceptual organization occurs preattentively.
Dual-task studies, for example, have demonstrated that
when attention is occupied by a demanding primary task,
substantial perceptual processing can occur within the attentional periphery (e.g., Braun & Sagi, 1990, 1991; but s e e
Ben-Av, Sagi, & Braun, 1992). In addition, within visualsearch experiments, salient items and texture boundaries
seem to "pop out" from visual arrays of surrounding items
with no effort (e.g., Beck, 1982; Treisman, 1985). Results
like these have been taken as evidence that visual scenes
are organized into perceptual primitives before attention is
focused within the scene. Indeed, a substantial amount of
research has been dedicated to identifying what the
preattentive primitives are, based on results from dual-
task and visual-search experiments (e.g., Bravo & Blake,
1990; Brown, Weisstein, & May, 1992; Julesz & Bergen,
1983; Ramachandran, 1988; Treisman & Gormican,
1988).
An important feature of preattentive processing, according to the view just described, is that it organizes the visual
field. Therefore, it is reasonable that preattentive processes
have been assumed to include those mechanisms that give
rise to the Gestalt principles of perceptual organization (see
Pomerantz, 1981, for a review; Figure 1 illustrates two
principles of Gestalt grouping). The Gestalt principle that is
considered most often is grouping by similarity (e.g., Duncan & Humphreys, 1989; Humphreys, Quinlan, & Riddoch,
1989; Neisser, 1967; Treisman, 1982, 1988), but grouping
by proximity, closure, and good continuation have also
been assumed to apply preattentively (Grossberg, Mingolla,
& Ross, 1994; Pomerantz, 1981; Pomerantz & Pristach,
1989).
Object-based theories of perception have been especially
explicit in assuming that the Gestalt principles apply preattentively. These theories maintain that visual scenes are first
parsed, in part according to the Gestalt principles, and that
attention is then directed to the perceptual objects that result
from that parsing process (see Kanwisher & Driver, 1992,
for a review).
There is evidence to support the idea that attention is
allocated to objects. For example, it is difficult to attend to
more than one object simultaneously (e.g., Duncan, 1984;
Kahneman, Treisman, & Burkell, 1983; of. Vecera & Farah,
1994). At the same time, it is diffienlt to ignore distracting
information that is part of an attended object (e.g., Kahneman & Henik, 1981). Similarly, entire objects receive processing benefits when only part of the object is cued or
primed (Egly, Driver, & Rafal, 1994; see also Kahneman,
Treisman, & Gibbs, 1991). Finally, patients who suffer from
the attentional disorder visual neglect sometimes demonstrate object-based frames of reference in neglecting their
Cathleen M. Moore and Howard Egeth, Department of Psychology, Johns Hopkins University.
This research was supported by National Institutes of Health
Grant 5 T32 MH18215, awarded to the Department of Psychology
at Johns Hopkins University, and by National Science Foundation
Grant SBR-9319356. We thank Toby Mordkoff for helpful discussions concerning this work.
Correspondence concerning this article should be addressed to
Cathleen M. Moore, who is now at the Department of Psychology,
Moore Building, Pennsylvania State University, University Park,
Pennsylvania 16802. Electronic mail may be sent via Internet to
cmml 5 @psu.edu.
339
66
PERCEPTION WITHOUT ATI'ENTION
341
cations of how well the patterns could be perceived when
the backgrounds were attended--either partially or fully.
The results from several experiments using the inattention
method suggested that participants failed to group the dots
when they were not attending to the background (Mack et
al., 1992). Report accuracy of the patterns was at chance
following the inattention trials. In fact, participants often
denied that there had been a pattern and professed to have
been guessing about what the pattern might have been. In
contrast, on the divided-attention and full-attention trials,
reports concerning the patterns were quite accurate, which
indicated that the patterns could be seen when they were
attended. This pattern of results was found using several
different types of grouping stimuli in the backgrounds.
In summary, unlike the theories described in the first
section of this article, some theories of attention maintain
that very little perceptual organization occurs preattentively.
In support of this position is evidence that the Gestalt
principles of organization seem not to apply preattentively;
when they were tested under conditions of inattention, participants failed to report what grouping patterns had occurred in the background. This finding conflicts directly
with the assumptions concerning preattentive processing
that are part of many theories of perception, most explicitly
the object-based theories.
Thus, according to this view, assumptions concerning
preattentive perceptual organization must be tested under
conditions of inattention. Toward this end, Mack and Rock
and their colleagues developed an experimental method to
identify what can be perceived under conditions of inattention (Mack et al., 1992; Rock, Linnet, et al., 1992; see
Epstein & Babler, 1989, 1990, for a similar method). In one
study, they applied this method to the specific question of
whether Gestalt grouping can occur under conditions of
inattention (Mack et al., 1992).
The study took the following form. A fairly difficult
perceptual task was superimposed on a background of black
and white dots that were irrelevant to the task. On most
trials, the dots in the background were randomly black and
white. On the critical trial, however--the inattention trial-the dots were colored such that if grouping by similarity
occurred, a salient pattern would appear (e.g., alternating
rows of black and white dots). At the end of this trial,
participants were unexpectedly asked to make a forcedchoice response concerning what pattern had appeared in
the background. The logic was that if grouping occurs
without attention, then despite not having attended to the
background, participants would be able to report what pattern had appeared.
After the inattention trial, participants were no longer in a
state of inattention concerning the background. Therefore
only a single observation under conditions of inattention
could be obtained for each participant. Nonetheless, later in
the experiment a second patterned-background trial was
presented, and again participants were asked to make a
forced-choice response concerning what the pattern had
been. This trial was called the divided-attention trial because participants knew that something might occur in the
background. Finally, before the last trial in the experiment,
participants were asked to ignore the primary task and
simply report what pattern appeared in the background. We
refer to this trial as the full-attention trial. Whereas the
inattention trial provided an indication of how well the
patterns were perceived under conditions of inattention,
the divided-attention and full-attention trials provided indi-
Note this paper has a whole section for W1
Purpose of the Present Research
W1
et al.'s (1992) and Rock, Linnet, et al.'s (1992) definition of
"without attention." In another context, Treisman (1993), for example, did not use the terms preattention and inattention synonymously. She defined preattentive processing as "an inferred stage
of early vision, which [she] attribute[s] to the separate feature
modules. Before any conscious visual experience is possible, some
form of attention is required" (p. 13). By contrast, inattention is
used to refer to a state in which "attention is narrowly focused
elsewhere. In this case, even global feature boundaries should no
longer be available" (19. 15). Treisman's use of preattention and
inattention might correspond to what Mack et al. and Rock, Linnet,
et al. called divided or d/ffuse attention and inattention, respectively. Howevex~Treisman's preattenfion, unlike Mack et al.' s and
Rock, Linnet, et al.'s divided or diffuse attention, refers to processing that occurs without attention. Because this is the primary
concern of this article, we chose to use the terms preattention and
inattention in the manner described in the opening sentences of
this footnote.
68
Although participants were unable to report what grouping patterns had occurred on the inattention trials in Mack et
al.'s (1992) experiments, it does not necessarily follow that
the dots were not grouped. Instead, participants may have
been unable to remember what patterns had appeared. This
might have occurred for two different reasons. First, participants may have forgotten what the patterns were by the
time the question was answered. Alternatively, without having attended to the patterns, participants may never have
successfully encoded them in memory. In either case, participants would have been unable to report what the patterns
were, despite having perceived them at the time. To address
the question of whether Gestalt grouping occurs under conditions of inattention, it would be useful to obtain a measure
of grouping from the time during which the stimuli are
present. This was the purpose of the present research.
We took advantage of two simple perceptual illusionsB
the Ponzo illusion (see Figure 2A) and the MUller-Lyer
illusion (see Figure 2B). As in the experiments reported by
Mack et al. (1992), a difficult line-length discrimination
task was superimposed on a background of black and white
dots, which occasionally formed patterns. The patterns were
designed such that if they were perceived, they could influenceBthrough either the Ponzo illusion or the Mtiller-Lyer
illusion--the perceived lengths of the lines to be discriminated. Using displays like these, we could determine
whether the dots had been grouped on inattention trials
without having to ask the participants directly about what
they had seen. If reports of line length are influenced by the
patterns that are embedded in the dots, then this fmding
PERCEPTION WITHOUT ATTENTION
343
random-matrix trials. Participants simply reported which line segment appeared to be longer. The fourth, seventh, and eighth trials
were pattern-matrix trials; they were the inattention, dividedattention, and full-attention trials, respectively. Following both the
inattention and the divided-attention trials, participants reported
which line segment appeared to be longer. No response was
obtained following the full-attention trial; participants were asked
before the trial to ignore the line-segment task and simply look for
the pattern in the dot-matrix background.
After each of the three pattern-matrix trials, participants were
asked three questions concerning the pattern that had appeared in
the background: the direct query, the forced choice, and the
confidence rating. The direct query was the following: "Did you
notice any pattern in the background of dots on that last trial?" The
specific forced-choice question was different for the two experiments, but in each case it provided a forced-choice discrimination
concerning the pattern that had been presented. Finally, the confidence rating referred to the answer given to the forced-choice
question. The confidence rating was 1, 2, or 3, where 1 meant not
at all sure, 2 meant somewhat sure, and 3 meant very sure. As the
experimenter asked each question, a short form of the question was
presented on the dark background of the monitor (e.g., "pattern?";
"direction [up, down]?"; "confidence rating? [1 = not at all, 2 =
somewhat, 3 = very]"). The participant's responses also appeared
on the monitor as they were being entered by the experimenter.
Finally, at the end of the inattention block, participants were
asked the following question: "Did you notice that some of the
trials in the middle block included patterns like you've seen here?"
The participant's answer was entered into the computer, and that
marked the end of the experiment.
An illustration of the trial events is provided in Figure 5. The
fixation point at the beginning of each trial remained present for
1,000 ms. The trial display was then presented for 200 ms, after
which it was replaced by the mask display. The mask remained
present until a response was entered by the experimenter, after
which the display went blank, with only the gray background
visible. An intertfial interval of approximately 5 s in Experiments
1 and 2 and 8 s in Experiment 3--during which the display was
gray--preceded each trial.
ments 1 and 2, the lengths were chosen randomly from three
short-long pairs: (1.30", 1.50"), (1.50 °, 1.70"), (1.70", 1.90"). In
Experiment 3, the lengths were chosen randomly from three different sbort-long pairs: (5.00", 5.20"), (5.10", 5.30"), (5.20",
5.40*). On pattern-matrix trials, the line segments were the same
length (Experiments 1 and 2: 1.40"; Experiment 3: 5.20*). The
position of the line segments within the matrix was the same on all
trials within a given experiment. In Experiments 1 and 2, one line
segment was presented between the 5th and 6th rows, and the other
was presented between the 8th and 9th rows of the dot matrices. In
Experiment 3, they were presented in the 7th and 16th rows of the
dot matrices, on top of the dots. All line segments were centered
horizontally within the matrix. Finally, for the random-matrix
trials, in which one line segment was longer than the other, the
longer segment was presented equally often in the top and bottom
positions.
Task
The primary task in all three experiments was to report which of
the two black line segments appeared longer by vocally reporting
top or bottom. In addition to this primary task, participants were
sometimes asked questions about the displays. These questions are
described in detail in the Procedure section.
Procedure
Each participant took part in an individual session, which lasted
approximately 15-20 rain. Figure 4 illustrates the structure of the
experiments. Experiments 1 and 3 consisted of one practice block
and two experimental blocks of trials; Experiment 2 consisted of
one practice block and one experimental block of trials.
At the beginning of each session, participants were told that
each trial would begin with a fixation point, followed by a briefly
presented matrix of black and white dots that had two line segments embedded in it. They were asked to report verbally, after
each trial, which of the two line segments appeared longer: the top
or the bottom. If they did not know which was longer, they were
asked to guess.
Following these instructions, a block of 10 practice trials was
provided; these trials were always random-matrix trials. After each
trial, the experimenter entered the participant's response into the
computer and said whether the response was correct or not. Many
of the participants reported that they had been unable to see the
line segments on the first practice trial. All participants, however,
were able to see them and report which was longer by the end of
3 trials.
After the 10 practice trials, participants completed a block of 32
trials, plus 2 warm-up trials at the beginning of the block. We refer
to this block of trials as the illusion block. Participants were told
that the experimenter would no longer provide feedback concerning the participant's responses but would continue to enter the
responses into the computer. A random 16 of the experimental
trials were random-matrix trials, and the remaining 16 were
pattern-matrix trials, on which neither line segment was longer.
(The 2 warm-up trials were always random-matrix trials.) The
details of the patterns are described separately for each experiment.
Experiment 2 was complete at the end of this block.
Participants in Experiments 1 and 3 then completed a third block
of eight trials, which had a design similar to that of the experiments
reported by Mack, Linnet, et al. (1992) and Rock et al. (1992). This
block of trials is referred to as the inattention block. The first three
trials as well as the fifth and sixth trials of this block were
Experiment 1
Intro Exp (1): Write the main purpose
Experiment 1 took advantage of the Ponzo illusion (also
known as the railroad-track illusion): A line segment that is
located near the narrow end of two converging lines appears
longer than an identical line segment that is located nearer
to the diverging end (see Figure 2A). The dots on the
pattern-matrix trims in this experiment were colored such
that if grouped, they would form two converging lines, as in
the Ponzo illusion. Recall that the line segments on the
pattern-matrix trials were actually the same length. If participants are unbiased in their responses, then they should
respond top equally often as bottom. If the dots are grouped,
however, then the pattern of converging lines may cause
participants to perceive the line segment that is near to the
converging end o f the pattern as longer than the other
segment. If this occurs, then it would indicate that grouping
by similarity occurred.
Me~od Method for Exp (1)
Stimuli. Figure 6 provides an illustration of the two types of
pattern-matrix trials that we used in Experiment 1. The black dots
70
344
MOORE AND EGETH
Block 2
Illusion
Block 1
Practice
Block 3
Inattention
/
random-ma~
.~_ s
16p a ~ trials
16
10 random-matrix trials
Trial I
random
matrix
Trial 2
mQ~
random
matrix
IBI
. .[
pattern
matrix
+
Questions
Full
Attention
Trial 8
Trial
• . .
i •..
Inattention
Trial 4
Divided
Attention
7
Trial6
Trial 5
random
matrix
random
matrix
Trial 3
(Experiments '
random
matrix
pattern
matrix
+
Questions
i
•°•
i
ma~Jx
+
Questions
Method for Exp (2): Complete the figure
Figure 4. Structure of the experiments. Each experiment began with a practice block of 10
random-matrix trials. The practice block was followed by the illusion block, in which half of the
trials were random-matrix trials and the other half were pattern-matrix trials. In Experiments 1 and
3, the illusion block was followed by the inattention block, which had a design that was similar to
that of the experiments reported by Mack et al. (1992) and Rock, Linnet, et al. (1992). The first 3
trials were random-matrix trials. The 4th trial was the inattention trial, which was a pattern-matrix
trial and was followed by the three questions concerning what the pattern had been (see text for
details). The 5th and 6th trials were again random-matrix trials. The 7th and 8th trials were the
divided-attention and full-attention trials, respectively. They were pattern-matrix trials and were
followed by the same three questions.
centered within the dot-pattern lines, which were in turn centered
within the matrix. The distance between the endpoint of a line
segment located near the converging end of the pattern and the
edge of the nearest black dot was 0.62 ° . The distance between an
formed two converging black lines, as in the Ponzo illusion. On
some trials, the converging end of the pattern pointed toward the
top (Figure 6A); on other trials, it pointed toward the bottom
(Figure 6B). The lines for the length discrimination task were
71
PERCEPTION WITHOUT ATrENTION
347
Finally, the point-biserial correlation provides evidence
that the bias to report top that was observed in the illusion
block was probably not caused by a few participants who
happened to notice the patterns. Participants who got the
forced-choice question fight following the inattention trial
were no more likely to have shown the bias in the illusion
block than those who got the forced-choice question wrong.
On the whole, the results suggest that grouping by similarity did occur without attention but that the patterns could
not be remembered for subsequent report. This may be
because the patterns were forgotten or because they were
never successfully encoded. Dis Exp (1): State main finding
A possible criticism of our results is that although participants seemed to have experienced the Ponzo illusion, it is
possible that the responses were biased for reasons that do Dis Exp (2): Mention possible confounding
not involve grouping by s i ~ t y .
Specifically, the endFigure 8. Typical pattern-matrix display from Experiment 2. All
point of the line segment that appeared near the converging
but eight of the dots were white. Four of the black dots surrounded
the upper line segment by the same distance as the closest dots
end of the pattern on the pattern-matrix trials was closer to
were to the upper line segment in the illusion block of Experiment
a black dot than was the endpoint of the line segment that
1. The remaining four black dots surrounded the lower line segwas nearer to the diverging end of the pattern. It is possible
ment
by the same distance as the closest dots were to the lower line
that the line segment nearer to the converging end appeared
segment in the illusion block of Experiment 1. The two line
longer because the line segmentmwhieh was also blackm
segments for the length discrimination task were identical and
was blurred into the nearby black dot.
were centered within the black dots, which were, in turn, centered
Such blurring might occur for two different reasons. First,
within the matrix.
the line segments were presented parafoveally, where acuity
is not as good as at fixation. The line segment and nearby
black dot may have appeared continuous because the spatial
maximize the chances of blurring, thus providing a conservative
resolution was not sufficient to detect the gap between them.
control for Experiment 1.
Second, participants may have deliberately squinted in orProcedure. Experiment 2 included only the practice block and
der to blur their vision and make one of the line segments
the illusion block (see Figure 4). Other than this difference, the
appear longer than the other on the ambiguous patternprocedure was identical to that of Experiment 1.
matrix trials. Experiment 2 provided a control for these
alternative interpretations of the results of Experiment 1.
Results
The mean percent correct for reporting which line segment was longer on the random-matrix trials in the illusion
block was 88.44% (_+3.46%). For the pattern-maaix trials,
on which the two line segments were the same length, the
mean percentage of trials on which the top one was reported
as appearing longer than the bottom one was 49.06%
(+_4.68%). The 50% value, which was expected by chance,
is contained within this interval.
Experiment 2
Intro Exp (2): Critical manipulation needed to answer w1
The pattern-matrix trials in Experiment 2 included the
black dots that were nearest to the endpoints of the line
segments on the pattern-matrix trials in Experiment 1, but
they did not include the black dots that formed the rest of
the pattern. If the bias in responses on pattern-matrix trials
that was observed in Experiment 1 was caused by a blurring
together of the line segment and the nearby black dot, then
the bias should occur in this experiment as well. If instead
the bias was caused by the Ponzo pattern that was present in
the pattern-matrix trials, then it should not occur in this
experiment.
Discussion
Res Exp (3): Do not list the non-significant
results unless it is critical
The results provide no evidence to suggest that the bias
observed on the pattern-matrix trials in Experiment 1 was
caused by a blurring of the line segment with a nearby black
dot. The same opportunity was present in Experiment 2, but
no bias was observed.
In addition, the lack of bias on the pattern-matrix trials of
this experiment provides further evidence that participants
were not, in general, biased to report top more often than
bottom. If there were such a bias, then it would have
appeared in this experiment as well.
Given the results of Experiment 2, it seems likely that
participants' responses were influenced by the Ponzo illusion in Experiment 1, and therefore that grouping by sinai-
Method
Stimuli. Figure 8 provides an illustration of the pauem-matrix
trials that were used in Experiment 2. The pattern consisted of
eight black dots: four surrounding the endpoints of the top line
segment by the same distance as in Experiment 1 (i.e., 0.60 °) and
four surrounding the endpoints of the bottom line segment by the
same distance as in Experiment 1 (i.e., 2.60°). In Experiment 1, the
line segments were surrounded by only two dots at those exact
distances (see Figure 6A). Four dots were used in Experiment 2 to
74
PERCEPTION WITHOUT ATTENTION
351
requires focused attention. If Gestalt grouping cannot be
References
done under conditions of divided attention, then surely it
Baylis, G. C. (1994). Visual attention and objects: Two-object cost
cannot be done under conditions of inattention. Perhaps the
with equal convexity. Journal of Experimental Psychology: Huinattention method has failed to isolate conditions of
man Perception & Performance, 20, 208-212.
inattention.
Baylis, G.C., & Driver, J. (1992). Visual parsing and response
One reason for the difference in results between our study
competition: The effects of grouping. Perception & Psychophysand that of Ben-Av et al. (1992) may be that the displays
ics, 51, 145-162.
used in the two studies differed in terms of how easily
Baylis, G.C., & Driver, J. (1993). Visual attention and objects:
grouping could occur. The grouping patterns in our study
Evidence for hierarchical coding of location. Journal of Experwere quite salient: Black and white dots on a gray backimental Psychology: Human Perception and Performance, 19,
ground were used for grouping by similarity. In addition,
451-470.
the matrices were perfectly aligned, with symmetrical dots
Beck, J. (1982). Textural segmentation. In J. Beck (Ed.), Organization and representation in perception (pp. 285-317). Hillsforming straight rows and columns. In contrast, the groupdale, NJ: Erlbaum.
ing patterns in the Ben-Av et al. (1992) displays were not as
Ben-Av, M. B., Sagi, D., & Braun, J. (1992). Visual attention and
salient. In some experiments, white plus signs and Ls on a
perceptual grouping. Perception & Psychophysics, 52, 277-294.
black background were used for grouping by similarity. In
Braun,
J., & Sagi, D. (1990). Vision outside the focus of attention.
other experiments, displays with differences in distance as
Perception & Psychophysics, 48, 45-58.
small as 20% were used for grouping by proximity. Finally,
Braun, J., & Sagi, D. (1991). Texture-based tasks are little affected
their stimuli were randomly rotated and were jittered. It is
by a second task which requires peripheral or central attentive
therefore possible that Gestalt grouping does occur preatfixation. Perception, 20, 483--500.
tentively when the patterns are relatively salient but not
Bravo, M., & Blake, R. (1990). Preattentive vision and perceptual
when the patterns are less easily resolved.
groups. Perception, 19, 515-522.
A second reason that Ben-Av et al. (1992) may have
Brown, J. M., Weisstein, N., & May, J. G. (1992). Visual search
for simple volumetric shapes. Perception & Psychophysics, 51,
failed to find evidence of Gestalt grouping is that the
40-48.
method of report may have required that the patterns be
Driver, J., & Baylis, G. C. (1989). Movement and visual attention:
encoded in memory. Participants were asked to concentrate
The spotlight metaphor breaks down. Journal of Experimental
on the primary task. They may therefore have had to rely on
Psychology: Human Perception and Performance, 15,
their memories to make reports concerning the grouping
448 -456.
patterns. If our conclusion is correct---that is, that Gestalt
Driver, J., & Halligan, P. W. (1991). Can visual neglect operate in
grouping patterns cannot be encoded in memory without
object-centered coordinates? An affirmative single-ease study.
attention--then it follows that participants would be poor at
Cognitive Neuroscience, 8, 475-496.
reporting grouping patterns under these conditions.
Duncan, J. (1984). Selective attention and the organization of
A final criticism of our study concerns how our evidence
visual information. Journal of Experimental Psychology: Genfor preattentive Gestalt grouping can be reconciled with
eral 113, 501-517.
previous evidence that grouping occurs after the resolution
Duncan, J., & Humphreys, G. W. (1989). Visual search and stimulus similarity. Psychological Review, 96, 433-458.
of several perceptual constancies (Rock & Brosgole, 1964;
Egly, R., Driver, J., & Rafal, R. D. (1994). Shifting visual attention
Rock, Nijhawan, et al., 1992) and after amodal completion
between objects and locations: Evidence from normal and parihas occurred (Palmer, 1996). There are two possible reasons
etal lesion subjects. Journal of Experimental Psychology: Genfor this apparent conflict. First, it is possible that all of these
eral 123, 161-177.
perceptual phenomena--grouping, constancies, and arnodal
Epstein,
W., & Babler, W. (1989). Perception of slant-in depth is
completion--occur preattentively. Of these three phenomautomatic. Perception & Psychophysics, 45, 31-33.
ena, only grouping has been tested using the inattention
Epstein, W., & Babler, W. (1990). In search of depth. Perception
method. Second, it is possible that grouping occurs at mul& Psychophysics, 48, 68-76.
tiple levels of processing. Neither of these possibilities can
Gibson, B. S. (1994). Visual attention and objects: One versus two
be ruled out by any of the data discussed here.
or convex versus concave? Journal of Experimental Psychology:
In summary, our results indicate that at least one form of
Human Perception and Performance, 20, 203-207.
Gestalt grouping can occur under conditions of inattention,
Grabowecky, M., Robertson, L. C., & Treisman, A. (1993). Preattentive processes guide visual search. Journal of Cognitive
insofar as inattention is defined within Mack et al.'s (1992)
Neuroscience, 5, 288-302.
and Rock, Linnet, et al.'s (1992) method. Although particGrossberg, S., Mingolla, E., & Ross, W.D. (1994). A neural
ipants may be unable to report what grouping patterns
theory of attentive visual search: Interactions of boundary, surappeared on a previous trial, their behavior can be systemface, spatial, and object representations. Psychological Review,
aticaUy influenced by what patterns are present on a current
101, 470-489.
trial. These results, in conjunction with those of Mack et al.
Halligan, P. W., & Marshall, J. C. (1993). When two is one: A case
and Rock, Linnet, et al. suggest that Gestalt grouping does
study of spatial parsing in visual neglect. Perception, 22,
occur preattentively but that encoding the results in memory
309-312.
may require attention. This conclusion is consistent with the
Hochberg, J. (1974). Higher-order stimuli and interresponse contheories of perception that maintain that substantial perceppiing in the perception of the visual world. In R. B. MacLeod &
tual organization precedes the allocation of attention within
H.L. Pick (Eds.), PercePtion: Essays in honor of James J.
a visual scene. Final Gen Dis (6): Write mainly the 4th WGibson (pp. 17-39). Ithaca, NY: Cornell University Press.
78
brief communications
reduced by the 1996 feed ban but remained
constant, I estimate that only two late-stage
infected animals under 30 months old will
be slaughtered for consumption in France.
In summary, robust cohort-based epidemiological analyses should form a suitable framework for re-examination of the
potential risks posed by the consumption
of beef from countries with native-born
BSE cases.
Christl A. Donnelly
Department of Infectious Disease Epidemiology,
Imperial College School of Medicine, St Mary’s
Campus, Norfolk Place, London W2 1PG, UK
e-mail: [email protected]
W3
W2
a
What you see
is what you hear
ision is believed to dominate our
multisensory perception of the world.
Here we overturn this established view
by showing that auditory information can
qualitatively alter the perception of an
unambiguous visual stimulus to create a
striking visual illusion. Our findings indicate that visual perception can be manipulated by other sensory modalities.
We have discovered a visual illusion that
is induced by sound: when a single visual
flash is accompanied by multiple auditory
beeps, the single flash is incorrectly perceived as multiple flashes. These results
were obtained by flashing a uniform white
disk (subtending 2 degrees at 5 degrees
eccentricity) for a variable number of times
(50 milliseconds apart) on a black background. Flashes were accompanied by a
variable number of beeps, each spaced 57
milliseconds apart. Observers were asked to
judge how many visual flashes were presented on each trial. The trials were randomized and each stimulus combination
was run five times on eight naive observers.
Surprisingly, observers consistently and
incorrectly reported seeing multiple flashes
whenever a single flash was accompanied
by more than one beep (Fig. 1a). Control
conditions and catch trials (Fig. 1 legend)
indicate that the illusory flashing phenomenon is a perceptual illusion and is not due
to the difficulty of the task, cognitive bias or
other factors.
Figure 1b shows that observers’ performance was the same, irrespective of
whether a single flash was accompanied by
two beeps, or two flashes by one or no
beeps, suggesting that the illusory double
flash is perceptually equivalent to the physical double flash. This was confirmed by the
testimonies of the observers after the exper-
V
788
4
3
2
1
n=8
1
b
Number of perceived flashes
W1
Illusions
Number of perceived flashes
1. Office International des Epizooties. http://www.oie.int/eng/
info/en_esbmonde.htm (as of 1 December 2000).
2. French Ministry of Agriculture and Fisheries. http://
www.agriculture.gouv.fr/alim/sant/mala/cell-testESB/
Esb/1-1711.pdf (as of 1 December 2000).
3. Donnelly, C. A., Santos, R., Ramos, M., Galo, A. & Simas, J. P.
J. Epidemiol. Biostat. 4, 277–283 (1999).
4. Anderson, R. M. et al. Nature 382, 779–788 (1996).
5. Donnelly, C. A., Ghani, A. C., Ferguson, N. M. & Anderson,
R. M. Nature 389, 903 (1997).
6. Ferguson, N. M., Donnelly, C. A., Woolhouse, M. E. J. &
Anderson, R. M. Phil. Trans. R. Soc. Lond. B 352, 803–838 (1997).
7. Donnelly, C. A. & Ferguson, N. M. Statistical Aspects of BSE
and vCJD: Models for Epidemics (Chapman & Hall/CRC,
London, 1999).
8. Doherr, M. G., Heim, D., Vandevelde, M. & Fatzer, R. Vet. Rec.
145, 155–160 (1999).
9. Butler, D. Nature 382, 5 (1996).
10. Schreuder, B. E. C., Wilesmith, J. W., Ryan, J. B. M. & Straub,
O. C. Vet. Rec. 141, 187–190 (1997).
11. Summary of the Spongiform Encephalopathy Advisory
Committee Meeting on 29 November 1999; News Release,
22 December 1999.
4
2
3
Number of beeps
4
3
W4
2
1
n=8
1
2
3
Number of flashes
4
Figure 1 Illusory flashing. a, Perceived number of visual flashes
by eight observers plotted as a function of the number of auditory
beeps for a single flash. The number of perceived flashes did not
increase linearly with the third and fourth beeps because they fell
outside the optimal window of audiovisual integration, as revealed
by our next experiment. b, Perceived number of flashes by eight
observers plotted as a function of the actual number of flashes
presented for trials with no sound (dashed line), and trials with
single beeps corresponding to catch trials (grey line). Observers
performed the task very well in the absence of sound (dashed
line). The results of the catch trials (grey line) confirm that the
observers’ responses were not determined by their auditory percepts. The curve in a (for a single flash) is superimposed for comparison. Further details can be obtained from L.S.
iment. The performance of non-naive subjects (results not shown) indicated that the
illusion persisted even when subjects were
aware of the fact that the disk was actually
flashed only once.
We next investigated the temporal properties of this illusion by varying the relative
© 2000 Macmillan Magazines Ltd
80
timing of visual and auditory stimuli. The
illusory flashing effect declined from 70
milliseconds separation onwards. However,
illusory flashing occurred as long as the
beep and flash were within approximately
100 milliseconds of each other, which is
consistent with the integration time of polysensory neurons in the brain1,2.
Our results indicate that the illusory
flashing is caused by an alteration of visual
perception by auditory stimuli. The modification of the visual percept by sound, however, was not categorical. It was rather
selective, as sound did not have a fusing
effect when multiple flashes were accompanied by a single beep. We suggest therefore
that the direction of cross-modal interactions is partly dependent on the type of
stimulus. Consistent with previous observations in other modalities3, we propose
that the percept of a continuous stimulus in
one modality is rendered more malleable
by a discontinuous stimulus in another
modality than vice versa.
The influence of auditory cues on visual
perception has been demonstrated in other
settings, in which perceived visual intensity
is affected by the presence of an auditory
stimulus4. This influence, however, is quantitative and does not alter the phenomenological quality of the percept. Others have
shown that the perceived direction of
ambiguous visual motion is influenced by
auditory stimulation5.
Our results extend these previous findings by showing that visual perception can
be qualitatively altered by sound even when
the visual stimulus is not ambiguous. The
conditions under which this alteration
occurs — the stimulus configuration and
the task — are very simple. The illusion is
also surprisingly robust to variation in the
many parameters we manipulated (including disk eccentricity and contrast, spatial
disparity between sound and flash, shape
and texture of the flashing pattern, flash
and beep durations). The simplicity and
robustness of the illusory flashing phenomenon indicate that it reflects a fundamental
and widespread property of polysensory
mechanisms in the brain.
Ladan Shams*, Yukiyasu Kamitani*,
Shinsuke Shimojo*†
W4
*California Institute of Technology,
Division of Biology, MC 139-74, Pasadena,
California 91125, USA
e-mail: [email protected]
†NTT Communication Science Laboratories,
Human and Information Science Laboratory,
Atsugi, Kanagawa 243-0198, Japan
1. Meredith, M. A., Nemitz, J. W. & Stein, B. E. J. Neurosci. 10,
3215–3229 (1987).
2. Wallace, M. T. & Stein, B. E. J. Neurophysiol. 71, 429–432
(1994).
3. Saldaña, H. M. & Rosenblum, L. D. Percept. Psychophys. 54,
406–416 (1993).
4. Stein, B. E., London, N., Wilkinson, L. K. & Price, D. D.
J. Cogn. Neurosci. 8, 497–506 (1996).
5. Sekuler, R., Sekuler, A. B. & Lau, R. Nature 385, 308 (1997).
NATURE | VOL 408 | 14 DECEMBER 2000 | www.nature.com
letters to nature
cortices in one', with twice the number of neurons underneath a
square millimetre of surface than the remainder of neocortex12. The
density of neurons in the rest of neocortex scales similarly as in V1
(ref. 12), and all of the hominoid thalamic nuclei also have neuronal
densities that vary in about the same was as in LGN13±15; altogether,
then, the relations between number of thalamic neurons and
number of neocortical neurons is about the same as the 3/2
power scaling law described above for the primary visual cortex.
The conservation of these scaling relations raises the possibility
that a similar basis for the scaling laws exists for all cortical areas. In
this view, each cortical area would be provided with a map of some
sortÐperhaps one with very abstract quantitiesÐand the job of the
cortex would be to extract some characteristic of the map at each
point that would be represented as a location code by the neurons in
each map `pixel'. Note that the information in the map need not be
supplied by thalamus; this structure would only have to determine
the number of pixels in the map. If n pixels are present in a cortical
region, then the number of neurons per pixel needed to maintain
the same resolution within a pixel as across pixels would vary as n1/2.
M
A 3/2 power relation would result.
12. Rockel, A. J., Hiorns, R. W. & Powell, T. P. The basic uniformity in structure of the neocortex. Brain
103, 221±244 (1980).
13. Armstrong, E. A quantitative comparison of the hominoid thalamus. IV. Posterior association
nucleiÐthe pulvinar and lateral posterior nucleus. Am. J. Phys. Anthropol. 55, 369±383 (1981).
14. Armstrong, E. A quantitative comparison of the hominoid thalamus: II. Limbic nuclei anterior
principalis and lateralis dorsalis. Am. J. Phys. Anthropol. 52, 43±54 (1980).
15. Armstrong, E. Quantitative comparison of the hominoid thalamus. I. Speci®c sensory relay nuclei.
Am. J. Phys. Anthropol. 51, 365±382 (1979).
16. Frahm, H. D., Stephan, H. & Baron, G. Comparison of brain structure volumes in insectivora and
primates. V. Area striata (AS). J. Hirnforsch. 25, 537±557 (1984).
17. Shulz, H.-D. Metrische Untersuchagen an den Schichten des Corpus Geniculatum Laterale tag- und
Nachtaktiven Primaten. Thesis, Johann Wolfgang Goethe-Universitaet Frankfurt (1967)).
18. Fritschy, J. M. & Garey, L. J. Quantitative changes in morphological parameters in the developing
visual cortex of the marmoset monkey. Brain Res. 394, 173±888 (1986).
Acknowledgements
Much of the work described here was done at the Santa Fe Institute and the Aspen Center
for Physics, and I thank those institutions for their support. I also thank T. Albright,
S. Gandhi and I. Brivanlou for comments on an earlier draft.
Correspondence and requests for materials should be addressed to the author
(e-mail: [email protected]).
.................................................................
Methods
To obtain the relation between the number of LGN and V1 neurons, I must make use of
three separately determined scaling relations. First, for haplorhines, the volume V of the
grey matter in V1 is related to the LGN volume v by a power law4,16 (see Fig. 2),
Interocular rivalry revealed in the
human cortical blind-spot
representation
V ˆ Ava
where A = 14.61 6 4.78 and a = 1.125 6 0.057; volumes are measured in cubic millimetres
and refer to both hemispheres.
These volumes can be converted to numbers of neurons, if the neuronal densities in
LGN and V1 are known. The numbers of neurons n in the LGN, as a function of LGN
volume v, conform to a power law17 for 23 haplorhines and 17 strepsirhines,
Frank Tong* & Stephen A. Engel²
* Department of Psychology, Princeton University, Princeton, New Jersey 08544,
USA
² Department of Psychology, University of California Los Angeles, Los Angeles,
California 90095, USA
n ˆ Bv b
with b = 0.659 6 0.06 for haplorhines and b = 0.683 6 0.22 for the strepsirhines, values
that are not signi®cantly different. The scale factor B, however, is different with a value of
0.071 6 0.025 for haplorhines and 0.046 6 0.029 for strepsirhines. This power law is based
on data from ref. 17 combined with data from ref. 4.
The relation between V1 volume V and the number of V1 neurons N also follows the
power law,
..............................................................................................................................................
N ˆ DV d
where D = 0.232 and d = 0.902. This relation is obtained by using the observation12 that
0.195 million neurons are found beneath a square millimetre of V1 surface in primates (the
value is corrected for 18% shrinkage), and the weak power law dependence of V1 thickness
t on cortical surface area S described12,18 by;
t ˆ 0:825S 0:108
When V and v are converted to N and n with the equations above, a power law results
(Fig. 1) with an exponent l:
l ˆ ad=b ˆ 1:54 6 0:072
Received 7 December 2000; accepted 31 January 2001.
1. Finlay, B. L. & Darlington, R. B. Linked regularities in the development and evolution of mammalian
brains. Science 268, 1578±1584 (1995).
2. Barton, R. A. & Harvey, P. H. Mosaic evolution of brain structure in mammals. Nature 405, 1055±
1058 (2000).
3. Andrews, T. J., Halpern, S. D. & Purves, D. Correlated size variations in human visual cortex, lateral
geniculate nucleus, and optic tract. J. Neurosci. 17, 2859±2868 (1997).
4. Stephan, H., Frahm, H. D. & Baron, G. New and revised data on volumes and brain structures in
insectivores and primates. Folia primatol. 35, 1±29 (1981).
5. Schein, S. J. & de Monasterio, F. M. Mapping of retinal and geniculate neurons onto striate cortex of
macaque. J. Neurosci. 7, 996±1009 (1987).
6. Ahmad, A. & Spear, P. D. Effects of aging on the size, density, and number of rhesus monkey lateral
geniculate neurons. J. Comp. Neurol. 334, 631±643 (1993).
7. Hubel, D. H. & Wiesel, T. N. Receptive ®elds and functional architecture of monkey striate cortex.
J. Physiol. (Lond.) 195, 215±243 (1968).
8. Bonhoeffer, T. & Grinvald, A. Iso-orientation domains in cat visual cortex are arranged in pinwheellike patterns. Nature 353, 429±431 (1991).
9. Swindale, N. V. How many maps are there in visual cortex? Cereb. Cortex 10, 633±643 (2000).
10. Issa, N. P., Trepel, C. & Stryker, M. P. Spatial frequency maps in cat visual cortex. J. Neurosci. 20, 8504±
8514 (2000).
11. Frahm, H. D., Stephan, H. & Stephan, M. Comparison of brain structure volumes in Insectivora and
Primates. I. Neocortex. J. Hirnforsch. 23, 375±389 (1982).
W1
NATURE | VOL 411 | 10 MAY 2001 | www.nature.com
81
To understand conscious vision, scientists must elucidate how the
brain selects speci®c visual signals for awareness. When different
monocular patterns are presented to the two eyes, they rival for
conscious expression such that only one monocular image is
perceived at a time1,2. Controversy surrounds whether this binocular rivalry re¯ects neural competition among pattern representations or monocular channels3,4. Here we show that rivalry
arises from interocular competition, using functional magnetic
resonance imaging of activity in a monocular region of primary
visual cortex corresponding to the blind spot. This cortical region
greatly prefers stimulation of the ipsilateral eye to that of the
blind-spot eye. Subjects reported their dominant percept while
viewing rivalrous orthogonal gratings in the visual location
corresponding to the blind spot and its surround. As predicted
by interocular rivalry, the monocular blind-spot representation
was activated when the ipsilateral grating became perceptually
dominant and suppressed when the blind-spot grating became
dominant. These responses were as large as those observed during
actual alternations between the gratings, indicating that rivalry
may be fully resolved in monocular visual cortex. Our ®ndings
provide the ®rst physiological evidence, to our knowledge, that
interocular competition mediates binocular rivalry, and indicate
that V1 may be important in the selection and expression of
conscious visual information.
Despite extensive research, the neural basis of binocular rivalry
has remained highly controversial. Speci®cally, it is debated whether
discrepant monocular patterns rival because of interocular competition or pattern competition. Human psychophysical studies have
provided evidence that rivalry results from interocular competition
among monocular neurons in primary visual cortex (V1)3. However, single-unit recordings in awake, behaving monkeys have
© 2001 Macmillan Magazines Ltd
195
letters to nature
yielded negligible evidence of rivalry-related activity in V1, inconsistent effects in visual areas V4 and MT, and strong effects in
inferotemporal cortex4±6. These neurophysiological ®ndings instead
indicate that rivalry may result from competition among incompatible pattern representations at higher levels of the visual pathway,
well after inputs from the two eyes have converged in V1.
To resolve this issue, we used functional magnetic resonance
imaging (fMRI) to monitor rivalry-related activity in a monocular
region of human V1 corresponding to the blind spot. The blind
spot is a part of the retina that has no photoreceptors; its size is
around 4 ´ 68 and it is about 158 medial to the fovea (Fig. 1a). In
human primary visual cortex, the blind spot is represented as a
relatively large monocular region (around 10 mm ´ 5 mm; J. C.
Horton, personal communication) that receives direct input
solely from the ipsilateral eye and not from the (contralateral)
a
Blind-spot eye
stimulation
Ipsilateral eye
stimulation
Blind
spot
Fovea
Blind-spot eye
Ipsilateral eye
6
6
3
3
0
0
–3
–3
–6
0
15
Time (s)
30
0
15
30
MR signal change (%)
MR signal change (%)
b
Fovea
blind-spot eye. The monocular V1 blind-spot representation is large
enough for functional imaging7.
Functional MRI has suf®cient sensitivity and temporal resolution
to detect rivalry-related responses in stimulus-selective extrastriate
areas8. We predicted that if rivalry arises from interocular competition, then the ipsilateral-responsive neurons in the V1 blind-spot
representation should show increased activity when subjects
perceive a grating pattern presented to the ipsilateral eye and
suppressed activity when subjects perceive a rivalrous grating
presented to the blind-spot eye. Such excitation and suppression
should occur even though both gratings are constantly present.
Furthermore, if rivalry is fully resolved through interocular
competition, then these neural responses during rivalry should be
identical to those evoked by actual stimulus alternations between
the ipsilateral grating and the blind-spot grating.
We performed three types of fMRI scan: V1 blind-spot localization scans, rivalry scans and stimulus alternation scans. Before MRI
scanning, subjects mapped the visual ®eld location of the right eye's
blind spot by manipulating the location and size of a black ¯ickering
circular probe. During V1 blind-spot localization scans, subjects
maintained ®xation while viewing on/off sequences of a ¯ickering
checkerboard pattern using either their left ipsilateral eye or right
blind-spot eye (Fig. 1a). The checkerboard was presented in the
region of visual space corresponding to the blind spot and its
immediate surround with a diameter of 88, almost twice the size
of the blind spot. Although the blind spot could not register the
central portion of the checkerboard, the stimulus was perceptually
®lled in owing to stimulation of the blind spot's surround9.
The V1 blind-spot representation was reliably identi®ed on the
basis of the voxels in the left calcarine sulcus that showed a greater
response to stimulation of the ipsilateral eye than of the blind-spot
eye (see Methods). Figure 1 shows the fMRI response and anatomical locus of this region in one subject. The V1 blind-spot
representation was highly monocular, responding vigorously to
stimulation of the ipsilateral eye (Fig. 1b, left) and negligibly to
stimulation of the blind-spot eye (Fig. 1b, right). In all subjects, this
monocular region was located in the depth of the calcarine sulcus
(Fig. 1c), a location that is always contained within primary visual
cortex10.
During rivalry scans, a red vertical grating was presented to one
eye and a green horizontal grating was presented to the other eye in
the visual location corresponding to the blind spot and its surround
W2
–6
Time (s)
a
c
Rivalry
Stimulus
Percept
b
Stimulus alternation
Yoked
stimulus
Figure 1 Localization of the V1 blind-spot representation. a, Viewing conditions. Subjects
maintained ®xation on a reference point while viewing a ¯ickering checkerboard pattern
(stimulus size 88, check size 18, temporal frequency 7.5 Hz, contrast 100%) with either
the ipsilateral eye or blind-spot eye. The centre of the checkerboard fell on the blind spot
(optic nerve head, size ,48 ´ 68) of the right eye but not the left eye. b, Average fMRI
responses in the V1 blind-spot representation of one subject during stimulation of the
ipsilateral or blind-spot eye. Data represent mean 6 s.e. of eight stimulus periods (on
15 s, off 15 s) and are expressed in per cent signal change relative to the mean MR level
throughout the scan. This region is activated by ipsilateral stimulation only. c, V1
representation of the right eye's blind spot in the left calcarine sulcus (three voxels
highlighted in white, voxel size 3.1 ´ 3.1 ´ 4 mm, slice plane perpendicular to calcarine).
Functional data are superimposed on high-resolution anatomical T2-weighted images.
196
Time (s)
Figure 2 Binocular rivalry and stimulus alternation tasks. a, Rivalrous oscillating sinewave gratings were presented to the blind-spot eye and ipsilateral eye (size 88). When
viewed through red and green ®lter glasses, only the green horizontal grating could be
seen through one eye and only the red vertical grating through the other eye (spatial
frequency 0.67 cycles per degree, speed 2 Hz, direction reversal every 500 ms, contrast
75%, mean luminance through matching ®lter 3.4 cd m-2). Although both gratings were
constantly present, subjects reported alternately perceiving either the red or green
grating. b, On stimulus alternation scans, the physical stimulus alternated between the
red grating and green grating using the sequence of reported alternations from a previous
rivalry scan. Subjects reported when the stimulus changed to the red or green grating.
82
© 2001 Macmillan Magazines Ltd
NATURE | VOL 411 | 10 MAY 2001 | www.nature.com
letters to nature
W3
rivalry versus stimulus alternation (F , 1). Positive responses for
the ipsilateral grating were larger than negative responses for the
blind-spot grating (F = 102, P , 0.005) but rivalry and stimulus
alternation remained equivalent across both types of neural
response.
The equivalence between fMRI responses for rivalry and stimulus
alternation indicates that it is likely that rivalry has been entirely
resolved among monocular neurons in the V1 blind-spot representation, such that neural activity entirely re¯ects the subject's
perceptual state. Thus, functionally equivalent neural responses
are observed when the subject's conscious state alternates between
the ipsilateral grating and blind-spot grating during constant
rivalrous stimulation and when the physical stimulus itself alternates between each grating shown alone.
Our results show that binocular rivalry is resolved in monocular
visual cortex and provide physiological evidence to support interocular competition. These theories predict that left-eye versus righteye inputs are alternately suppressed during rivalry because of lateral
inhibition among monocular V1 neurons3 or feedback inhibition
from V1 to monocular layers of the lateral geniculate nucleus12. In
contrast, theories of pattern competition propose that rivalry occurs
among binocular pattern neurons at much higher levels of the visual
pathway and not among monocular neurons4. Our ®ndings therefore help to resolve the neural basis of binocular rivalry.
Previous studies have found evidence of rivalry-related neural
activity but none has established the involvement of monocular
neurons4±6,8,13±18. In a single-unit study in monkeys, only 3 out of 33
V1 neurons showed signi®cant responses corresponding to conscious perception, suggesting that rivalry takes place at higher levels
of the visual pathway4. However, reanalysis revealed that across this
V1 population, responses during rivalry equalled one-third of the
magnitude of stimulus alternation responses13. These results are
more consistent with an fMRI study showing reliable rivalry
responses in human V1 that were about half the magnitude of
stimulus alternation responses13. (Unfortunately, this study could
not isolate monocular responses.) The authors suggested many
factors that might account for the stronger rivalry effects in human
V1, including interspecies differences, the indirect nature of fMRI in
estimating neural activity, and the effects of eye movements on
single-unit recordings.
In our view, the present ®nding of equally powerful rivalry and
stimulus alternation responses strongly suggests that binocular
rivalry is resolved in monocular visual cortex. Although fMRI
provides an indirect estimate of neural activity, any factors that
might in¯ate response amplitudes during rivalry would also do so
during stimulus alternation. In contrast, certain factors may have
diluted rivalry responses in previous studies. These include suboptimal viewing conditions that lead to frequent perceptual blends,
and variability in the accuracy or timing of subjects' perceptual
report. Isolating factors that weaken rivalry responses is an important direction for future research.
Although competition among binocular pattern neurons alone
cannot account for our ®ndings, it remains possible that feedback
signals from binocular neurons to monocular neurons might yield
the interocular suppression that we observe. However, such a theory
fails to explain why pattern competition should lead to selection at
the monocular level. Furthermore, it remains unclear how feedback
projections from binocular neurons might target a speci®c monocular channel. Given these dif®culties, interocular competition
provides the most compelling explanation for rivalry in monocular
visual cortex.
Our data also address a debate regarding whether common or
separate neural mechanisms underlie binocular rivalry and related
phenomena involving pattern rivalry. For example, two low-contrast
patterns presented to one eye can weakly rival with each other1,19.
Moreover, one of two dichoptic patterns can maintain dominance
even when the patterns are frequently swapped between eyes20,21.
Such rivalry probably involves high-level pattern competition. One
proposal is that pattern competition may generally account for
binocular rivalry20,22. However, our results suggest that a separate
mechanism of interocular competition entirely accounts for the
binocular rivalry in our subjects.
Finally, our ®ndings show that neurons can re¯ect conscious
perception at a much earlier level of the visual pathway than
previously thought4,23. These results have one of two implications.
One possibility is that certain aspects of conscious vision begin to
emerge at the very earliest stage of cortical processing among monocular V1 neurons. Alternatively, our ®ndings may suggest a new role
for V1 as the `gatekeeper' of consciousness, a primary cortical region
that can select which visual signals gain access to awareness. In either
case, our study ®rmly establishes the importance of primary visual
M
cortex in binocular rivalry and conscious vision.
W3
W4
W4
W4
W3
Methods
2
Normalized MR signal change
Ipsilateral grating
Blind-spot grating
Rivalry
Rivalry
Stimulus
alternation
Stimulus
alternation
–1
MRI acquisition
Subjects were scanned at the UCLA Division of Brain Mapping on a 3T General Electric
scanner using a head coil. We collected functional images using 5±6 slices oriented
perpendicular to the calcarine sulcus with the ®rst slice beginning about 10 mm anterior to
the occipital pole (slice thickness 4 mm, inter-slice distance 1 mm, in-plane resolution
3.125 ´ 3.125 mm). We used standard T2*-weighted echoplanar imaging to localize the V1
blind-spot representation (TR = 2.5 s, TE = 45 ms, ¯ip angle 808) and for rivalry and
stimulus alternation scans (TR = 1.0 s, TE = 45 ms, ¯ip angle 458). Functional images were
superimposed on high-resolution T2-weighted anatomical images (in-plane resolution
0.78 ´ 0.78 mm). A bite bar minimized head motion.
V1 blind-spot localization
–2
Figure 4 fMRI response amplitudes for rivalry versus stimulus alternation. Amplitude of
fMRI responses (peak-to-trough difference, time window 0 to +6 s) of each subject (n = 4)
for switches to the ipsilateral grating (left) and blind-spot grating (right) during rivalry
versus stimulus alternation. Responses are normalized relative to each subject's mean
response magnitude across the four conditions. fMRI responses for rivalry versus stimulus
alternation did not reliably differ (F , 1).
198
Subjects
Four healthy right-handed volunteers (three women), aged 21±28, participated. All had
normal or corrected-to-normal visual acuity and normal stereo-depth perception. Two
subjects were right-eye dominant and two were left-eye dominant. Before the fMRI
experiment, subjects received training on how to localize their blind spot in visual space
and to report their online perception during binocular rivalry and stimulus alternation.
1
0
W4
On separate fMRI scans, subjects viewed a counterphasing black/white checkerboard
pattern with either their left ipsilateral eye or right blind-spot eye as shown in Fig. 1a. (In
the actual experiment, a real ®xation point was placed around 158 to the left of the subject's
midline and the stimulus appeared roughly on the midline at a distance that closely
corresponded to the subject's horopter.) Each scan consisted of an initial 30-s rest period
followed by eight cycles of stimulation (15 s) and rest (15 s). We calculated activation maps
and difference maps using multiple regression24. A linear model consisting of sinusoidal
response functions was ®t to the fMRI activity observed during the eight cycles of visual
84
© 2001 Macmillan Magazines Ltd
NATURE | VOL 411 | 10 MAY 2001 | www.nature.com
NeuroImage 19 (2003) 1835–1842
www.elsevier.com/locate/ynimg
Rapid Communication
People thinking about thinking people
The role of the temporo-parietal junction in “theory of mind”
R. Saxea,* and N. Kanwishera,b
a
Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
b
McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
Received 15 January 2003; revised 12 March 2003; accepted 14 April 2003
Abstract
Humans powerfully and flexibly interpret the behaviour of other people based on an understanding of their minds: that is, we use a
“theory of mind.” In this study we distinguish theory of mind, which represents another person’s mental states, from a representation of the
simple presence of another person per se. The studies reported here establish for the first time that a region in the human temporo-parietal
junction (here called the TPJ-M) is involved specifically in reasoning about the contents of another person’s mind. First, the TPJ-M was
doubly dissociated from the nearby extrastriate body area (EBA; Downing et al., 2001). Second, the TPJ-M does not respond to false
representations in non-social control stories. Third, the BOLD response in the TPJ-M bilaterally was higher when subjects read stories about
a character’s mental states, compared with stories that described people in physical detail, which did not differ from stories about nonhuman
objects. Thus, the role of the TPJ-M in understanding other people appears to be specific to reasoning about the content of mental states.
© 2003 Elsevier Science (USA). All rights reserved.
W4
Keywords: fMRI; Social cognitive neuroscience; False belief; Mentalising; Superior temporal sulcus; EBA
W1
The remarkable human facility with social cognition
depends on a fundamental ability to reason about other
people. Specifically, we predict and interpret the behaviour
of people based on an understanding of their minds: that is,
we use a “theory of mind.”1 In this study we show that a
region of human temporo-parietal junction is selectively
involved in reasoning about the contents of other people’s
minds.
Brain regions near the temporo-parietal junction (TPJ)
have been implicated in a broad range of social cognition
tasks (Allison et al., 2000; Gallagher and Frith, 2003; Green
and Haidt, 2003). Regions near the TPJ have preferential
responses to human faces (e.g., Hoffman and Haxby, 2000),
bodies (e.g., Downing et al., 2001) and biological motion
(e.g., Grossman et al., 2000). There is also some evidence
that regions within human TPJ are involved in theory of
mind (ToM). A number of studies have reported increased
responses in the TPJ when subjects read verbal stories or see
pictorial cartoons that require inferences about a character’s
(false) beliefs, compared with physical control stimuli
(Fletcher et al., 1995; Brunet et al., 2000; Gallagher et al.,
2000; Castelli et al., 2000; Voegely et al., 2001. A number
of other brain regions have also been implicated in theory of
mind; see reviews by Gallagher and Frith, 2003, and Greene
and Haidt, 2003).
What is the role of the TPJ in these tasks? ToM reasoning
depends upon at least two kinds of representation: a representation of another person per se and a representation of
that other person’s mental states (see Leslie, 1999). While a
representation of a person per se is a likely prerequisite for
* Corresponding author. Department of Brain and Cognitive Sciences,
MIT, NE20-464, 77 Massachussetts Avenue, Cambridge, MA 02139,
USA. Fax: ⫹1-617-258-8654.
E-mail address: [email protected] (R. Saxe).
1
The term “theory of mind” has a more restricted sense, referring to the
suggestion that the structure of knowledge in the mind is analogous to a
scientific theory (e.g., Carey, 1985; Wellman and Gelman, 1992). For
discussions about the so-called theory-theory, see Carruthers and Smith,
1996, and Malle et al., 2001. In this study, we use the term theory of mind
in a broader sense, to refer to any reasoning about another person’s
representational mental states (also called “belief-desire psychology,” e.g.,
Bartsch and Wellman, 1995).
1053-8119/03/$ – see front matter © 2003 Elsevier Science (USA). All rights reserved.
doi:10.1016/S1053-8119(03)00230-1
86
1836
Rapid Communication / NeuroImage 19 (2003) 1835–1842
ToM, achieving a representation of others’ mental states is
the core responsibility of a ToM. Some authors suggest that
the TPJ is involved only in the preliminary stages of social
cognition that “aid” ToM, not in ToM reasoning itself (e.g.,
Gallagher and Frith, 2003). We here provide evidence
against this suggestion, and argue on the contrary that a
region of the TPJ is selectively involved in representation
other peoples’ mental states.
Neuroimaging studies have followed developmental psychology in using “false belief” stories as the prototypical
problem for ToM reasoning (Fletcher et al., 1995; Gallagher
et al., 2000; see also Vogeley et al., 2001). In these scenarios, a character’s action is based on the character’s false
belief (Wimmer and Perner, 1983). False beliefs provide a
useful behavioural test of a ToM, because when the belief is
false, the action predicted by the belief is different from the
action that would be predicted by the true state of affairs
(Dennett, 1978). Note, though, that everyday reasoning
about other minds, by adults and children, depends on
attributions of mostly true beliefs (e.g., Dennett, 1996; Bartsch and Wellman, 1995).
Previous investigations of the neural correlates of ToM
(Fletcher et al., 1995; Gallagher et al., 2000) have compared
false belief (“theory of mind”) stories with two control
conditions: “non-theory of mind stories,” which describe
actions based on the character’s true beliefs, and “control”
stories, consisting of unrelated sentences. These authors
found that the TPJ response was high during theory of mind
stories, but was also high during non-theory of mind stories.
They concluded (see also Gallagher and Frith, 2003) that the
TPJ is not selectively involved in ToM. This conclusion
does not follow. Because the non-theory of mind stories
invite inferences about the character’s (true) beliefs, a region involved in reasoning about other minds should show
a high response to these stories, as well as to the so-called
theory of mind stories. (For an argument against the use of
unrelated sentences as the baseline condition, see Ferstl and
von Cramon, 2002.)
We propose two basic tests for a region selectively involved in ToM reasoning. First, it must show increased
response to tasks/stimuli that invite ToM reasoning (about
true or false beliefs) compared with logically similar nonsocial controls. Second, the region must respond not just
when a person is present in the stimulus, but specifically
when subjects reason about the person’s mental states. Below, we provide evidence that a subregion of the TPJ, here
called the TPJ-M, passes both these criteria for a selective
role in ToM.
his/her false belief. Descriptions of human actions required
analysis of mental causes, in the absence of false beliefs.
We compared these conditions to two non-social control
conditions, (1) mechanical inference control stories, which
required the subject to infer a hidden physical (as opposed
to mental) process, such as melting or rusting (for examples,
see Appendix 1), and (2) descriptions of nonhuman objects.
Unlike previous studies, we did not cue or instruct subjects to attend specifically to mental states. With this design
we were able to look for regions of cortex in individual
subjects that are selectively and spontaneously involved in
understanding the mental (as opposed to physical) causes of
events.
To test whether the response to ToM stories was a response to the presence of a person in the stimulus, we
presented still photographs of people, and nonhuman objects. Downing et al. (2001) reported a bilateral region near
the posterior superior temporal cortex that responds preferentially to the visual appearance of human bodies, compared
with a range of control objects (the extrastriate body area,
EBA). We tested directly the functional and anatomical
relationship between the EBA and the (proposed) TPJ-M.
W2
W2
Methods
Twenty-five healthy right-handed adults (12 women)
volunteered or participated for payment. All subjects had
normal or corrected-to-normal vision and gave informed
consent to participate in the study.
Subjects were scanned in the Siemens 1.5-(9 subjects)
and 3.0-T (16 subjects) scanners at the MGH-NMR center
in Charlestown, MA, using a head coil. Standard echoplanar
imaging procedures were used [TR ⫽ 2 s, TE ⫽ 40 (3 T) or
30 (1.5 T) ms, flip angle 90°]. Twenty 5-mm-thick nearcoronal slices (parallel to the brainstem) covered the occipital lobe and the posterior portion of the temporal and
parietal lobes.
Stimuli consisted of short center-justified stories, presented in 24-point white text on a black background (average number of words ⫽ 36). Stories were constructed to fit
four categories: false belief, mechanical inference, human
action, and nonhuman descriptions (Appendix 1). Each
story was presented for 9500 ms, followed by a 500-ms
interstimulus interval. Each scan lasted 260 s: four 40-s
epochs, each containing four stories (one from each condition), and 20 s of fixation between epochs. The order of
conditions was counterbalanced within and across runs.
Subjects were asked to press a button to indicate when they
had finished reading each story. Subjects read a total of 8 (4
subjects) or 12 (21 subjects) stories per condition.
Fourteen of the subjects from Experiment 1 (7 women)
were also scanned on an EBA localizer in the same scan
session, all at 3.0 T. Stimuli consisted of 20 grayscale
photographs of whole human bodies (including faces) in a
range of postures, standing and sitting, and 20 photographs
W2
Experiment 1
We devised a new version of the false belief stories task
(Fletcher et al., 1995) to compare reasoning about true and
false beliefs to reasoning about non-social control situations. ToM stories described a character’s action caused by
87
1840
Rapid Communication / NeuroImage 19 (2003) 1835–1842
W3
Fig. 4. Experiment 2. Average percent signal change in left and right
TPJ-M, defined in individual subjects (n ⫽ 14) as voxels that respond
significantly more to false belief (FB) than to false photo (FP) stories (P ⬍
0.0001, uncorrected. Response magnitude for these two conditions is illustrative only, since these data were used to determine the region of
interest). In the TPJ-M bilaterally the BOLD response to physical people
stories was significantly lower than to desire stories (P ⬍ 0.05), and not
significantly different from nonhuman description stories (P ⬍ 0.1,
repeated-analysis of variance). Response decreases are commonly observed in the TPJ vicinity during demanding nonsocial tasks (Shulman et
al., 1997; Gusnard and Raichle, 2001).
w2
during either physical people or nonhuman description stories (both paired samples t tests P ⬍ 0.05), which did not
differ from each other (Fig. 4). The left and right TPJ-M did
not differ. Thus, these regions are not involved in the detection of any person in verbal stories, but respond selectively to stories in which describe (or imply) characters’
mental states. Did any regions show the predicted profile of
a response to a person per se? At a lower threshold, a
separate whole brain analysis (P ⬍ 0.001, uncorrected) of
physical people ⬎ nonhuman descriptions revealed regions
of frontal cortex [dorsal medial prefrontal (⫺3 57 39), and
right lateral frontal cortex (39 15 54)].
Discussion
The results of Experiment 2 confirm that the TPJ-M
shows an increased response to stimuli that invite ToM
reasoning compared with logically similar nonsocial controls (false photograph stories). Second, the TPJ-M does not
show an increased response to the mere presence of a person
in the stimulus (physical people stories). The right and left
TPJ-M responses to physical people stories did not differ,
thus resolving the ambiguity of the apparently lateralized
response to photographs of bodies in Experiment 1.
General discussion
stimuli, and is not merely an effect of the difficulty or
logical structure of false belief stories, since the TPJ-M did
not respond to the more difficult and logically similar false
photograph stories.
We asked whether the TPJ-M represents the simple presence of another person (possibly via detecting a human
body and/or biological motion) or is involved specifically in
ToM. We found that the TPJ-M was anatomically and
functionally distinct from the nearby EBA (Downing et al.,
2001), which responded preferentially to the visual appearance
of human bodies, suggesting the presence of at least two
distinct regions involved in social information processing.
A key innovation of this study over previous studies was
the inclusion in Experiment 2 of physical people stories,
which described the physical appearance of human bodies.
Previous studies (Fletcher et al., 1995; Gallagher et al.,
2000) have included “physical” stories describing acting
people, which produced greater activation in the TPJ than a
scrambled sentence control. Our data show that the TPJ-M
response was no greater to stories that described other
people in physical detail than that to stories describing the
physical details of nonhuman objects—and was significantly lower than to stories that did invite a mental state
interpretation (desire stories).
Could the TPJ-M activation reflect mental imagery of the
biological motion or goal-directed action described in the
false belief, human action, and desire stories? We think this
is unlikely. Saxe, R., Xiao, D.K., Kovacs, G., Perrett, D.,
and Kanwisher, N. (unpublished data) found that the TPJ-M
response to a movie of a walking person was much lower
than its response to false belief stories. If the response of the
TPJ-M to verbal stories was merely a consequence of subjects’ imagining biological motion, we would predict the
opposite. Also the TPJ-M was doubly dissociated from its
neighbour, the pSTS-VA (visual analysis of action), which
responded more to the movies than to verbal stories.
In all, our results show that a region of the TPJ2 is
involved in reasoning about other minds, not just in understanding stories involving people per se (Gallagher and
Frith, 2003; p 80). But critically, neighbouring subregions
of cortex have different functional profiles, highlighting the
necessity of careful within-subject comparisons. The
TPJ-M, identified here by responses to (false) belief stories,
W4
2
What is the relationship between the TPJ-M and attention? Selective
attention leads to increases in regions of the TPJ during social perception
tasks (e.g., Narumoto et al., 2001; Winston et al., 2002), and to decreases
in regions of the TPJ during visual attention tasks (Shulman et al., 1997;
Gusnard and Raichle, 2001; Jiang Y., Kanwisher, N., unpublished data).
Downar et al. (2001) proposed “a role for the TPJ in detecting behaviourally relevant events in the sensory environment” (p. 1256) that is
interfered with by demanding visual attention. One possibility is that the
mental states of other people constitute a particular category of such
“behaviourally relevant” stimuli. Alternatively, these results may reflect
functionally and anatomically distinct subregions within the TPJ. Direct
testing of the relationship between the TPJ-M and selective attention is an
important avenue for future work.
W3
In two experiments we found greater BOLD response in
a region within the TPJ bilaterally (here called TPJ-M)
while subjects read stories that describe or imply a character’s goals and beliefs than during stories about nonhuman
objects. This pattern is robust across subjects, tasks, and
91
Rapid Communication / NeuroImage 19 (2003) 1835–1842
may play a broad role in social and even moral cognition
(Moll J et al., 2002; Greene and Haidt, 2003).
1841
False belief (FB) sample story
John told Emily that he had a Porsche.
Actually, his car is a Ford. Emily
doesn’t know anything about cars
though, so she believed John.
—
When Emily sees John’s car she
thinks it is a
porsche
ford
W4
Acknowledgments
This work was funded by grants NEI 13455 and NIHM
66696. Our thanks especially to Yuhong Jiang for comments and conversation, and to Ben Balas, Robb Rutledge,
Miles Shuman, and Amal Dorai for help with data collection
and analysis.
False photograph (FP) sample story
A photograph was taken of an apple hanging
on a tree branch. The film took half an hour to
develop. In the meantime, a strong
wind blew the apple to the ground.
—
The developed photograph shows the apple on the
ground
branch
Appendix
Experiment 1
Instructions: “Read each story silently to your self.
Please make sure you understand what is happening; it is
more important that you understand the story, than that you
go as fast as possible. When you are done reading the story,
press the button.”
Desire sample story
For Susie’s birthday, her parents decided
to have a picnic in the park. They wanted
ponies and games on the lawn. If it rained,
the children would have to play inside.
—
Susie’s parents wanted to have her birthday
inside
outside
Theory of mind (ToM) sample story
A boy is making a paper mache project
for his art class. He spends hours
ripping newspaper into even strips.
Then he goes out to buy flour. His
mother comes home and throws all the
newspaper strips away.
Physical people sample story
Emily was always the tallest kid in her
class. In kindergarten she was already
over 4 feet tall. Now that she is in
college she is 6⬘4⬙. She is a head taller
than the others.
—
In kindergarten Emily was over
4 ft
6 ft
. . .tall
Mechanical inference (MI) sample story
A pot of water was left on low heat
yesterday in case anybody wanted tea.
The pot stayed on the heat all night.
Nobody did drink tea, but this morning,
the water was gone.
Nonhuman description sample story
Human action sample story
Nine planets and their moons, plus various
lumps of debris called asteroids and
comets, make up the sun’s solar system.
The earth is one of four rocky planets
in the inner solar system.
—
The solar system has
four
nine
. . .planets.
Jane is walking to work this
morning through a very industrial
area. In one place the crane is
taking up the whole sidewalk. To
get to her building, she has to
take a detour.
Experiment 2
References
Instructions: “Please read each story carefully. After
each story, you will be given one fill-in-the-blanks question
about the story. Underneath will be two words that could fill
in the blank. Choose the correct word (to make the sentence
true in the story) by pressing the left button to choose the
left-hand word, and the right button to choose the right-hand
word.”
Allison, T., Puce, A., et al., 2000. Social perception from visual cues: role
of the STS region. Trends Cogn. Sci. 4, 267–278.
Bartsch, K., Wellman, H., 1995. Children Talk about the Mind. Oxford
University Press, New York.
Brunet, E., Sarfati, Y., et al., 2000. A PET investigation of the attribution
of intentions with a nonverbal task. Neuroimage 11, 157–166.
92
Neuron, Vol. 23, 747–764, August, 1999, Copyright 1999 by Cell Press
The Generality of Parietal Involvement
in Visual Attention
Ewa Wojciulik*†k and Nancy Kanwisher‡§
* Department of Psychology
University of California, Los Angeles
Los Angeles, California 90095
† Institute of Cognitive Neuroscience
University College London
17 Queen Square
London WC1N 3AR
United Kingdom
‡ Department of Brain and Cognitive Sciences
Massachusetts Institute of Technology
Cambridge, Massachusetts 02139
§ Nuclear Magnetic Resonance Center
Massachusetts General Hospital
Charlestown, Massachusetts 02129
W1
Summary
Functional magnetic resonance imaging (fMRI) was
used to determine whether different kinds of visual
attention rely on a common neural substrate. Within
one session, subjects performed three different attention experiments (each comparing an attentionally demanding task with an easier task using identical stimuli): (1) peripheral shifting, (2) object matching, and (3)
a nonspatial conjunction task. Two areas were activated in all three experiments: one at the junction of
intraparietal and transverse occipital sulci (IPTO), and
another in the anterior intraparietal sulcus (AIPS).
These regions are not simply involved in any effortful
task, because they were not activated in a fourth experiment comparing a difficult language task with an
easier control task. Thus, activity in IPTO and AIPS
generalizes across a wide variety of attention-requiring
tasks, supporting the existence of a common neural
substrate underlying multiple modes of visual selection.
Introduction
Visual attention, which is the ability to selectively process only a subset of the information present in the
retinal image, has been extensively studied over the last
two decades, using a wide variety of experimental tasks
and stimuli (e.g., Posner and Petersen, 1990; Allport,
1993; Desimone and Duncan, 1995). Despite the many
differences in these experimental paradigms, researchers generally refer to the selective process in each of
these situations as visual attention, implying a common
mechanism. However, very little work has been directed
to the crucial question of whether visual attention in fact
consists of a single general-purpose mechanism involved
in all forms of selective visual processing or whether it
instead consists of a heterogeneous set of different mechanisms, each involved in a different kind of selection. In
the present study, we used functional magnetic resonance
W2
W1
k To whom correspondence should be addressed at U.K. address
(e-mail: [email protected]).
94
imaging (fMRI) to address this question, asking whether
there is any region of the human brain that is activated
by each of three very different attention-requiring tasks
yet not activated by a language task that is difficult but
does not place heavy demands on visual attention.
Recent imaging results suggest that parietal areas,
especially the superior parietal lobule (SPL) and the intraparietal sulcus (IPS), participate in many different attention tasks and may subserve a general visual attention function. Parietal activity has been associated with
endogenous and exogenous shifts of spatial attention
(Corbetta et al., 1993; Nobre et al., 1997), maintenance
of attention on peripheral stimuli and divided attention
(Vandenberghe et al., 1997), feature integration (Corbetta et al., 1995), attentive tracking of moving dots
(Culham et al., 1998), nonspatial attention (Coull et al.,
1996; Coull and Nobre, 1998), object-based attention (Fink
et al., 1997), response selection to visually presented
stimuli (Iacoboni et al., 1996), object-oriented action
(Faillenot et al., 1997), and overt and covert attention
shifts (with and without eye movements, respectively;
Corbetta et al., 1998). A few studies have directly compared the effects of one type of attention with another
and found overlapping parietal activations (SPL and/or
IPS) for exogenous and endogenous attention (Corbetta
et al., 1993; Nobre et al., 1997), object-based and spacebased attention (Fink et al., 1997), spatial and temporal
orienting (Coull and Nobre, 1998), overt and covert attention shifts (Corbetta et al., 1998), and attentive tracking
and attention shifts (Culham et al., 1998). The great diversity of tasks used in previous studies, combined with
the similarity of parietal activations across them, suggests that some regions of parietal cortex may play a
very general role in visual attention, rather than supporting any one particular task-specific function. However,
because the different attentional tasks in prior studies
were rarely run on the same subjects, past results are
consistent with the importantly different hypothesis that
nearby but nonoverlapping cortical regions are activated by different attentional tasks, suggesting functional specificity.
To distinguish between these two alternatives, it is
necessary to examine brain activation at a fine grain
within individual subjects and to show that the same
voxels are activated in each of the different attentional
comparisons in a particular subject. In the present study,
each of seven subjects participated in three different
visual attention experiments within a single scanning
session. The tasks and stimuli were very different across
the three experiments (to provide a strong test of generality) yet identical in stimuli and motor requirements
within each experiment (to provide a relatively pure measure of attentional effects). Data were then analyzed
using a voxel-by-voxel analysis testing for Activation
Overlap across Multiple Tasks (AOMT) for each subject’s
data individually (see also Price and Friston, 1997; GrillSpector et al., 1998). Only voxels that showed significant
activation in each of the three different attentional tasks
are candidate loci for a common neural substrate involved in multiple types of visual selection.
Neuron
756
significant differences between the difficult language
task and the visual control task in any region (all twotailed ps . 0.25; note, however, that this analysis was
based on the AOMT regions of each subject individually,
therefore reducing the power to find greater activity for
the easier visual task than for the difficult language task,
as observed in the group analysis; see above).
In summary, there is no evidence that IPTO and AIPS
activity is related to general (not specifically visual) effort. Although the language task produced consistently
higher activity in left temporal cortex than did the easier
visual task (four of four subjects), this was not true in
the AOMT regions, as tested either with the KS test or
with ANOVA. Furthermore, all AOMT regions showed
significant difficulty by experiment interactions, with
strongest responses for the difficult visual task and only
weak responses for the difficult language or easy visual
tasks. It appears that these areas are specifically involved in visually demanding tasks.
Discussion
Figure 6. Interaction of Task Difficulty (Easy/Difficult) by Experiment
(E1c/E2) in the AOMT Regions
Results of the comparisons between the easy (gray bars) and difficult (black bars) tasks of each experiment are shown below the
corresponding bars in each graph (p values are one tailed for E1c).
The y axis plots PSC; error bars indicate standard error. PSC was
extracted for each subject and each AOMT separately.
overlap with the group AOMT voxels (Figure 5B). The
group analysis suggests that there may not have been
sufficient power in the individual subjects’ data to detect
this difference (note that the number of trials in E2 was
about half of that in each of E1a–E1c, since the difficult
language task required a longer interstimulus interval to
allow for the much longer response times; this procedure reduced the power in the individual subjects’
analyses).
Visual versus Nonvisual Effort: Interaction
of Experiment by Difficulty
The apparent interaction in the AOMT regions between
task difficulty and experiment (i.e., significant activations for the difficult versus easy tasks in E1 but not in
E2) was further tested by ANOVA. The ANOVA compared
the percent signal change (PSC, extracted for each subject and each AOMT separately) of easy and difficult
conditions of E1c and E2 (i.e., the two experiments that
all four subjects completed within the same session
as the language control experiment; see Experimental
Procedures). As shown in Figure 6, a significant interaction between task difficulty (easy and difficult) and experiment (E1c and E2) was found for all four AOMT
regions (all ps at least , 0.05). The significant interactions indicate that the AOMT areas responded most
strongly during the visually demanding task and only
weakly during all other tasks (difficult language or two
easy visual tasks). Planned comparisons, comparing difficult with easy conditions for each experiment separately, revealed significantly stronger activity for the
conjunction task than for the feature task in all four
AOMT regions (all one-tailed ps at least , 0.05) but no
W3
fMRI was used to determine whether multiple visual
attention tasks may activate common regions of parietal
cortex, therefore showing a generality of function rather
than functional specificity. Within a single session, subjects performed three different visually demanding tasks
(each with its own control condition) that differed widely
from each other in the kind of attentional selection involved as well as in stimuli and their spatial layout. The
common element across the three experiments was that
each contrasted one condition that placed high demands on visual attention with another condition (using
identical stimuli and matched motor components) in
which attentional requirements were minimal. The AOMT
analysis allowed us to look for exact voxel-by-voxel
overlap in activation within subjects across the three
attention-requiring tasks: peripheral shifting/maintenance
of attention, peripheral object matching, and a nonspatial conjunction task.
Two bilateral regions showed activation overlap
across all three visual attention experiments. One was
found in the posterior fundus of the IPS, at or close to
the junction with the transverse occipital sulcus (IPTO;
Brodmann’s area 19). Although the size of the IPTO overlap in individual subjects was small (averaging about
300 mm3 at the most stringent AOMT threshold), it was
very consistent across subjects (present in six of seven
subjects in the right hemisphere and seven of seven in
the left). The second area of activation overlap was
found in AIPS in five of seven subjects; for all five subjects, this area lay on the medial bank of the IPS (i.e.,
SPL, Brodmann’s area 7). The left AIPS was about the
same size as the more posterior AOMT at IPTO, while
the right AIPS was substantially larger (about 1 cm3).
The group analysis confirmed these results, showing
massive bilateral activation overlap through the entire
extent of the IPS, from its posterior end at IPTO, where
it extended into posterior SPL in the right hemisphere,
to the anterior segment close to the PCS (Figure 2B).
The large size of the activation overlap in the group
data (15.4 and 6.3 cm3 in right and left hemisphere,
respectively) suggests that the individual subjects’ analyses underestimated the extent of the overlap. On the
103
Parietal Involvement in Visual Attention
757
other hand, averaging across subjects produces spatial
blurring due to imperfect coregistration and differences
in anatomical and functional organization across subjects, which can possibly result in an overestimate of
the overlap in the group data. Nonetheless, even when
the group AOMT threshold was increased to p , 10215
(i.e., single comparison threshold of p , 1025), the IPS
overlap remained extensive (7.0 and 2.3 cm3 in right and
left hemisphere, respectively) but began to differentiate
into a posterior focus at IPTO and a more anterior IPS
focus, as observed for individual subjects. Thus, the
group and the individual subject analyses yielded a
highly consistent and robust pattern of activation overlap in at least two bilateral areas in the IPS. (Note, however, that it is possible that future research with higher
spatial resolution may uncover functional specialization
at a finer grain than we can resolve with present techniques.)
The overlapping activations at IPTO and AIPS suggest
that these regions perform a more general attention
function than could be inferred based on any one experiment in this study or on previous studies with single
comparisons. Alternative accounts of the overlapping
activations in terms of either more general processes
(e.g., any effortful task) or more specific ones (e.g., attentional shifting alone) can be discounted. First, while a
difficult language task (as compared with an easier visual task) performed on visually presented words produced strong activations in left superior temporal cortex, it failed to do so in IPTO and AIPS. Indeed, the
group analysis revealed stronger activity for the easier
visual task than for the difficult language task in left and
right IPTO and right AIPS, as would be expected if the
visual task placed higher demands on visual attention
than did the language task. Furthermore, a significant
interaction of experiment (E1c versus E2) by task difficulty was found in all four regions, with strongest responses for the attentionally demanding task of E1c
(conjunction task) and only weak responses for the easy
visual tasks of both experiments or the difficult language
task of E2. Thus, the IPTO and AIPS areas are not simply
activated by any effortful task.
Second, while some of the activations in both E1a
(peripheral attention shifting . central maintenance) and
E1b (peripheral object matching . central color matching) may reflect peripheral space–based selection, and
possibly active suppression of spatial distractors or of
eye movement, these mechanisms should not be active
in E1c (nonspatial conjunction . feature task). In that
comparison, all stimuli were presented at fixation and
without spatial distractors, such that subjects attended
to the same location in both conditions, and there was
no need to plan, make, or suppress eye movements.
These results show that eye movement preparation, execution, or suppression is not necessary to activate
these areas, while focused attention is sufficient for their
activation. Finally, while the stimuli and their spatial layout differed between experiments, emphasizing the general response properties of these areas, the displays
and motor requirements were kept identical within each
experiment, thus excluding any potential confounds due
to differences in retinal stimulation or motor components.
Our results fit well with the heterogeneity of attentional
W4
W4
104
deficits observed in visuospatial neglect, a disorder
characterized by a marked impairment in the ability to
detect or respond to objects in the contralesional space.
Parietal neglect patients show the classic deficits not
only in spatial attention (exogenous and endogenous
orienting as well as feature integration in the contralesional space; e.g., Posner et al., 1984; Eglin et al., 1989)
but also in object-based attention (e.g., Driver and Halligan, 1991; Behrman and Tipper, 1994) and temporal
(nonspatial) attention (Husain et al., 1997). Husain et
al. (1997) showed that when neglect patients have to
identify a target object in a rapid visual stream, their
ability to detect a second object is profoundly impaired,
even though both objects are presented in the same
(foveal) location. Furthermore, the severity of the temporal impairment correlated strongly with the magnitude
of the spatial bias, suggesting a close link between spatial and temporal attention, as observed in our data.
These and our results indicate that parietal function cannot be reduced to one single mode of visual selection;
instead, our data suggest that IPTO and AIPS reflect a
common neural substrate underlying multiple modes of
visual selective processing.
IPTO and AIPS: Location and Function
A comparison of the location of IPTO with respect to
the retinotopic areas of the human brain (Sereno et al.,
1995; Tootell et al., 1997) suggests that it is located in
the vicinity of area V3A, which is transected by the TrOS
but which does not appear to continue in the IPS. In
humans, area V3A has a retinotopic representation of
the whole contralateral visual field and shows high responsiveness to motion (Tootell et al., 1997). However,
our stimuli were stationary. In addition, the overlapping
activation at IPTO often extended beyond the junction
into posterior IPS, a region that is likely to be beyond
the borders of V3A. These considerations suggest that
IPTO may constitute a functional and anatomical region
that is separate from V3A. In a recent paper, Tootell et
al. (1998) found that, among other areas, both V3A and
“V7” (a newly discovered retinotopic visual area adjacent to V3A) were strongly modulated by spatial attention. Based on the anatomical location of V7 and its
susceptibility to attentional manipulations, it is possible
that IPTO corresponds to V7.
Several recent imaging papers reported activations
in an area that showed close correspondence to IPTO
coordinates. Culham et al. (1998) found a posterior intraparietal focus active during both discrete attentional
shifting and continuous attentive tracking. In addition,
Faillenot et al. (1997) reported a similar activation focus
in the right hemisphere in a task involving the matching
of (as opposed to pointing at or grasping) successive
novel shapes, and both Jonides et al. (1993) and Courtney et al. (1996) found that visuospatial memory tasks
also activate a similar right hemisphere region. More
surprisingly, a focus very similar to IPTO was activated
during the viewing of various objects (as compared with
textures) defined by luminance, texture, or motion (GrillSpector et al., 1998). Although this study did not manipulate attention explicitly, one might expect that attention
is more engaged by passive viewing of objects than of
random texture fields. These papers not only extend the
Parietal Involvement in Visual Attention
759
by areas in the IPS). This hypothesis is readily testable
and makes straightforward predictions: if some of the
IPS areas support an inhibitory function, they should be
more active in situations that require distractor suppression; on the other hand, if some of them contribute
to enhancement, they should show stronger activity in
attentionally demanding tasks than in control tasks, independent of the presence of distractors.
Whether or not this hypothesis is confirmed, the important point to note here is that none of the attentional
functions that have been previously attributed to parietal
regions (e.g., shifts of spatial attention and processes
related to eye movement preparation or execution) can
account for activity in IPTO and AIPS. Instead, the broad
range of attentional tasks that activate the same IPS
areas indicate that these regions play a more general
role in visual attention.
shifts of attention but rather generalizes to nonspatial
W4 attention
tasks as well (see also Husain et al., 1997).
Attention Shifts and Nonspatial Attention
in Parietal Cortex
Although the three comparisons generally resulted in
similar regions of activation, some parts of the SPL (lateral, but medial to the AOMT regions) appeared to be
activated uniquely in peripheral shifts of attention, while
activity in lateral IPS (IPL) appeared to be associated
uniquely with nonspatial attention (Figure 4D). The apparent specificity of these activations for the two different types of attention tasks has to be treated with caution
given the many differences between the experiments (in
particular, differences in the attended, peripheral versus
foveal, locations). Nonetheless, the procedure of running multiple attentional tasks in the same subjects
allows us to dissociate the activity in these regions from
visual effort (i.e., all three tasks compared difficult conditions with easy ones, but these two regions showed
activity specific to only one of the comparisons). Our
results thus extend previous findings that have associated spatial shifts of attention with posterior parietal
cortex (e.g., Corbetta et al., 1993; Nobre et al., 1997),
suggesting that parietal involvement in attentional shifting can be dissociated from visual effort.
Conversely, the results of the nonspatial conjunction
task show that spatial shifting of attention is not necessary to activate parietal cortex; extensive regions in the
IPS and SPL also subserve mechanisms involved in nonspatial attention, with lateral IPS/IPL apparently activating uniquely in the nonspatial task. Previous imaging
research produced conflicting results; whereas Rees
et al. (1997) found no parietal activity in a nonspatial
attention task, Coull et al. (1996) and Coull and Nobre
(1998) both reported activations in SPL or IPS. However,
the task of Coull et al. (1996) did not dissociate nonspatial attention from working memory, and Coull and Nobre
(1998) found only one parietal region more active in their
temporal than in their spatial cueing tasks (left IPS) or
in both spatial and temporal tasks as compared with
the neutral baseline (left parietal cortex). The most likely
reason why we found such extensive and highly consistent parietal activations and the previous studies found
none (Rees et al., 1997) or few (Coull and Nobre, 1998)
is that in our experiment, the displays contained multiple
temporal distractors. Our results thus clearly demonstrate that parietal activity is not constrained to spatial
Finally, our results extend the findings of Corbetta et
al. (1995), who reported parietal involvement in a spatial
conjunction task, to also include nonspatial conjunction.
Parietal cortex appears to be involved in feature integration, independent of whether the task depends on spatial (Corbetta et al., 1995) or temporal (E1c) selection,
although this conclusion is not yet definitive given the
differences in difficulty between the conjunction and
feature tasks of both studies.
Conclusion
In summary, we demonstrated that at least two regions
in the dorsal pathway, IPTO and AIPS, were activated
in each of three different visual attention tasks tested,
despite wide variation in task, stimuli, and spatial layout.
Their function cannot be reduced to one single type of
visual attention (e.g., attentional shifting alone) nor can
it be accounted for by effortful processing in general.
These findings provide evidence for the existence and
precise anatomical locus of a common neural substrate
underlying multiple modes of visual selective processing.
W4
W4
106
Experimental Procedures
Subjects
Seven healthy subjects (two men; all under 40 years old) participated
with informed consent in E1a–E1c within one session. Four of these
subjects (all women; three were native speakers of English, and one
was a nonnative but fluent speaker) also participated in E2. Two of
them participated in E2 within the same session as in E1, and two
in another session on a separate day; the latter two also performed
E1c within the second session. Six subjects described themselves
as right-handed, and one as left-handed. One subject’s data from
E1b were used to test attentional modulation of face processing
(Wojciulik et al., 1998). The experimental procedures were approved
by the Harvard University Committee on the Use of Human Subjects
in Research and by the Massachusetts General Hospital Subcommittee on Human Studies.
Experiment 1a: Peripheral Shifting versus Central Maintenance
Stimuli and Experimental Design
The displays (Figure 1A) consisted of one central dot and eight
peripheral dots arranged in a circle whose diameter was about 158.
The dots were light gray and appeared on a darker gray background.
A colored cross, one arm gray, the other half red and half blue, was
superimposed on the central dot. The cross rotated counterclockwise once every 4 s, such that with each rotation, the red (or blue)
part of the cross pointed to the next peripheral dot. The whole
display, except for the cross, blinked on and off every 400 ms. Each
time the display blinked on, the central dot or one of the four dots
indicated by the arms of the cross became smaller (i.e., there was
always one smaller dot in the display). The cross remained in a
particular orientation (e.g., red arm pointing to top right dot) for 4 s
(five blinks of the display) and then rotated counterclockwise, pointing to the next dot, and so on for all eight dots. During the 4 s when
the cross was in a particular orientation, on one of the five blinks
(determined randomly) the attended dot, peripheral or central, became smaller and reverted back to the larger size on the next blink.
The large dots were about 1.38 in size. When the dots became
smaller, they were about 18 in the periphery and 0.88 in the center.
There were 18 epochs in total, 6 epochs of the central (C) maintenance condition, which alternated with six periods of peripheral
shifting; the latter periods were split into 6 left (L) and 6 right (R)
visual field epochs. Each epoch (C, L, or R) lasted 16 s. For four
subjects, the order of the epochs was C-L-R, repeated six times,
and for three subjects, C-R-L, repeated six times. Before the central
Positive affect increases the breadth
of attentional selection
G. Rowe*, J. B. Hirsh*, and A. K. Anderson*†‡
*Department of Psychology, University of Toronto, Toronto, ON, Canada M5S 3G3; and †Rotman Research Institute, Baycrest Centre for Geriatric Care,
Toronto, ON, Canada M6A 2E1
Edited by Edward E. Smith, Columbia University, New York, NY, and approved October 29, 2006 (received for review June 21, 2006)
The present study examined the thesis that positive affect may
serve to broaden the scope of attentional filters, reducing their
selectivity. The effect of positive mood states was measured in two
different cognitive domains: semantic search (remote associates
task) and visual selective attention (Eriksen flanker task). In the
conceptual domain, positive affect enhanced access to remote
associates, suggesting an increase in the scope of semantic access.
In the visuospatial domain, positive affect impaired visual selective
attention by increasing processing of spatially adjacent flanking
distractors, suggesting an increase in the scope of visuospatial
attention. During positive states, individual differences in enhanced semantic access were correlated with the degree of impaired visual selective attention. These findings demonstrate that
positive states, by loosening the reins on inhibitory control, result
in a fundamental change in the breadth of attentional allocation to
both external visual and internal conceptual space.
PSYCHOLOGY
significantly linked with an increased capacity for creativity and
novel thinking.
The present study examined the hypothesis that the increased W1
cognitive flexibility and creative thinking associated with positive mood reflects a fundamental change in selective attention
(23–25). In contrast with the proposed tunnel vision of negative
affective states (9), positive affect may serve the opposite
function: to enhance the scope of attention (6). Preliminary
evidence for this assertion comes from studies examining global
precedence, whereby positive mood is associated with greater
global or holistic processing (i.e., seeing the forest before the
trees) versus local processing (i.e., the trees before the forest)
(23, 24). Under positive mood, individuals indicate a square
made of triangles is more similar to a square than a triangle.
Rather than a genuine change in the manner or breadth of how
attention may be allocated, such cognitive biases have been
interpreted within the affect-as-information framework (26),
whereby happy moods increase access to what is in mind during
the task at hand. In the case of global precedence, positive affect
will accentuate further a bias toward global configurations (27).
Consistent with attention as having a measurable ‘‘breadth’’ or
scope, research has suggested attentional focus can vary in its spatial
extent, which has led to the use of different metaphors in characterizing the nature of attention. For example, attention has been
compared with the beam from a spotlight (28, 29) or to the zoom
lens of a camera (30, 31), suggesting that attention can be either
narrowly focused or more widely distributed to include surrounding
stimuli. Although attention can act in ways unlike a spotlight (e.g.,
refs. 32–34), more than just a metaphor, convergent physiological
evidence from the extent of activation in primary visual cortex
suggests that attention does have a measurable spatial scope (35).
We hypothesize that positive affect may result in a relaxation of
attentional selection, thus increasing the breadth of the proverbial
spotlight of spatial attention.
As selective attention is associated with the inhibitory filtering
of task-irrelevant distraction (36), increased attentional breadth
would be reflected in a decreased capacity to inhibit processing
of spatially adjacent irrelevant information. Our operational
definition of broader attention in the visuospatial domain is thus
an impairment of spatial selective attention, resulting in a more
leaky filtering of unattended information, whereby ignored
information is more fully processed (37, 38). To this end, we used W2
the Eriksen flanker task (39) in which observers are asked to
selectively attend to a central target and ignore irrelevant
flanking distractors. Failure of selective attention is demonstrated when the to-be-ignored flankers influence performance,
indicated by a slowing in response to the central target when
flanked by response-incompatible letters. Also consistent with
attention 兩 emotion 兩 creativity 兩 inhibition 兩 problem solving
Open Gen Intro (1): General & familiar
V
NEUROSCIENCE
iewing the world ‘‘through rose-colored glasses’’ may be less
proverb and more empirical fact. Converging evidence
suggests that affective states are associated with changes in
attention that may affect differentially perception and cognition
(e.g., refs. 1–7). Attentional processes are those aspects of
cognition that allow the control of perception, thought, and
behavior and are generally acknowledged to depend on inhibitory control, such as suppression of irrelevant information and
response inhibition (8). It has been proposed that executive
control (5) and, thereby, the focus of attention may be influenced
by the current affective state of the observer (6). For instance,
it has been long hypothesized that arousal during negative
affective states is associated with a constriction of attentional
focus (7). Evidence for this narrowing of attention (9) sometimes
is referred to as ‘‘weapon focus,’’ where attention is narrowed at
the expense of encoding peripheral details (10). Although the
interaction between negative affect and attention is an active
focus of research, including attentional biases in affective disorders such as anxiety or depression (e.g., ref. 4), much less is
appreciated regarding the role positive affect and well-being may
have on attention.
Within the emerging field of positive psychology, the ‘‘broadenand-build’’ theory suggests that a primary function of positive
emotions is to broaden people’s thought-action repertoires (11, 12),
increasing their flexibility and enhancing their global scope. Consistent with such views, a robust and widely confirmed finding is that
positive affect is linked to a creative and more generative mindset
that results in greater cognitive flexibility across diverse situations,
including medical diagnosis (13, 14), industrial negotiations (15),
intuitive judgments (16), decision making (17), and creative
problem-solving tasks (18, 19). For example, on the remote associates task (RAT; ref. 20), a test of creative problem solving, people
are more likely to solve unusual word associations when they are in
a positive, compared with negative or neutral, mood (16, 18, 19, 21).
Similarly, positive mood generates more solutions to the Duncker
(22) candle task (18, 19), which can be solved only by using the
elements in an unconventional way. Thus, positive affect has been
Author contributions: G.R., J.B.H., and A.K.A. designed research; G.R. and J.B.H. performed
research; G.R. and A.K.A. analyzed data; and G.R. and A.K.A. wrote the paper.
The authors declare no conflict of interest.
This article is a PNAS direct submission.
‡To
Define abbreviation when it first appears in the text
www.pnas.org兾cgi兾doi兾10.1073兾pnas.0605198104
whom correspondence should be addressed. E-mail: [email protected].
© 2006 by The National Academy of Sciences of the USA
112
PNAS 兩 January 2, 2007 兩 vol. 104 兩 no. 1 兩 383–388
Results
8
Mood Induction. Positive and sad music induction increased and
decreased positive affect, respectively (Fig. 1). Mood valence and
overall arousal ratings were submitted to separate one-way
ANOVAs with four levels of the mood induction factor: initial/
preinduction and after neutral, positive, and negative mood
inductions. Valence ratings differed significantly depending on
induction phase [F(3, 66) ⫽ 28.22, P ⬍ 0.001]. The initial mood
of participants before mood induction and task performance was
slightly positive and was nonsignificantly decreased during neutral mood induction, t (66) ⫽ 1.57, P ⬎ 0.12. Positive [t (66) ⫽
4.81, P ⬍ 0.001] and negative [t (66) ⫽ ⫺4.23, P ⬍ 0.001] mood
inductions resulted in a similar magnitude of increased and
decreased positive affect relative to neutral conditions. By
contrast, the overall level of subjective arousal of participants
was consistent with a moderate level of alertness that did not
differ across mood induction phase [F(3, 66) ⬍ 1].
6
5
4
3
2
Fig. 1. Effect of mood manipulations. Participants rated degree of mood
valence after neutral, happy, and sad mood manipulations (from 1 extremely
unpleasant to 9 extremely pleasant). Dotted line, initial mood.
RAT. Consistent with a broader spread of semantic activation,
positive mood was associated with increased access to remote
semantic associations (Fig. 2a). The number of completed
remote associate items was submitted to a repeated measures
ANOVA with mood (happy, sad, and neutral) as a within-subject
variable. A main effect of mood [F(2, 46) ⫽ 3.56, P ⫽ 0.04]
revealed that RAT performance depended on mood. Significantly more RAT problems were correctly solved when participants were in a happy [mean (M) ⫽ 6.08, SD ⫽ 3.16] compared
with sad (M ⫽ 5.17, SD ⫽ 3.40, t ⫽ 2.16, P ⫽ 0.04) or neutral
(M ⫽ 4.75, SD ⫽ 2.64, t ⫽ 2.30, P ⫽ 0.031) mood. The sad vs.
neutral difference was not reliable (P ⫽ 0.50).
Average number correct (± s.e.m.)
a
8
7
6
5
4
3
2
b
60
Flanker incompatibility (ms ± s.e.m.)
the spatially limited scope of attentional focus, previous research
has found that flanker interference decreases with increasing
distance from the central target, despite response competition
demands remaining constant (39, 40). The present study manipulated distractor eccentricity to allow a more fine-grained
analysis of the influence of positive affective state on the scope
of visual selective attention. W2
If positive affect results in a more fundamental broadening of
the scope of attentional selection, then it may have a common
influence on processing of external visual stimulation and internal conceptual representations. Attentional state has been
shown to influence the scope of semantic access (41, 42).
Reduced capacity for attentional selection during positive affect
then may facilitate access to a greater diversity of semantic
W2 information (21). To index increased scope of semantic access,
we used the RAT (20), where participants are asked to override
typical semantic associations to find semantically distant or
remote associations. If positive mood is associated with an
underlying broadening of attention, from perceptual to conceptual processing, then impaired selective attention, as indexed by
the Eriksen flanker task, would be associated with facilitated
access to remote semantic associations, as indexed by performance on the RAT.
55
1
Flanker Task. Response time data were submitted to a repeated
measures ANOVA, with mood (positive, negative, and neutral),
flanker compatibility (compatible vs. incompatible), and spacing
(near, medium, and far levels) as within subject variables. Trials
with response times ⬎1,000 ms were considered incorrect and
excluded from response time analysis. Analysis of response times
was restricted to correct trials only. There was a high level of task
accuracy (93.6%).
A highly reliable main effect of flanker compatibility [F(1,
23) ⫽ 410.91, P ⬍ 0.0001] revealed that responses to incompatible trials (M ⫽ 487.56, SD ⫽ 15.75) were slower than compatible
(M ⫽ 457.68, SD ⫽ 14.62) flanker trials. There was also a main
effect of spacing [F(2, 46) ⫽ 54.75, P ⬍ 0.0001] with response
times at near (M ⫽ 491.25, SD ⫽ 16.12) significantly slower than
c
Happy
50
Sad
45
40
Neutral
35
30
25
20
15
90
80
Flanker incompatibility (ms)
Mood ratings (± s.e.m.)
7
70
60
50
40
30
20
10
10
0
5
-10
0
0
2
4
6
8
RAT (# correct)
10
12
Fig. 2. Effect of mood manipulation on task performance. (a) Correct RAT responses. (b) Magnitude of flanker task incompatibility effects in milliseconds
(incompatible minus compatible). (c) Correlation between RAT (number correctly identified) and flanker compatibility (incompatible minus compatible) under
positive mood.
384 兩 www.pnas.org兾cgi兾doi兾10.1073兾pnas.0605198104
113
Rowe et al.
W3
Response time (ms ± s.e.m.)
a
530
520
Happy
510
Sad
500
Neutral
490
480
470
460
450
440
430
Comp
500
480
460
440
420
520
500
480
460
440
420
Near
Medium
Far
Happy
540
520
500
480
PSYCHOLOGY
Incomp
520
Sad
540
460
440
420
Near
Medium
Far
Near
Medium
Far
Fig. 3. Effect of mood manipulation and flanker distance on compatibility effects. (a) Compatibility collapsed across flanker distances. (b–d) Neutral (b), sad
(c), and happy (d) mood at near, medium, and far flanker eccentricities. Comp, compatible flanking distractors; Incomp, incompatible flanking distractors.
W3
both medium [M ⫽ 462.07, SD ⫽ 13.92, F(1, 23) ⫽ 89.00, P ⬍
0.0001] and far [M ⫽ 464.54, SD ⫽ 15.61, F(1, 23) ⫽ 74.60, P ⬍
0.0001] levels of spacing. The effect of flanker spacing interacted
with flanker compatibility [F(2, 46) ⫽ 4.69, P ⬍ 0.02] such that
compatibility effects were reduced with greater flanker distance,
with near versus far spacing revealing a 45% reduction in the
effect of flanker compatibility.
Consistent with the hypothesized influence of positive affect
on visual selective attention, positive moods resulted in greater
flanker interference relative to both sad and neutral moods (Fig.
2b). There was a marginal overall slowing on positive relative to
both sad and neutral moods [F(2, 23) ⫽ 3.09, P ⬍ 0.06], which
was largely due to an interaction between mood and compatibility [F(2, 46) ⫽ 3.86, P ⬍ 0.03], with a disproportionate slowing
to incompatible flankers under positive mood (Fig. 3a). A
focused test on the interaction revealed that positive moods
resulted in greater incompatibility effects relative to neutral
[F(1, 46) ⫽ 6.56, P ⬍ 0.02] and sad moods [F(1, 46) ⫽ 4.90, P ⬍
0.04]. Sad and neutral moods did not differ (F ⬍ 1). Focused
analyses on incompatible distractor trials demonstrated that
positive moods resulted in pronounced slowing relative to both
sad [F(1, 46) ⫽ 30.37, P ⬍ 0.0001] and neutral mood [F(1, 46) ⫽
39.41, P ⬍ 0.0001]. Sad mood did not result in additional slowing
relative to neutral moods on incompatible trials (F ⬍ 1).
Even as flanker eccentricity increased, positive mood resulted
in pronounced slowing relative to negative and neutral moods
(Fig. 3 b–d). A significant three-way interaction revealed that the
effect of spacing on flanker compatibility was influenced by
mood [F(4, 92) ⫽ 3.43, P ⬍ 0.02]. Critically, under neutral mood,
the effect of flanker compatibility at the far distance was no
longer significant [F(1, 92) ⬍ 1], consistent with abolished
processing of flanker content (Fig. 3b). By contrast, under
positive mood, far incompatible f lankers maintained pronounced and greatest interference relative to compatible flankRowe et al.
ers [F(1, 92) ⫽ 31.12, P ⬍ 0.0001; Fig. 3d]. Although flanker
compatibility effects still were found during negative mood [Fig.
3c; F(1, 92) ⫽ 10.93, P ⬍ 0.002], focused analyses of incompatible
flanker trials demonstrated that positive mood resulted in highly
robust interference relative to both negative [F(1, 46) ⫽ 31.12,
P ⬍ 0.0001] and neutral moods [F(1, 46) ⫽ 40.07, P ⬍ 0.0001].
Further illustrative of the importance of mood relative to flanker
distance, during positive mood, incompatible distractors at far
distances (M ⫽ 498 ms) resulted in statistically equivalent
response times to near distractors during neutral (M ⫽ 491) and
negative (M ⫽ 501) mood (P ⬎ 0.3).
NEUROSCIENCE
Neutral
d 560
560
Response time (ms ± s.e.m.)
Response time (ms ± s.e.m.)
Comp
540
Response time (ms ± s.e.m.)
c
b 560
Incomp
Relation Between RAT and Flanker Performance. We next examined
individual differences in enhanced access to remote associates and
impaired visuospatial selective attention associated with positive
mood. Although these tasks tap putatively different domains of
cognitive function, individual differences in performance during
positive mood may reveal a common underlying influence on
information processing. After removal of 2 response time outliers
(⬎4 SDs), a significant correlation was found between the number
of remote associates correctly identified and slowed response times
associated with flanker incompatibility (incompatible minus compatible) (r ⫽ 0.49, P ⬍ 0.02; Fig. 2c). As such, during positive mood,
individuals with the greatest breadth in semantic access (indexed by
number of remote associates accessed) demonstrated the most
pronounced visuospatial attentional breadth (indexed by increased
flanker incompatibility effect). This finding did not reflect a more
general association between RAT and flanker performance, or
generalization to negative affective states, as indicated by the lack
of significant correlation under neutral (r ⫽ ⫺0.09) and negative
moods (r ⫽ 0.10).
Discussion
Relative to both neutral and sad mood, positive mood was
associated with increased capacity to generate remote associates
114
PNAS 兩 January 2, 2007 兩 vol. 104 兩 no. 1 兩 385
W3
mation (37, 48). In addition, compared with perceptual load
for familiar words (16, 18). This finding is consistent with a
proposed broadening function of positive affect (11, 12). We
(e.g., difficult perceptual discrimination), manipulating cognishow here this broadening in informational access is not retive load (e.g., working memory) results in late relative to early
stricted to its beneficial search for semantically distant associaattentional filtering (37, 38). With regard to performance on the
tions. It extends to a detrimental influence on visuospatial
Flanker task, central target processing difficulty was manipuselective attention. Positive affect impaired the ability to seleclated through perceptual load via perceptual crowding (i.e., near
tively focus on a target and thereby increased processing of
versus far flankers), which is known to impair perceptual enspatially distant flanking distractors, consistent with expanded
coding (49–51). Paralleling the present results, such perceptual
scope of the proverbial attentional ‘‘spotlight.’’ Positive moods
interference effects are suppressed by focal attention (52, 53)
thus facilitate tasks requiring a more global (24) and encomand exacerbated by more diffuse spatial attention (50). It is
passing style of information processing, such as in the RAT, but
important to note that this attentional-load account need not
impair those calling for a narrow, focused style, such as selective
suggest that decreased effort is applied under positive mood.
W4 visual attention. A buoyant mood may represent a fundamental Enhanced RAT performance indicates positive affect did not
shift in the breadth of information processing, the result of which
decrease effort or primary task engagement. Rather, we suggest
would be to cultivate a more open and exploratory mode of
the easing of inhibitory control alters the quality of attention,
attention to both exteroceptive and interoceptive sources of
resulting in a shift from a narrow focused state to a more broad
information. We did not, however, observe that sad mood
and diffuse attentional focus.
resulted in the opposite influence of positive mood (6). This may
In parallel with influences on perceptual resources, positive
reflect that mild melancholic states evoked by music are not
affect may influence the allocation of cognitive resources (5, 21)
exclusively aversive, reflecting mixed-feeling states (43). The
to postsemantic levels of analysis. As such, attentional-load
evocation of anxiety or fear-related states may be necessary for
theory also may account for the effect of positive mood on
the proposed attentional narrowing of negative affect (9, 10).
semantic processing. Similar to studies examining the role of
In addition to the hypothesized increased breadth of attenattention on perceptual encoding, examinations of semantic
tional selection during positive mood, it has been suggested that
memory have demonstrated that attention plays a prominent
affective states selectively modulate task-related neural activity
role in semantic encoding. Attention to specific semantic feathe prefrontal cortex (e.g., ref. 44), paralleling the antetures has been shown to bias semantic retrieval toward the
Mid Genwithin
Dis
(5):
rior hemispheric asymmetries thought to support positive and
attended dimension, inhibiting access to unattended associations
affect (45). Gray et al. (44) found negative states
Relate tonegative
literature
(42). Also, studies examining semantic priming under different
facilitated tasks supported by the right hemisphere, such as
arousal-related attentional states show that focal attention invisual working memory, whereas positive states benefited verbal
creases priming for strong associates, whereas diffuse attentional
working memory, supported by the left hemisphere. Evidence
focus results in increased priming for weak associates (41). With
for enhanced semantic search and impaired visuospatial selecregard to performance on the RAT, greater attention toward a
tive attention shown here may be interpreted within this heminarrow set of semantic features will inhibit retrieval of more
spheric lateralization framework. However, positive affect has
remote/weak semantic associations. For example, biased attenbeen linked with facilitating a broader, more generative mindset
tion to the most strongly associated semantic features of the cue
(11, 12, 18, 46) across diverse situations that include verbal and
‘‘widow’’ (i.e., the death of one’s husband) would inhibit access
visual materials (13, 14). As such, the pattern of facilitation and
to more remote associations (i.e., spider) in common with the
interference in the present results is unlikely related to material
cue’s ‘‘bite’’ and ‘‘monkey.’’ More diffuse attentional focus under
specificity (verbal vs. visual) but rather is indicative of a shift in
positive affect would attenuate such biases, broadening the scope
mode of attentional selection that operates on these visual and
of cognitive resources to include more semantically remote
verbal materials. Mid Gen Dis (4): Discuss scope & limitation
associations on which creative solutions depend.
An analysis of individual differences revealed that Flanker
In summation, the present results suggest a unifying frameinterference during positive mood was correlated with enhanced
work in which to understand the interaction between positive
RAT performance, suggesting a common origin. Such an unaffect, attention, and creativity. Mirroring the broad attentional
derlying increase in breadth of processing, extending from
focus and enhanced creative problem solving during positive
external perceptual to internal conceptual processing, may be
mood, individual differences in creative intelligence have been
derived from a central origin in cognitive or inhibitory control.
associated with decreased attentional filtering. For instance,
Cognitive control is thought to limit the amount of information
creative individuals have been shown to exhibit less latent
entering the focus of attention (8, 36) and, thereby, influences
inhibition, a measurement of attentional decrements to stimuli
the capacity for selective attention (47). Similar to the effect of
deemed irrelevant (54). As such, positive affect may represent a
positive mood, individual differences in cognitive control are
fundamental shift in information processing style, reflecting a
associated with the capacity for selective attention, as indexed by
relaxation of inhibitory control and, thereby, reducing the tensusceptibility to the ‘‘cocktail party effect,’’ where observers
dency to narrowly focus attention across disparate informational
report hearing their own name on an unattended ear during
domains. The result of this altered capacity for attentional
dichotic listening (47). We suggest that positive mood reflects a
selection is a broadening of thought-action routines (11), englobal relaxation of inhibitory control (8, 36), resulting in an
gendering a broad exploratory (11, 18, 55) rather than narrow
altered capacity for selective attention from early perceptual to
vigilant processing mode (25, 56). Donning the proverbial roselater postsemantic levels of analysis.
colored glasses of positive mood then may be less about the color
How altered inhibitory control may alter the extent of visuoand more the expansiveness of the view.
spatial and semantic processing may be best understood with
Final Gen Dis (7): End with conclusion
respect to the ‘‘attentional-load’’ theory of selective attention
Materials and Methods
(37), which attempts to integrate early (perceptual) and late
that fits the first paragraph in Intro
Participants and Design. Twenty-four university students (12 fe(postsemantic) models of attention. Attentional-load theory
male) participated in a multipart experiment for either course
suggests that inhibitory suppression of ignored events is related
credit or monetary remuneration. All participants were tested
to the degree (48) and type (37) of cognitive resources allocated
between 11 a.m. and 1 p.m., in the middle of the peak and
to a primary task. To the extent that attentional resources are
off-peak periods of their circadian cycle (for a review of circamore available, inhibition of ignored events will be decreased,
resulting in leaky and ineffective filtering of unattended infordian arousal patterns, see ref. 57).
386 兩 www.pnas.org兾cgi兾doi兾10.1073兾pnas.0605198104
115
Rowe et al.
Manuscript Triage
Standard definition of medical triage. The process of sorting people based on their need
for immediate medical treatment, traded off against their chance of benefiting from such
care.
Translation for publishing manuscripts. The process of sorting original (or revised) MSs
to journals based on the chance of acceptance, traded off against the quality and desired
audience of the journal.
A good triage process will obtain the greatest gain in outcome (journal quality and
audience) for the invested resources. In the case of MSs, time & energy are the most
valuable resources.
Principle 1 (develop a triage process): A lab would benefit from reaching a consensus
about a systematic process for manuscript triage.
Principle 2 (no more than two weeks of “down time”): A manuscript should not sit idle
on any ones’ desk during the writing, revising, submission, or re-submission process for
more than about two weeks. Thus, except for two week periods of “dead time” in which
nothing is being done, one of the following things always should be happening:
New articles:
(a) Selecting target journal (TJ1) and back-up 2nd tier journal (TJ2);
(b) Reviewing journal policy and instructions to authors;
(c) Writing the new MS;
(d) Peer review of the new MS by lab mates;
(e) Revising a new MS with the lab director (required: two week turn around by
author and lab director);
(f) Papercheck.com (guaranteed three day turn around).
MSs returned from journals (one or more of the following should be happening at all
times):
(a) designing and running additional subjects/new analyses required by reviewers;
(b) following “bottom-up process for writing and reviewing revisions based on
reviewer comments;
(c) peer review of revisions;
(d) Reviewing (completed) revision of a manuscript with lab director (two week turn
around for each person);
(e) Reviewing the decision about which journal to submit to;
(f) Papercheck.com (guaranteed three day turn around).
The following example of a MS triage assumes that before submission you have made in advance
a wise decision about the first tier journal and back-up second tier journal.
Legend: TJ1 = target journal 1; TJ2 = back-up target journal 2;
AE = action editor; RC reviewer comments; EC editor comments;
= follow strategy below.
Initial submission to TJ1
What is the best way
to make this decision?
BOTTOM UP!
Rejected with no RC or EC
Submit to TJ2
Rejected: do not
resubmit.
Rejected with option
to re-submit to TJ1
Strategy 1*: Revise
using any and all
“usable” RC or EC
Strategy 1*: Revise
using any and all
“usable” RC
Submit to TJ2
* You may want to take
only a few comments for
fast turn round, or you may
take most or all for a major
revision
Strategy 2: Revise
using ALL RC
Accepted pending
revisions based on RC
Strategy 3: Revise
using ALL RC:
1) Never change a
word in the MS unless
it directly is required
by an RC
2) Briefest possible
changes;
Submit to TJ2
Re-submit to TJ1
* Put your emotion aside, and think positively
about the improvement of the MS through this
revision process.
3) Letter to AE
contains itemized
response containing
each RC statement to
which you made a
change; followed by
the quote (page,
paragraph, line) of the
change that was made.
Re-submit to TJ1
Return to Checklist for Revision
Bottom-up decision strategy for
“Rejected with option to resubmit to TJ1”
(Revision is stressful; therefore, efficiency is extremely important)
Follow these five bottom-up steps to help reach a decision about whether to:
1) Respond to any and all “usable” RC and submit to TJ2, or;
2) Respond ALL RC and resubmit to TJ1.
Note: in (2), “respond to” means either: (a) to change the MS, or (b) do not
change the MS, but explain why in the letter to the editor. (examples)
Step 1: Separate and quote (“Divide and conquer”)
A reviewer’s comments may contain 5 or 6 separate criticisms within one
paragraph that require a change in the MS. Begin by separating and numbering
the Reviewer’s comments into all the parts that require a revision or a comment to
the editor (and no change to the MS).
Objective: (1) this forces you to analyze very carefully each separate instruction
for a change; and (2) it is the beginning of the letter to the editor. This may be a
very long series of separate comments. If two reviewers make the same comment,
you can list those together in a section at the end, but preferably separate each
reviewer’s comments and response to them in the letter when submitting the
revision.
Return to Checklist for Revision
Example 1
Reviewer 1
Comment 1.1:
“The reference on page 2, paragraph 3, line 12 is incorrect, It should be Smith
(2008)”
Comment 1.2:
“The authors should explain if the instructions were given before or during the
second experimental procedure.”
Comment 1.3:
“The description of the xxx is not clear on page 4, paragraph 2, line 12. This
should be spelled out clearly.”
Comment 1.4:
“The authors only mention one theoretical framework; namely, dual-process
theory. They should mention the other competing frameworks and relate these to
the results of their study.”
Step 2: Eliminate all comments you do not want to respond to
with a change in the MS (“Pull out the weeds”).
Example 2 (highlight the “weeds” in red)
Comment 1.1:
“The reference on page 2, paragraph 3, line 12 is incorrect, It should be Smith
(2008)”
Comment 1.2:
“The authors should explain if the instructions were given before or during the
second experimental procedure.”
The procedure was explained by adding on page 12, paragraph 3, line 12
Comment 1.3:
“The description of the xxx is not clear on page 4, paragraph 2, line 12. This
should be spelled out clearly.”
Comment 1.4:
“The authors only mention one theoretical framework; namely dual-process
theory. They should mention the other competing frameworks and relate these to
the study.” I don’t think we should do this.
Step 3: Revision plan: write notes to yourself about the decisions
for each separate comment. (“Don’t try to eat the whole
meal in one bite.”)
Example 3
Comment 1.1:
“The reference on page 2, paragraph 3, line 12 is incorrect, It should be Smith
(2008)”
Change as suggested
Comment 1.2:
“The authors should explain if the instructions were given before or during the
second experimental procedure.”
We can explain by noting the xxx xxx xxx.
Comment 1.3:
“The description of the xxx is not clear on page 4, paragraph 2, line 12. This
should be spelled out clearly.”
We can mention xxx and cite our earlier paper (ref) here
Comment 1.4:
“The authors only mention one theoretical framework; namely dual-process
theory. They should mention the other competing frameworks and relate these to
the study.” I don’t think we should do this.
Step 4. Consult collaborators and revise the bottom-up revision
plan (“Multiple brains are better then one.”)
Ask the co-authors to study this document. Meet with them and have a discussion
of each point. Then revise the bottom-up revision plan. . If they are not at the
same university, ask them to revise using tracked changes and return.
Step 5: Make the decision (bottom up) and revise the MS (“Decide
and revise.”)
(a) Read together and make re-submission decision (TJ1 of TJ2). Read
through the finished document and decide whether all the decisions, when
taken together, add up to:
1) Respond to any and all “usable” RC and submit to TJ2, or
2) Respond ALL RC and resubmit to TJ1
Two kinds of “responses” are possible under option 2
1) Revise the MS according to all RC.
2) Revise using most of the RC, but provide justification for why several RC
were not followed. In this case, a good statement to make is:
“One reviewer suggested that we should mention the other competing
frameworks (other than dual process theory) and relate these to the study
(Reviewer 1.4). But after carefully considering this suggestion, we ended up
agreeing with the other three reviewers who did not suggest this change.”
(b) Revise the MS in serial order. Once the decision is made (TJ1 or TJ2) assign
the authors to be responsible for revising in response to specific RC that match
their expertise. It is important that each author completes their revision and then
passes that version on the next author – Do not have multiple versions because
it’s a huge waste of time to put them all together!
Important note: As each author revises the MS, also modify this document,
because will become the letter to the editor. Each revision should be quoted
exactly just beneath the numbered RC (unless it’s a huge change to an entire
section of the MS). Send this revised letter along with the revised MS to the next
author who will revise their section. At the end, the entire letter to the editor now
will be completed, except for putting the exact page no.; paragraph no.; and line
no.; of each change.
i) Assign the revision order and revise in serial order passing from one author to
the next using tracked changes and initials after the insertions.
There are several ways to do this. First, relatively simple revisions, you could
assign the revision to the co-authors who are responsible for that portion of the
manuscript (e.g., the analysis or the experimental procedure). That is the
procedure illustrated here and below in the next sections. Second, for more
complex revisions, you could assign the order of revision by section of the
paper (e.g., method, results, discussion, etc) or by the reviewer, or by the
concept being revised. Senior author judgment is required to make these
assignments.
1) Alice will revise first, then pass the MS with her revisions to Brian.
2) Brian will do his parts, then send to Cindy
3) Cindy will complete her parts and reviews the those MS.
Example 4
Comment 1.1:
“The reference on page 2, paragraph 3, line 12 is incorrect, It should be Smith
(2008)”
Change as suggested
Alice is responsible for doing this one
Comment 1.2:
“The authors should explain if the instructions were given before or during the
second experimental procedure.”
We can explain by noting the xxx xxx xxx
Brian is responsible for this one
.
Comment 1.3:
“The description of the xxx is not clear on page 4, paragraph 2, line 12. This
should be spelled out clearly.”
We can mention xxx and cite our earlier paper (ref) here
Cindy will do this one and then review the entire MS
Comment 1.4:
“The authors only mention one theoretical framework; namely dual-process
theory. They should mention the other competing frameworks and relate these to
the study.” I don’t think we should do this.
(The MS is not revised and the nearly finished letter to the editor is on the next
page. If you submit to TJ2 you still may want to sent this letter to show openly
where you are in the revision process for this MS)
ii) Then each author completes the revision in serial order.
Example 5 (Almost finished letter to the editor)
Comment 1.1:
“The reference on page 2, paragraph 3, line 12 is incorrect, It should be Smith
(2008)”
“The first application of this paradigm was by Smith (2008)”
Page xx; paragraph yy; line zz
( by Alice)
Comment 1.2:
“The authors should explain if the instructions were given before or during the
second experimental procedure.”
“The instructions were given while the subjects was responding to the stimuli.”
page xx; paragraph yy; line zz
(by Brian )
.
Comment 1.3:
“The description of the procedure is not clear on page 4, paragraph 2, line 12.
This should be spelled out clearly.”
“The entire procedure begins with the subject receiving 25 training trials in the
same manner described by Lee and Thomas (2008).”
page xx; paragraph yy; line zz
(by Cindy)
Comment 1.4:
“The authors only mention one theoretical framework; namely dual-process
theory. They should mention the other competing frameworks and relate these to
the study.” I don’t think we should do this.
Now the MS is revised and the letter to the editor is complete except for the final
page, paragraph, and line numbers of the changes.
SPECIAL ARTICLE
How to reply to referees’ comments when
submitting manuscripts for publication
Hywel C. Williams, PhD
Nottingham, United Kingdom
Background: The publication of articles in peer-reviewed scientific journals is a fairly complex and stepwise process that involves responding to referees’ comments. Little guidance is available in the biomedical
literature on how to deal with such comments.
Objective: The objective of this article is to provide guidance to novice writers on dealing with peer
review comments in a way that maximizes the chance of subsequent acceptance.
Methods: This will be a literature review and review of the author’s experience as a writer and referee.
Results: Where possible, the author should consider revising and resubmitting rather than sending an
article elsewhere. A structured layout for responding to referees’ comments is suggested that includes the 3
golden rules: (1) respond completely; (2) respond politely; and (3) respond with evidence.
Conclusion: Responding to referees’ comments requires the writer to overcome any feelings of personal
attack, and to instead concentrate on addressing referees’ concerns in a courteous, objective, and evidencebased way. (J Am Acad Dermatol 2004;51:79-83.)
P
British Specialist Registrars in dermatology, and I am
grateful to them for helping me to develop the
learning themes.
I have deliberately not entered into any discussions on the quality of peer review8 or the value
of peer review in publication because it is still hotly
debated if peer review really helps to discriminate
between good and bad research or whether it simply
improves the readability and quality of accepted
articles.9 Instead, I have decided to stick to providing
what I hope is helpful and practical guidance within
the system that already exists.
lenty of guidance is available on conducting
good research,1,2 and Web sites of most scientific journals give clear and helpful instructions
on what is suitable for submission and how to
submit. Yet where does one obtain guidance on
replying to referees’ (peer reviewer) comments once
the manuscript is returned? I could find little in the
literature dealing with this important topic.3-7
This article attempts to address this gap by providing some helpful tips on how to reply to referees’
comments. In the absence of any systematic research
to determine which strategies are best in terms of
acceptance rates, the tips suggested below are based
simply on my personal experience of publishing
approximately 200 articles, refereeing more than 500
manuscripts, and working as an editor for 3 dermatology journals. I have presented some aspects of the
work previously in two workshops with groups of
THAT LETTER ARRIVES FROM THE
JOURNAL
After laboring for many months or years on your
research project and having written many manuscript drafts to send off your final journal submission,
a letter or electronic-mail message from the journal
arrives several weeks later indicating whether the
journal editor is interested in your manuscript. At this
stage, it is every author’s hope that the manuscript is
accepted with no changes, yet such an experience
is incredibly rareeit has happened to me only twice,
and these were both commissioned reviews. More
commonly, one of the following scenarios ensues.
From the Centre of Evidence Based Dermatology, Queen’s Medical
Centre.
Funding sources: None.
Conflicts of interest: None identified.
Reprint requests: Hywel C. Williams, PhD, Centre of Evidence Based
Dermatology, Queen’s Medical Centre, Nottingham NG7 2UH,
United Kingdom. E-mail: [email protected].
0190-9622/$30.00
ª 2004 by the American Academy of Dermatology, Inc.
doi:10.1016/j.jaad.2004.01.049
127
YMJD1792_proof 9 June 2004 3:19 pm
79
80 Williams
J AM ACAD DERMATOL
JULY 2004
Table I. Three golden rules of responding to referees’ comments
constructive suggestions to help you get the manuscript published, it might be a safer bet to follow their
wishes of a complete rewrite. It might be difficult for
the editor to then turn you down if you have done
exactly what was asked of you. If, on the other hand,
the request for a complete rewrite is a cold one, ie,
without suggestions as to exactly what needs to be
done and where, then it might be better to reflect on
the other comments and submit elsewhere.
Referees may recommend splitting a manuscript if
it is part of a large study that tries to cram in too many
different results. Such a request from one of the
referees may appear like a gift to the authoretwo for
the price of one. But a word of warningeif you are
going to redraft the original manuscript into two
related manuscripts, there is no guarantee that both
will be accepted. The best thing under such
circumstances is to have a dialogue with your editor
to test how receptive they would be to having the
manuscript split into two.
Rule 1. Answer completely
Rule 2. Answer politely
Rule 3. Answer with evidence
Accept with minor revision
If you are lucky, the letter will ask for only minor
revisions. In such circumstances, it is probably best to
simply get on with these changes without invoking
too much argument. If you send the revised manuscript back to the editor quickly, it is still likely to be
fresh in his or her mind, and you will probably get
a speedy acceptance.
Major revisions needed
The most common form of letter is one that lists 2
or 3 sets of referees’ comments, some of which are
quite major. In such circumstances, you will need to
work hard at reading and replying to each referee in
turn, following the layout and 3 golden rules (Table
1) that I will develop later in this article. Such
a process can take days to complete, so do not
underestimate the task. Only you can decide whether
such an investment of time is worthwhile. My advice
is always to revise and resubmit to the same journal if
the comments are fair, even if responding to them
takes a lot of time. Some authors go weak at the
knees when requested to do a major revision, and
instead simply send the manuscript elsewhere. This
is understandable, but the authors should still try and
make improvements to the manuscript in light of the
referees’ comments. Authors should also be aware
that in certain fields of research, their work is likely to
end up with the same referee when they send their
manuscript to another major specialty journal. It will
not go down well with that referee if they see that the
authors have completely ignored the referees’ previous comments. So, generally speaking, my advice
is to put in the time needed to make a better
manuscript based on the referees’ comments, and
resubmit along the lines suggested. If you do submit
to another journal, you should consider showing the
latest journal the previous referees’ comments and
how you have improved the article in response to
such commentsesome journal editors feel positively
about such honesty (J. D. Bernhard, MD, written
communication, November 2003).
Unsure as to rejection or possible
resubmission
The wording of some journal response letters can
be difficult to interpret. For example, phrases such as
‘‘we cannot accept your manuscript in its current
form, but if you do decide to resubmit, then we
would only consider a substantial revision,’’ may
sound like a rejection, yet in reality, it may indicate an
opportunity to resubmit. If you are unsure on how to
read between the lines, ask an experienced colleague or, better still, someone who works as a referee for that journal. Failing that, you could simply
just write back to the editor to ask for clarification.
Sometimes, a journal will ask you to resubmit your
article in letter format rather than as an original
manuscript. You then have to decide if the effort
versus reward for resubmission elsewhere is worth it,
or if you are content to accept the bird in the hand
principle and resubmit your original manuscript as
a letter.
The outright rejection
Usually this type of letter is quite short, with very
little in the way of allowing you an opportunity to
resubmit. Outright rejection may be a result of the
manuscript being unsuitable for the journal or because of ‘‘lethal’’ methodologic concerns raised by
the referees that are nonsalvageable. For example,
doing a crossover clinical trial on lentigo maligna
with an intervention such as operation that has a permanent effect on patient outcomes in the first phase
of the crossover study. Sometimes the editors, who
are always pushed for publication space, simply did
not find your article interesting, novel, or important
Journal requests a complete rewrite
Only you can decide if the effort of a complete
rewrite is worth it. If it is clear that the referees and
editor are interested in your manuscript and they are
doing everything they can to make detailed and
128
YMJD1792_proof 9 June 2004 3:19 pm
Williams 81
J AM ACAD DERMATOL
VOLUME 51, NUMBER 1
Table II. Some useful phrases to begin your replies
to critical comments
enough to warrant inclusion. You will just have to
live with that and submit elsewhere.
Dealing with outright rejection of your precious
sweat and toil may not be easy, especially if the
journal has taken ages to get back to you. You have
two main choices at this stage. If you believe that the
referees’ comments are grossly unfair or just plain
wrong, you can write to the editor to appeal the
decision and ask for new referees. The success of
such appeals depends on how confident you are that
their decision was out of order and whether the real
decision for rejection was indeed based on those
comments transferred to you. Appeals such as this
are rarely successfuleI have done it twice with the
BMJ, and failed both times.
The other (better) option is to stop snivelling, pick
yourself up, and resubmit elsewhere. If you do this, it
is important that you read and objectively assess the
referees’ comments from the journal that has turned
down your manuscript. This is for two reasons: (1)
those comments may improve the article; and (2) as
stated earlier, your manuscript may end up with the
same referee even if you send it to another journal. If
you are really convinced that your manuscript is
earth-shattering, then you should not automatically
resubmit to a journal that might offer easier acceptance. It has been my experience that sometimes
a manuscript that was rejected by a medium-ranking
dermatology journal is subsequently accepted by
a higher-ranking oneesuch is the unpredictability of
peer review and journal editor preferences.9
We agree with the referee that ___, but. . .
The referee is right to point out ___, yet. . .
In accordance with the referees’ wishes, we have now
changed this sentence to___.
Although we agree with the referee that. . .
It is true that___, but. . .
We acknowledge that our manuscript might have
been ___, but. . .
We, too, were disappointed by the low response rate.
We agree that this is an important area that requires
further research.
We support the referee’s assertion that ___, although. . .
With all due respect to the reviewer, we believe that this
point is not correct.
separate comments (eg, comment 1.1, 1.2, 1.3), then
answer them in turn. Even if some of the comments
are just compliments, repeat these in your cover
letter followed by a phrase such as ‘‘we thank the
referee for these comments.’’
Rule 2: Answer politely
Remember that nearly all referees have spent at
least an hour of their personal time in refereeing your
manuscript without being paid for it. If you (as a lead
author) receive a huge list of comments, it usually
means that the referee is trying very hard to help you
improve the manuscript to get it accepted. Rejection
statements are usually short, and do not allow you an
open door to resubmit.
It is quite all right to disagree with referees when
replying, but do it in a way that makes your referees
feel valued. Avoid pompous or arrogant remarks.
Although it is only human nature to feel slightly
offended when someone else dares to criticize your
precious work, this must not come across in your
reply. Your reply should be scientific and systematic.
Get someone else to read your responses before
sending them off.
Try to avoid opening phrases such as ‘‘we totally
disagree’’ or ‘‘the referee obviously does not know
this field.’’ Instead, try to identify some common
ground and use phrases starting with words such as
‘‘we agree with the referee, however. . ..’’ A list of
helpful phrases that I have developed over the years
is given in Table 2 for guidance.
THE 3 GOLDEN RULES OF STRUCTURING
YOUR RESPONSE LETTER
Rule 1: Answer completely
It important that all of the referees’ comments are
responded to in sequence, however irritating or
vague they may appear to you. Number them, and
repeat them in your cover letter using the headings
such as ‘‘Reviewer 1,’’ then ‘‘Comment 1,’’ followed
by ‘‘Response.’’ What you are doing here is making
the editor’s and referees’ jobs easy for themethey
will not have to search and cross-reference a lot of
scripts to discover what you have doneeit will all be
there in one clean document.
Typing out or paraphrasing the referees comments as a means of itemizing the points also
achieves two other things: (1) it forces you to listen
to what the referees actually said, rather than what
you thought they might have said when you first read
their comments; and (2) it helps you to understand
how many separate points are being made by the
referee. Quite often, you will just receive a paragraph
with several comments mixed together. In such
a situation, you can split the paragraph into 2 or 3
Rule 3: Answer with evidence
If you disagree with the referee’s comments, don’t
just say, ‘‘we disagree,’’ and move on. Say why you
disagree with a coherent argument or, better still,
back it up with some facts supported by references
that you can cite in your reply. Sometimes those extra
129
YMJD1792_proof 9 June 2004 3:19 pm
82 Williams
J AM ACAD DERMATOL
JULY 2004
references are just to back the point you make in your
cover letter, but occasionally you may add them to
the revised article. Some kind referees go to the
trouble of suggesting missed references or how you
might reword important areas of your document. If
providing the references or rewording makes sense
to you, just go ahead and incorporate them. It is quite
legitimate to use the referee’s comments to add some
extra text and data if their comments require it,
although if this amounts to more than a page, you
would be wise to suggest it as an option to the editor.
Another option is to suggest that the extensive
additions would be better placed in another subsequent article.
Sometimes, if there are no clear published data to
strongly support your methodologic approaches,
you can discuss this with an expert in the field. If
he or she agrees with your approach, then you can
say so in your reply. For example, ‘‘although other
approaches have been used in the past, we have
discussed this statistical methods with Professor Soand-So who agrees that it was the appropriate
analysis.’’
The referee is just plain rude
Anyone who has done clinical research will realize just how difficult it can be, and there is no place
for rudeness from referees. I find it sad that senior
academics can sometimes forget their humble beginnings when they referee other’s work. Nearly all
journals provide clear guidance to their referees to
avoid remarks that they would find hurtful if applied
to their own work, yet some ignore such advice and
delight in rude or sarcastic comments, possibly
because of envy or insecurity. In such circumstances,
all you need to do is complain to the editor and ask
for another nonhostile review.
The dreaded request to reduce the
manuscript by 30%
Such a request typically comes from the editor
who is pushed for space in his or her journal. I have
to confess that, for me, this is the comment that I
dread most of all because it is often accompanied by
3 referees’ comments, the response to which usually
involves making the article longer than the original
submission. A general reduction in text by 30%
basically requires a total rewrite (which is slow and
painful). It is usually easier to make a brave decision
to drop an entire section that adds little to the
manuscript. Ask a colleague who is not involved in
the manuscript to take out their editing knife and
suggest nonessential areas that can go—even though
the process of losing your precious words may seem
very painful to you. Discussion sections are usually
the best place to look for radical excisions of entire
paragraphs. Background sections should be just one
to two paragraphs long—just long enough to say
why the study was done, rather than an exhaustive
review of all previous literature. Please do not skimp
on the methods section unless you are referring to a
technique that can be put on a Web site or referenced.
TIPS ON DEALING WITH OTHER
SCENARIOS
Referees with conflicting viewpoints
At first, this scenario might appear very difficult to
the novice, yet it should be viewed as a gift. You, the
author, have the choice of which viewpoint you
agree with the most (or better still, the one that is
right). Then it is simply a question of playing one
referee against the other in your reply. You can
always appeal to the editor by asking him or her to
make the final decision, but give them your preferred
option with reasons.
The referee is wrong
Referees are not gods, but human beings who
make mistakes. Sometimes they do not read your
manuscript properly, and instead go on at length
about their hobbyhorse whereas, in fact, you have
dealt with their concerns elsewhere in the manuscript. Try to resist the temptation of rubbing their
nose in it with lofty sarcastic phrases such as ‘‘if the
referee had bothered to read our manuscript.’’
Instead, say something like ‘‘we agree that this is an
important point and we have already addressed it on
page A, paragraph B, line C.’’
Sometimes the referee is just plain wrong about
something. If so, it is silly to agree with the referee,
and you are entitled to a good argument. If you are
confident that you are right, then simply argue back
with facts that can be referencedethe editor can then
adjudicate who has the best evidence on their side.
CONCLUSION
Referees are human beings. The secret of a successful resubmission is to make your referees feel
valued without compromising your own standards.
Make your referees’ and editor’s life easy by presenting them with a clear numbered and structured
response letter. Provided you have made a good
attempt at answering all of the referees’ comments in
a reasonable way by following the 3 golden rules,
many referees and editors are too weak at the stage of
resubmission to open another round of arguments
and resubmission. In my experience, I spend up to 90
minutes on the initial refereeing of a manuscript, but
only around 20 minutes on a resubmission. However,
130
YMJD1792_proof 9 June 2004 3:19 pm
Return to Checklist for Revision
Williams 83
J AM ACAD DERMATOL
VOLUME 51, NUMBER 1
2. Altman DG. Practical statistics for medical research. London:
Chapman and Hall 1991.
3. Cummings P, Rivara FP. Responding to reviewers’ comments on
submitted articles. Arch Pediatr Adolesc Med 2002;156:105-7.
4. DeBehnke DJ, Kline JA, Shih RD, Research Committee of the
Society for Academic Emergency Medicine. Research fundamentals: choosing an appropriate journal, manuscript preparation, and interactions with editors. Acad Emerg Med 2001;8:
844-50.
5. Byrne DW. Publishing your medical research paper. Baltimore:
Williams and Wilkins; 1998.
6. Huth EJ. Writing and publishing in medicine. 3rd ed. Baltimore:
Williams and Wilkins; 1999.
7. Rothman KJ. Writing for epidemiology. Epidemiology 1998;9:
333-7.
8. Jefferson T, Wager E, Davidoff F. Measuring the quality of
editorial peer review. JAMA 2002;287:2786-90.
9. Jefferson T, Alderson P, Wager E, Davidoff F. Effects of editorial
peer review: a systematic review. JAMA 2002;287:2784-6.
if you miss some comments completely or your
manuscript changes do not correspond with what
you say you have done in your cover letter, this will
entice your referee to spend hours going through
your manuscript with a fine-tooth comb. If he/she
finds lots of little errors, this leads to a possible
deserved rejection.
Like a good marriage, resubmitting your manuscript in light of your referees’ comments is a process
of give and take.
The author wishes to thank Dr Jeffrey Bernhard for his
constructive comments and for references 5 to 7.
REFERENCES
1. Lowe D. Planning for medical research: a practical guide to
research methods. Cheshire (UK): Astraglobe Ltd; 1993.
131
YMJD1792_proof 9 June 2004 3:19 pm