Simulating background settings during spoken and written sentence comprehension
Psychonomic Bulletin & Review
https://doi.org/10.3758/s13423-022-02061-9
BRIEF REPORT
Simulating background settings during spoken and written sentence
comprehension
Oleksandr V. Horchak1 · Margarida Vaz Garrido1
Accepted: 19 January 2022
© The Psychonomic Society, Inc. 2022
Abstract
Previous findings from the sentence-picture verification task demonstrated that comprehenders simulate visual information
about intrinsic attributes of described objects. Of interest is whether comprehenders may also simulate the setting in which
an event takes place, such as, for example, the light information. To address this question, four experiments were conducted
in which participants (total N = 412) either listened to (Experiment 1) or read (Experiment 3) sentences like “The sun is
shining onto a bench” followed by a picture with the matching object (bench) and either the matching lighting condition
of the scene (sunlit bench against the sunlit background) or the mismatching one (moonlit bench against the moonlit background). In both experiments, response times (RTs) were shorter when the lighting condition of the pictured scene matched
the one implied in the sentence. However, no difference in RTs was observed when the processing of spoken sentences was
interfered with visual noise (Experiment 2). Specifically, the results showed that visual interference disrupted incongruent
visual content activated by listening to the sentences, as evidenced by faster responses on mismatching trials. Similarly, no
difference in RTs was observed when the lighting condition of the pictured scene matched sentence context, but the target
object presented for verification mismatched sentence context (Experiment 4). Thus, the locus of simulation effect is on the
lighting representation of the target object rather than the lighting representation of the background. These findings support embodied and situated accounts of cognition, suggesting that comprehenders do not simulate objects independently of
background settings.
Keywords Language comprehension · Visual simulation · Embodied cognition · Background settings · Light
Introduction
Does language comprehension rely on visual simulation as
suggested by perceptual symbol theories (Barsalou, 1999,
2008)? Much behavioral research has sought to answer this
question using a sentence-picture verification paradigm (see
Horchak et al., 2014, for a review). As one example, Zwaan
et al. (2002) observed faster responses when the pictured
object shape was compatible with the shape implied by the
preceding sentence. As a different example, Winter and Bergen (2012) showed that verifying pictures depicting smaller
objects was faster when reading sentences about distant
objects than about nearby objects, and the reverse for the
time to verify pictures depicting larger objects. The result
* Oleksandr V. Horchak
1
Iscte-Instituto Universitário de Lisboa, Cis-Iscte, Av. das
Forças Armadas, 1649‑026 Lisbon, Portugal
that response times (RTs) are shorter whenever the pictured
object matches the state implied by the sentence was taken
as support for the hypothesis that people rely on visual simulation during the task.
Nonetheless, the above evidence could be interpreted differently. For example, comprehenders might not simulate
an object as being in a specific state before picture verification. Instead, they might simply find it easier to incorporate the pictured version of the object when it matches
sentence content (Masson, 2015). This explanation fits with
the mechanism of backward semantic priming, according
to which processing of picture stimuli should be supported
by recruitment of the previously processed sentence stimuli
(e.g., Neely et al., 1989). One of the most common mechanisms underlying semantic priming is spreading activation
(Collins & Loftus, 1975), which suggests that there are
strong links between the representations of related words
in semantic memory. For example, reading a word such as
“table” should activate the corresponding node in semantic
13
Vol.:(0123456789)
Psychonomic Bulletin & Review
memory that spreads to the words with similar meaning via
the nearby nodes. Consequently, RTs for the word “stool”
should be faster than RTs for the word “squirrel.”
Recently, a more nuanced picture of the functional role
of simulation during word processing has emerged with the
use of visual noise. By using this technique, the assumed
simulation is interfered with rapidly flashing visual masks
that selectively activate the visual cortex (Yuval-Greenberg
& Heeger, 2013), and the impact of this interference on the
task is assessed. For example, Edmiston and Lupyan (2017)
asked participants to listen to a word followed by the presentation of two pictured objects, one of which was oriented
upright and the other was oriented upside down. Seventy-five
percent of the time, the pictured objects matched the word
(e.g., verifying pictures of two dogs after hearing “dog”), but
25% of the time, the pictured objects mismatched the word
(e.g., verifying pictures of two cats after hearing “dog”).
On 50% of all trials, participants saw visual noise in the
form of colorful rectangles with colors, sizes, and positions
alternating at a rate of around 60 Hz. Participants’ task was
to press the button corresponding to the side that displayed
the image in upright position. The results showed that RTs
for matching stimuli were approximately the same for trials
with and without visual interference. However, RTs for mismatching stimuli were reduced for trials with (vs. without)
visual interference. Edmiston and Lupyan (2017) concluded
that visual noise disrupted incongruent visual content while
listening to the word. Furthermore, in the same study the
researchers measured the effect of visual interference on
comprehenders’ ability to answer questions about objects’
properties. The results showed that visual interference
reduced the accuracy in answering visual questions (e.g.,
color) but not non-visual questions (e.g., tactile feelings).
Thus, Experiment 2 showed that visual interference affects
only visual knowledge (see also Ostarek & Huettig, 2017,
for further evidence).
Whereas the case for visual simulation is strong regarding
word processing, the case for the involvement of visual processes during sentence processing is weaker. For example,
Ostarek et al. (2019) investigated which processes contribute
to the retrieval of shape information in a sentence-picture
verification task by using the materials from the original
Zwaan et al.’s (2002) study. They hypothesized that if faster
RTs are explained by visual simulation, then visual interference occurring before the presentation of the target image
should reduce the effect of the sentence on subsequent image
recognition. The researchers found no evidence that disrupting visual processes interfered with visual simulation. This
is the case because RTs were faster for shape-matching trials
in both “blank screen” and “visua (...truncated)