Comparing Spatial Information in Speech/Gesture
and Signed Language
SARAH TAUB, PILAR PIÑAR & DENNIS GALVAN
Gallaudet University
1. Introduction
Linguists have always wanted to compare how languages express
information. Do users of different languages communicate essentially
the same information, or does language play a critical role in shaping what
is included and excluded from a narrative? The "null hypothesis" or
simplest assumption might be that humans in general attend to and express the
same conceptual elements universally. This paper presents a research methodology
that should allow us to evaluate that hypothesis. Earlier work has suggested
that the hypothesis fails; that spoken languages differ to a large degree in
what is expressed, and that signed languages draw on different resources and
may express more information (in certain domains) than spoken languages. Taking
spatial motion events as a test case, this paper argues that it is not appropriate
to compare languages based on speech alone, or to compare speech to signed language;
rather, the speech/gesture complex produced in discourse and narrative is analogous
to signed language, and constitutes the appropriate level for cross-linguistic
comparison. If speech/gesture is taken as the level of comparison, we predict
more universal trends in expression of information. The technique presented
in this paper should allow collection of quantitative data over a large range
of languages.
Talmy (1985) provided an analysis of spatial events upon which much subsequent
typological research has been based. He divided motion events into such categories
as Figure, Ground, Motion, Path, and Manner, then classified languages as to
how they typically expressed each class of information. In particular, verb-framed
languages such as Spanish express Path information through verbs, and satellite-framed
languages such as English express Path through other members of the verb complex
("satellites"). As examples, consider (1), a typical English sentence
in which a bird's Path through a window is described with a verb particle and
prepositional phrase, and (2), a typical Spanish sentence where the same Path
is described with a verb.
(1) It flew out of the window.
(2) Salió por la ventana.
went-out-3rd through the window
Talmy noted that verb-framed languages are less likely to provide complex specifications
of Path, as they would require a long series of verbs and thus many clauses
(making the discourse feel "over-detailed"); as Manner requires an
extra verb or gerund in these languages, it is often left out as well. On the
other hand, satellite-framed manner-type languages such as English incorporate
Manner easily in the main verb, then typically present long series of Path elements
via several satellites. (3) demonstrates the use of a second verb in Spanish
to specify Manner, and (4) demonstrates the use of multiple verb particles and
prepositions in English to specify complex Paths.
(3) Salió volando por la ventana.
went-out-3rd flying through the window
(4) It flew right back out the window, across the street, and into the forest.
Slobin's (1996) studies of Spanish and English written narratives indicate that
differences between the two languages (exemplars of the verb-framed and satellite-framed
types) persist over a longer time-period than the individual clause. That is,
Spanish narratives consistently present fewer explicit Manner and Path elements
than English narratives; they do not "catch up" in the course of the
story. This analysis, however, considers only written texts and not oral narrative
or discourse. Studies show (e.g., Kendon 1986, McNeill 1992) that the gesture
that accompanies speech also contains conceptual elements (though it is a matter
of hot debate whether such gesture is intended to communicate those elements).
Galvan & Taub (forthcoming) used a similar method of counting conceptual
elements to compare the information expressed in American Sign Language and
English narratives. Narratives were elicited by the same stimulus (the wordless
storybook Frog, Where Are You?) and so could be compared more easily. They found
that in nearly every category (Ground elements being the only exception), ASL
storytellers included more conceptual information; in general, the signed narratives
relied on ASL's highly iconic grammatical resources to compress a huge amount
of spatial information into each sentence. But again in this study, analysis
of the English narratives looked only at the speech produced by the subject;
there was no record of gesture or prosody.
Many people have noted similarities between signed languages and speakers' gestures;
many others have noted differences. (The debate has a political edge, since
when ASL was thought to be "only gesture," deaf signers were thought
to be without language and mentally impaired.) Liddell (2000) has created a
precise explanation for this phenomenon based on conceptual blending. According
to Liddell, both signers and speaking gesturers create a "blend" (Fauconnier
& Turner 1996) between an imagined mental space and Real space (i.e., their
conceptual model of the space in which they stand). In the blend, imagined entities
are conceived of as located in the space around the person. Iconic and deictic
gestures may be aimed toward those entities, or may track their progress through
space.
For spoken languages, these gestures are loosely integrated with the speech
signal the emphatic stroke phase of the gesture occurs precisely with
a spoken word or words that relates to its meaning (McNeill 1992), but the preparatory
and refractory phases may vary in timing. (Indeed, McNeill notes that the precise
timing of these other phases may reveal details of the speaker's thought processes).
For signed languages, on the other hand, these gestures are tightly constrained
to unify with lexical and grammatical elements. Indeed, certain signs (e.g.,
pronouns, some verbs, classifier signs) are ungrammatical if they do not contain
a gestural element. For one example, ASL pronouns must be placed in signing
space so as to indicate the entity to which they refer; for another, verbs such
as GIVE must indicate their subject or Source argument and object or Goal argument
by moving from a space associated with the Source to a space associated with
the Goal. We might describe these gestural elements of signed languages as "grammaticalized
gesture."
If Liddell's proposal is correct, then signed language is more fully analogous
to the speech/gesture complex than to speech alone. If co-speech gesture presents
additional conceptual information, then comparison of speech/gesture across
spoken languages may also prove more fruitful than comparison of speech alone.
Elaborating on the null hypothesis above, if gesture is taken into consideration,
cross-linguistically and cross-modally, in comparable narratives we may expect
to see the following:
A. Similar amounts of conceptual information
B. i. Similar types of information in speech and lexical sign elements
ii. Similar types of information in gesture and gestural sign elements
Given the research of Talmy and Slobin, we may hypothesize more specifically
the following:
C. Spanish uses gesture to add more Path and Manner information.
2. Methods
Subjects were 12 native users of English, 12 of Spanish, and 11 of ASL. Each
subject was paired with another native user of his/her language. Subjects watched
animated cartoons involving the adventures of a cat and a bird; they were instructed
to tell what happened in the cartoon to their partner clearly enough so that
the partner could then tell the story to third person. All narratives were videotaped.
Partners' narratives were recorded but not analyzed in this study.
In the particular scene analyzed here, the cat and the bird are in high-rise
apartments across the street from each other. The cat has been studying the
bird through their windows. The cat swings from his window to the bird's window
on a rope, but misses the window, crashes into the wall, and falls to the street
below. The researchers developed a list of conceptual elements present in the
cartoon, including potential Figure, Ground, Path, Manner, and Instrument elements.
The analysis compiled 1) total number of conceptual elements expressed; 2) number
of elements expressed through speech or lexical sign elements; and 3) number
of elements expressed through gesture or gestural sign elements. Lexical and
gestural sign elements were distinguished on the following criterion: if the
element (e.g., handshape, movement, location, complete sign) is conventionalized
in the lexicon of ASL, than it was counted as lexical; if it is not, then it
was counted as gestural. A number of signs contained both lexical and gestural
elements.
3. Results
Tables 1-4 summarize the results. In each table, the first row shows items expressed
through lexical means, and the second row shows items expressed through gestural
means; note that some elements are represented in both rows. The third row shows
all elements in both modes, while the fourth row excludes elements that were
expressed only through gesture. The purpose of the fourth row is to simulate
the results of previous methodologies which did not include gesture. An ANOVA
was performed on the data; means that are statistically different are printed
in different colors for easy identification.
Table 1: Total Conceptual Items: Statistics
Mean Spanish Mean English Mean ASL F Sig
lexical 12.67 14.33 6.82 23.08 0
gestural 9.33 9.58 13.45 7.979 0.002
all 15.33 16.75 15.36 1.11 0.342
all w/o gesture 12.67 14.33 15.36 2.774 0.077
Table 2: Figure/Ground Conceptual Items: Statistics
Mean Spanish Mean English Mean ASL F Sig
lexical 4.42 4.5 3.36 2.769 0.078
gestural 1.67 1.5 2.55 3.531 0.041
all 4.42 4.5 3.36 2.769 0.078
all w/o gesture 4.42 4.5 3.36 2.769 0.078Table 3: Path Conceptual Items: Statistics
Mean Spanish Mean English Mean ASL F Sig
lexical 4.17 5.17 1.36 37.3 0
gestural 4.92 4.92 5.82 1.983 0.154
all 5.83 6.33 6.27 0.699 0.504
all w/o gesture 4.17 5.25 6.27 11.3 0Table 4: Manner Conceptual Items: Statistics
Mean Spanish Mean English Mean ASL F Sig
lexical 3.17 3.58 1.27 16.79 0
gesture 2.33 2.58 4.27 8.812 0.001
all 4 4.5 4.45 0.861 0.432
all w/o gesture 3.17 3.67 4.45 3.272 0.051
Figures 1-4 illustrate the same data graphically; in these figures,
the yellow bar segments represent items presented through gesture only, the
blue segments represent items presented through lexical means only, and the
green segments represent items that were presented through both means (though
presentations in the two modes were rarely simultaneous). Thus, the total height
of the bars represents the total number of items for each language.
4. Conclusions
The data suggest a significant role for gesture in equalizing the expression
of information across languages and modalities.
A. Similar amounts of conceptual information
This hypothesis is supported. There were no significant differences between
languages in total amounts of information. On the other hand, English and Spanish
clearly convey more information through lexical means, and ASL conveys more
information through gestural means. Thus, gesture may be said to play a role
in equalizing the amount of information expressed.
B. i. Similar types of information in speech and lexical sign elements
ii. Similar types of information in gesture and gestural sign elements
These hypotheses are supported. Certain types of conceptual item are preferentially
expressed through one mode or the other: Figure/Ground information always shows
up in lexical elements, and Path information always shows up in gestural elements.
But spoken and signed languages vary as to how much those types show up in their
non-preferred mode. Thus, all subjects support Figure/Ground information with
some gestural elements but this is significantly more prevalent for ASL than
for English and Spanish. On the other hand, ASL only rarely supports Path information
with lexical elements, while English and Spanish do this to a considerable degree,
adding non-gestured Path items in their speech. These two tendencies make sense,
in that identification of things (Figure/Ground elements) is most easily accomplished
through memorized lexical elements or category names, while the details of Paths
are more easily shown through free gesture than through fixed lexical categories.
Manner, however, shows no overall mode preference. Instead, users of the spoken
languages express it largely lexically, adding little new information through
gesture; while users of the signed language show the opposite pattern, expressing
most Manner information through mapped elements.
C. Spanish uses gesture to add more Path and Manner information.
This hypothesis is not supported. The results show no significant difference
between English and Spanish for any measure, lexical or gestural. On the other
hand, trends are visible that support Slobin and Talmys results; notably,
the difference between English and Spanish in Path information expressed through
lexical means approaches significance (sig. <.059), while if gesture is included
the difference disappears. It is possible that a larger sample size would make
the trends significant.
5. Summary
We find that in both of the spoken languages, subjects express a significant
amount of additional information through gesture; this leads to a rough equalization
of amount of information expressed.
Particular categories of conceptual items appear universally in speech or universally
in gesture, optionally supported by the other mode. This pattern would not have
been apparent had we not separated gestural and lexical elements within signed
language, and brought in gesture as well as speech for cross-modal comparisons.
We conclude that the methodology of comparing speech/gesture complexes with
each other and with signed languages produces interesting results, and we recommend
it as the standard for future typological investigations.
References
Galvan, Dennis, & Sarah F. Taub. Forthcoming. The encoding of motion information
in American Sign Language. In R. Berman, D. Slobin, S. Strömqvist, &
L. Verhoeven (eds.) The Frog Story Revisited. Hillsdale, NJ: Lawrence Erlbaum
Associates.
Fauconnier, Gilles, & Mark Turner. 1996. Blending as a central process of
grammar. In Adele Goldberg (ed.) Conceptual Structure, Discourse, and Language,
pp. 113-130. Stanford, CA: CSLI.
Slobin, Dan I. 1996. Two ways to travel: verbs of motion in English and Spanish.
In M. Shibatani & S. A. Thompson (eds.) Essays in semantics. Oxford: Oxford
University Press.
Kendon, Adam. 1986. Some Reasons for Studying Gesture. Amsterdam: Mouton de
Gruyter.
Liddell, Scott. 2000. Blended spaces and deixis in sign language discourse.
In D. McNeill (ed.) Language and Gesture. Cambridge: Cambridge University Press.
McNeill, David. 1992. Hand and Mind: What Gestures Reveal About Thought. Chicago:
University of Chicago Press.
Talmy, Leonard. 1985. Lexicalization Patterns: Semantic Structure in Lexical
Forms. In T. Shopen (ed.) Language Typology and Syntactic Description, Vol.3:
Grammatical Categories and the Lexicon. Cambridge: Cambridge University Press.