Making Thinking Visible (MTV): Promoting Students’ Model

Download Report

Transcript Making Thinking Visible (MTV): Promoting Students’ Model

Summarizing, Explaining, and Diagramming: The Differential Effects
on Reading Times, Text-Base Representations, and Mental Models of
Science Text
Janice Gobert
Senior Research Scientist
The Concord Consortium
37 Thoreau Street
Concord, MA 01742
[email protected]
Research Associate
Dept. of Learning & Teaching
Graduate School of Education
Harvard University
Cambridge, MA 01238
Check us out at www.concord.org
This research is funded by the the National Science Foundation under grant
No. REC-9806141 awarded to the author. Any opinions, findings,
and conclusions expressed are those of the presenter and
do not necessarily reflect the views of the NSF.
Gobert, WTC 2002, 01/25/02
Background to the Research: Study 1

Builds on two studies addressing the on-line processing of science text and conceptual gains made when
different orienting tasks are given to students.

Study 1: Gobert & Clement (1999; WTC, 1997) -- effects of either summarizing or diagramming vs.
read only on
1) students’ understanding of science text using summaries or diagrams generated at 3 different
points in the text (intermediate representations)
2) post-test measures reflecting conceptual understanding (situation models).

On the post-test -- diagram group > summary group and summary group > read only group as predicted
on both the understanding of spatial and causal/dynamic aspects of the domain.

On students’ intermittent tasks (i.e., summaries and diagrams generated during reading)-- summaries
contained more semantic information than did the diagrams. Puzzling at first, but….

Interpreted as follows: For summary group, summarizing was a low-level task, as it involved same
media as text, and thus, did not elicit the development of rich mental models from which inferences could
be made. Superior performance on the post-test by the diagram group, reflecting students’ situation model
understanding (Kintsch, 1988) also supports these findings, i.e., that diagramming elicited a deeper
processing of the text, seen at post-test.
Gobert, WTC 2002, 01/25/02
Background to the Research (cont’d):
Study 2

Study 2: Gobert (1997) -- whether the learning advantages obtained by the diagram group > summary
group in Study 1 were due to the translation from textual into a visual representation (constructing
diagrams) or due to inferencing in general.

same experimental design except the two orienting conditions used were diagramming and explaining (a
higher level task than summarizing).

Results:
 diagram group = explanation group on the amounts of semantic information generated on the
intermittent tasks during reading AND
 diagram group = explanation group on the conceptual post-test measures reflecting mental
models of the domain.

Interpretation: generating explanations and diagrams both promoted inference-making on the textual
material and development of rich situation models, as evidenced by equally good performance on the posttest and equivalent amounts of semantic information on the explanations and diagrams.

Thus, it was not the visual representation driving the superior performance in Study 1.
Gobert, WTC 2002, 01/25/02
Ok, so the visual medium hypothesis was
out the window, what now?…
 Studies 1 and 2 provided a motivation for investigating the reading time performance,
intermittent task performance, and subsequent conceptual understanding in three different
orienting conditions in order to track potential differences in text comprehension.
 Orienting tasks for Study 3: summarizing, explaining, and diagramming.
 IF reading time performance of the explanation = diagram group but > summary group…..
it is possible that similar processes are being used in the explanation and diagram conditions in
setting up mental models of the text.
Gobert, WTC 2002, 01/25/02
Research Design: Study 3

Subjects: 30 5th grade students from a suburban town in eastern Massachusetts were selected for minimal
prior knowledge of the domain on the basis of a written pre-test.

Data Collection:
 Students interviewed individually;
 sessions = 45 minutes and 1 hour.
 All sessions were video-taped using two cameras, one focussed on the student and interviewer, the
other was positioned over the student’s desk to record the diagrams, etc.

The Text Source:
 3 pages in total; 5 sections: 1) introduction, 2) layers of the earth, 3) the movement and processes
inside the earth, 4) how mountains form and how volcanes erupt, and 5) how the sea floor
spreads.
 text presented on a Macintosh computer using Select-the-Text for reading times (Goldman & Saul,
1990).
 reading times were recorded for each sentence and for each section of text, both in milliseconds.
Gobert, WTC 2002, 01/25/02
Research Design: Study 3 (cont’d)

The three orienting & intermittent tasks were: to draw a diagram at 3 specific points in the text, to
summarize at 3 specific points in the text, and to explain at 3 specific points in the text.

Orienting instructions, given BEFORE the respective section:
“After this paragraph you will be asked to give a summary {OR give an explanation, OR draw a diagram}
of…. the different layers of the earth (task 1)
… .. the movement and processes inside the earth (task2)
…… the movement and processes inside the earth when mountains form (task 3).

The intermittent tasks (1-3) were designed and ordered in order to allow for a learnable progression of
models where simple models provide conceptual leverage for building more complex models (Raghavan &
Glaser, 1993; White, 1993).

While drawing, summarizing, or explaining, the students could not look back at the text.
Gobert, WTC 2002, 01/25/02
Coding of Students’ Intermittent Data, e.g.,
Spatial Scoring- Mountain Formation (task 3)
Based on a propositional analysis of the text read by the students,
Scoring of Spatial Components are:
crust::
LOC: on surface/ 1st layer
PRT: plates
ATT: up to 96 miles thick
ATT: plates, moving
PRT: crust, continents
mantle:
PART: magma
LOC: below crust/ 2nd layer
magma:
ATT: liquid
ATT: hot
core:
LOC: center of earth/ interior layer
ATT: hot mass
1 point
2 points
1 point
1 point
1 point
2 points
1 point
1 point
1 point
1 point
1 point
Gobert, WTC 2002, 01/25/02
total spatial (13) =
Coding of Students’ Intermittent Data, e.g.,
Causal Scoring for Mountain Formation (task 3)
Based on a propositional analysis of the text read by the students,
Scoring of Causal/Dynamic Components
core heats magma
if currents are shown/mentioned
currents of magma “rise” to top of mantle
mantle moves
plates collide/forced together
one plate moves under another
crust/land breaking/crumbling
cyclical pattern causes mountain
2 points
2 points
2 points
2 points
2 points
2 points
2 points
2 points
total (16)=
For more info on how a semantic-based coding scheme can be used for diagrams and
explanations, see Gobert & Clement (JRST, 99) & Gobert (IJSE, 2000).
Gobert, WTC 2002, 01/25/02
Coding of Students’ Post-test Data
Post-test: All of the post-test items assessed either spatial or causal/dynamic knowledge.
Examples of spatial/static items are:
“Where is the thinnest part of the crust, on the top of mountains or at the sea floor?”
“If the continents were all together, would the rest of the earth’s surface be water?”
and spatial features of students’ post-test diagrams depicting volcanic eruption and sea floor
spreading (these diagrams were NOT requested during reading).
Examples of causal/dynamic items are:
“Plates moving apart causes. . .?”
“Why is rock from the floor of the Atlantic Ocean newer (i.e. younger) than rock from the
middle of the North American Continent because….
and causal and dynamic features of students’ post-test diagrams depicting volcanic eruption
and sea floor spreading.
Gobert, WTC 2002, 01/25/02
Hypotheses Regarding Reading Times
For part 1- Introduction- no orienting task given:
 expected no differences in reading times
 provided an analysis of whether the three groups’ baseline reading times were equivalent
For part 2- “The Layers of the Earth” consisting primarily of spatial information:
 expected diagram and explanation group > summary group since task instructions would elicit a deeper
level processing of the text,
 interaction of diagramming or explaining with text requires setting up a spatial mental model AND that this
would be similar in these two conditions.
For parts 3- “The Movement and processes inside the Earth” and part 4- How Mountains Form”
consisting of several different types of information (spatial, causal, & dynamic):
 expected diagram and explanation groups > summary group since their task instructions would elicit a
deeper level processing of the text, and processing required to annotate spatial mental model with causal &
dynamic information would be similar.
 For an elaboration on mental model building from text in this domain, see Gobert, Int. J. Sci. Ed., 2000.
For part 5- “How volcanoes erupt and how the sea floor spreads”, no orienting task given:
 served as an exploratory analysis of whether the students would apply the same strategy used in reading
the 3 previous sections of text.
Gobert, WTC 2002, 01/25/02
Hypotheses Regarding Post-test Measures
 Based on vanDijk and Kintsch’s text comprehension model (van Dijk & Kintsch, 1983):
simple recall and recognition tasks are best supported by a good text-base and inference
tasks are best supported by higher level, mental models or situation models (JohnsonLaird, 1983; Kintsch, 1987)….(as well as based on date from Study 1)
 hypothesized that generating either diagrams or explanations while reading would promote
the development of rich mental models reflected by a superior understanding of the domain
compared to the summary group.
Gobert, WTC 2002, 01/25/02
Regarding Intermittent Tasks….
 A corollary of the hypothesis for the post-test regarding the intermittent data is ….
 For the intermittent tasks--the summary group would generate summaries with information
that more closely reflected the surface structure of the text when compared to the explanation
group; thus,
 Summary> explanation on RECALL of propositions of propositions from text.
 Explanation>summary on PARAPHRASE and INFERENCE of propositions from text.
 This analysis is presently being done and will not be discussed today.
Gobert, WTC 2002, 01/25/02
Overview of Results
There are four sets of quantitative data:
1) reading time data for each section of the text,
2) intermittent data generated by the student at 3 points during the text (these data will not be
discussed today) and
3) performance on the post-test.
Gobert, WTC 2002, 01/25/02
Analysis of Reading Times
 (Re prior knowledge of the domain, there was no statistically significant differences found
across the three groups for prior knowledge (F= 2.243, p=0.125), so all subsequent analyses
were done NOT using prior knowledge as a covariate. (Subjects were selected for minimal
p.k.).
 Reading time for each section of text was used as opposed to sentence reading times because
multiple types of knowledge, (i.e., spatial, causal, & dynamic) were reflected in many of the
sentences of the text.
READING TIME ANALYSES
 Part 1: Introduction-- No orienting task given here
 Anova, total time part 1 as the d.v. yielded, as expected, no statistically significant
differences across groups (F= .92, p=.414)
 Thus, no baseline differences in reading times across groups-GOOD.
Gobert, WTC 2002, 01/25/02
Total Reading Time for Part 2: Layers of the Earth
ANOVA Table for Tot Time Part 2
DF
Sum of Sq uares
Mean Sq uar e
F-Val ue
P-Val ue
Lambda
Power
g roup
2
59720652674.210
29860326337.105
7.961
.0022
15.922
.939
Total Pr e
1
4565686095.343
4565686095.343
1.217
.2808
1.217
.176
g roup * Total Pre
2
29828518103.527
14914259051.763
3.976
.0322
7.953
.654
24
90017435773.623
3750726490.568
Residual
Fisher's PLSD f or Tot Time Part 2
Eff ect: group
Significance Lev el: 5 %
Mean Diff.
0= summary
1= explanation
2= diagram
0, 1
34105.100
Crit. Diff.
P-Val ue
56527.665
.2251
0, 2
-147311.600
56527.665
<.0001
S
1, 2
-181416.700
56527.665
<.0001
S
Gobert, WTC 2002, 01/25/02
Figure 1
Total Reading Time by group for Part 2: Layers of the Earth
Interact ion Bar Plot for Tot Time Part 2
Eff ect: group
350000
300000
Significant:
Diag>sum
Diag>exp
Cel l Mean
250000
200000
150000
100000
50000
0
0
Gobert, WTC 2002, 01/25/02
1
Cel l
2
Total Reading Time for Part 3: Movement in the Layers of the
Earth
ANOVA Table for Tot Time Part 3
DF
Sum of Sq uares
Mean Squar e
F-Val ue
P-Val ue
Lambda
Power
g roup
2
81152198125.247
40576099062.624
4.017
.0313
8.035
.659
Total Pr e
1
29787091556.959
29787091556.959
2.949
.0988
2.949
.363
g roup * Total Pre
2
34285813674.015
17142906837.008
1.697
.2044
3.395
.311
24
242404065194.612
10100169383.109
Residual
Fisher's PLSD f or Tot Time Part 3
Eff ect: group
Significance Lev el: 5 %
0= summary
1= explanation
2= diagram
Mean Diff.
0, 1
38627.900
Crit. Diff.
P-Val ue
92761.481
.3986
0, 2
-204983.500
92761.481
.0001
S
1, 2
-243611.400
92761.481
<.0001
S
Gobert, WTC 2002, 01/25/02
Figure 2
Total Reading Time by group for Part 3: Movement of the
Layers of the Earth
Interact ion Bar Plot for Tot Time Part 3
Eff ect: group
450000
400000
Significant:
Diag>sum
Diag>exp
Cel l Mean
350000
300000
250000
200000
150000
100000
50000
0
0
1
Cel l
Gobert, WTC 2002, 01/25/02
2
Total Reading Time for Part 4: Movement in the Layers of the
Earth When Mountains are Formed
ANOVA Table for Tot Time Part 4
DF
Sum of Sq uares
Mean Squar e
F-Val ue
P-Val ue
Lambda
Power
g roup
2
84509885023.924
42254942511.962
3.614
.0425
7.227
.607
Total Pr e
1
18327679158.869
18327679158.869
1.567
.2226
1.567
.213
g roup * Total Pre
2
35244815581.513
17622407790.757
1.507
.2418
3.014
.280
24
280630374242.122
11692932260.088
Residual
Fisher's PLSD f or Tot Time Part 4
Eff ect: group
Significance Lev el: 5 %
0= summary
1= explanation
2= diagram
Mean Diff.
0, 1
29181.800
Crit. Diff.
P-Val ue
99807.933
.5519
0, 2
-218169.100
99807.933
.0001
S
1, 2
-247350.900
99807.933
<.0001
S
Gobert, WTC 2002, 01/25/02
Figure 3
Total Reading Time by group for Part 4: Movement in the
Layers of the Earth When Mountains are Formed
Interact ion Bar Plot for Tot Time Part 4
Eff ect: group
450000
400000
Significant:
Diag>sum
Diag>exp
Cel l Mean
350000
300000
250000
200000
150000
100000
50000
0
0
Gobert, WTC 2002, 01/25/02
1
Cel l
2
Analysis 5: Part 5 of the Text, “Why the Atlantic Ocean is
getting wider”.
 Here, no orienting task given
 Exploratory question whether students might utilize the same reading processes they had
were prompted to do in tasks 2, 3, and 4.
 Anova using total time on part 5 as the d.v. yielded no statistically significant differences
between the groups (F=.243, p=.786).
 Thus, students did not employ the same processing strategies they had used in the prior three
sections of text where the orienting tasks were given.
Gobert, WTC 2002, 01/25/02
Analysis of Post-Test Data
 spatial knowledge and causal/dynamic knowledge entered as variables into a manova.
 No statistically significant differences were found (Fmult= .60, p=.67). Univariate not
significant for:
 for spatial knowledge (Funiv= .007, p=.99) or
 causal/dynamic knowledge (Funiv= .397, p=.68)
 Analysis of Specific Post-Test Items (desperate times calls for desperate measures….)
 Anovas on each of the 14 post-test items yielded no statistical significant results, but some
means in the expected direction.
Gobert, WTC 2002, 01/25/02
Overview of Results
 The diagram group consistently took longer than either the explanation and summary group
to read sections of the text where orienting tasks were given.
 It was expected the reading time explanation group = diagram group because it was assumed
that higher level processing is required to construct mental models (regardless of medium of
information received)- this did not happen!
 Rather the summary group = explanation group on reading time, and the diagram group >
than summary or explanation.
 BUT, the post-test does not reflect any gains for the diagram group despite their longer time
spent reading the text?
Gobert, WTC 2002, 01/25/02
Some Speculations

Speculations about this are…...
 1) the summary task (as well as the other two) said “Include as much information as you can” and as
such, may have elicited a higher level processing of the test than is typical with summarization tasks.
 2) through the act of articulating summaries and explanations during their reading of the text,
students were reifying what they had read, and thus performed equally well on the post-test.

Additional Analyses underway to tease this out are:
 1) analyses of the three summaries and explanations generated during reading are being coded for
propositions recalled, inferred, and paraphrased. If summarizing was interpreted as a high level
task by the students we would expect to see similar proportions of propositions recalled,
summarized, and inferred when compared to the explanation group.
 BUT, if summarizing was interpreted as a lower level tasks (as it was designed to), we would
expect explanation>summary on propositions paraphrased & inferred but
summary>explanation on propositions recalled.
 (The summaries, explanations, and diagrams are being coded for inclusion of semantic features but it
is likely that this is not fine grained enough to pick up difference. Additional analyses may be
required of all the video data in the three conditions to track possible differences between the three
groups while they summarized, explained, or diagrammed).
Gobert, WTC 2002, 01/25/02
Conclusions

These data are consistent with other studies on self-explanations (Cote & Goldman, 1998; Cote, et al,
1998) in which it was found that the relationship between self-explanations and learning are not simple and
linear.

Cote et al speculate that differences might be due to the types of self-explanations that are generated. In
terms of the present study, a fine grained analysis of the propositions recalled and inferred across the
summary and explanation groups may yield important information about what features students are
attending to in constructing these representations.

Further research needs to be conducted with texts that are specifically designed to tease out potential group
differences. The text used here had many sentences which contained multiple types of information, i.e.,
spatial, causal, and dynamic, and combinations of the three, thus sentence by sentence reading time
comparisons would not be fruitful. The domain, being highly spatial, causal, and dynamic may not be
conducive to designing “natural” texts for this purpose.

Regarding “Summarizing, Explaining, and Diagramming”, there are differential effects on reading times,
there may be differences in Text-Base Representations and there are no tractable differences in mental
models of science text.
Gobert, WTC 2002, 01/25/02