A digression into
quirky statistics
Excerpt from
the "Phantom Tollbooth"
from Norton Juster (with illustration from Jules Feiffer)
Up he went -
very quickly at first-then more slowly - then in a little while even more
slowly than that - and finally, after many minutes of climbing up the endless
stairway, one weary foot was barely able to follow the other. Milo suddenly
realized that with all his effort he was no closer to the top than when he
began, and not a great deal further from the bottom. But he struggled on for a
while longer, until at last, completely exhausted, he collapsed onto one of the
steps.
"I
should have known it," he mumbled, resting his tired legs and filling his
lungs with air. "This is just like the line that goes on forever, and I'll
never get there."
"You
wouldn't like it much anyway," someone replied gently. "Infinity is a
dreadfully poor place. They can never manage to make ends meet."
Milo looked
up, with his head still resting heavily in his hand; he was becoming quite
accustomed to being addressed at the oddest times, in the oddest places, by the
oddest people-and this time he was not at all disappointed. Standing next to
him on the step was exactly one half of a small child who had been divided neatly
from top to bottom.
"Pardon
me for staring," said Milo, after he had been staring for some time,
"but I've never seen half a child before."
"It's
.58 to be precise," replied the child from the left side of his mouth
(which happened to be the only side of his mouth).
"I beg
your pardon?" said Milo.
"It's
.58," he repeated; "it's a little bit more than a half."
"Have
you always been that way?" asked Milo impatiently, for he felt that that
was a needlessly fine distinction.
"My
goodness, no," the child assured him. 'A few years ago I was just .42 and,
believe me, that was terribly inconvenient."
"What is
the rest of your family like?" said Milo, this time a bit more
sympathetically.
"Oh,
we're just the average family," he said thoughtfully; "mother,
father, and 2.58 children and, as I explained, I'm the .58"
"It must
be rather odd being only part of a person," Milo remarked.
"Not at
all," said the child. "Every average family has 2.58 children, so I
always have someone to play with.
Besides, each
family also has an average of 1.3 automobiles, and since I'm the only one who
can drive three tenths of a car, I get to use it all the time."
"But
averages aren't real," objected Milo; "they're just imaginary."
"That
may be so," he agreed, "but they're also very useful at times. For
instance, if you didn't have any money at all, but you happened to be with four
other people who had ten dollars apiece, then you'd each have an average of
eight dollars. Isn't that right?"
"I guess
so," said Milo weakly.
"Well,
think how much better off you'd be, just because of averages," he
explained convincingly. "And think of the poor farmer when it doesn't rain
all year: if there wasn't an average yearly rainfall of 37 inches in this part
of the country, all his crops would wither and die".
It all
sounded terribly confusing to Milo, for he had always had trouble in school
with just this subject.
"There
are still other advantages," continued the child. "For instance, if
one rat were cornered by nine cats, then, on the average, each cat would be 10
per cent rat and the rat would be 90 per cent cat. If you happened to be a rat,
you can see how much nicer it would make things."
"But
that can never be," said Milo, jumping to his feet.
"Don't
be too sure," said the child patiently, "for one of the nicest things
about mathematics, or anything else you might care to learn, is that many of
the things which can never be, often are. You see," he went on, "it's
very much like your trying to reach Infinity. You know that it's there, but you
just don't know where but just because you can never reach it doesn't mean that
it's not worth looking for."
"I
hadn't thought of it that way," said Milo, starting down the stairs.
"I think I'll go back now."
"A wise
decision," the child agreed; "but try again someday perhaps you'll
get much closer." And, as Milo waved good-by, he smiled warmly, which he
usually did on the average of 47 times a day.
"Everyone
here knows so much more than I do," thought Milo as he leaped from step to
step.
This Landsat image is centered on the
Missouri River at Glasgow, near the St.
Louis airport. At this location the Missouri River is flooding, due to the
high level of the Mississippi. |
This Landsat image of the Mississippi
River was taken in 1988. |
This Landsat image of the Mississippi River was taken in
1993 |
Flood Prediction: The Mississippi case-study:
1) Upload the data annual peak
discharge data of the Mississippi at St Louis (StLouis-Discharge).
If you like, you can visit the source of the data on the USGS water resources
page (Mississippi
River At St Louis Mo).
2) Make a histogram of the
entire data set and verify (visually) that it is normally distributed.
3) Calculate the natural
logarithm of the annual peak discharge and determine the basic statistics of
those data (MIN, MAX, AVERAGE, STDEV)
4) Calculate the discharge rate
equivalent to the 100 years flood (P=0.01). (in Ft3/sec. Do not
forget to reverse the transformation by using the EXP function):
a) Graphically: Transform the
data by calculating exceedance level (r/(n+1)) and then plot that vs. lnQ. Use
this graph to estimate the value of the 100 years flood. (Note: You can use 1) a regression
function applied to the portion of the curve that is linear ob 2) a polynomial
fitted to the entire data set).
b) Formula: Use the NORMINV function
to determine this value. The NORMINV function in Excel returns the inverse of
the normal cumulative distribution for the specified mean and standard
deviation. Syntax: NORMINV(probability,mean,standard_dev), where “Probability” is a
probability corresponding to the normal distribution, “Mean” is the arithmetic
mean of the distribution, and “Standard_dev” is the standard deviation of the
distribution.
c) Are these values similar?
Explain why.
5) Do the same for the 1000 year
flood
6) How does the 1993 flood
compare to these two discharge rates (100- and 1000-years floods)?
7) What is the recurrence
interval of the 1993 flood at St. Louis? Use the NORMDIST function. The NORMDIST
function in Excel returns the normal distribution for the specified mean and
standard deviation. Syntax: NORMDIST(x,mean,standard_dev,cumulative),
where “X” is the value for which you want the distribution, “Mean” is the arithmetic
mean of the distribution, “Standard_dev” is the standard deviation of the
distribution, and “Cumulative” is a logical value that determines the form of
the function. When cumulative is TRUE, NORMDIST returns the cumulative
distribution function.
8)
Perform
the calculation for the 100 years flood again, this time using a period of only
20 years from anywhere in the record (make sure you state which period you chose).
9) Compare this result to the
one obtained using the entire data set. How big a data set should you use? Why?
What processes may prevent using the present data set for predictions of future
flood magnitude and frequency?
Student
exercise: Precipitation
prediction
10) Upload the precipitation data for several cities in the US.
11) Use these data to calculate
the 100 years rain event in the city of your choosing (ask yourself: which data
you should use?). To perform this calculation, you should use the table below
that provides cdf for any particular z-value.
12) Compare this value with the
number of times measured precipitation did equal or exceed the prediction made
in the previous question. Are they similar/different? Why?
13) Perform the calculation for
the 100 years precipitation, this time using a period of only 20 years from
anywhere in the record of the chosen city. Are the results similar? Explain
why.
Challenge Question:
Identify a station in the U.S. for which you can obtain a time series for precipitation and for streamflow (visit the USGS web site: http://water.usgs.gov/waterwatch/). Is there a correlation between precipitation and streamflow? Why?