This section describes in detail the main statistical, graphical, economical and environmental concepts that will be discussed in this unit. Terms that may be included in a vocabulary list are italicized. These concepts are referred to in the individual lesson plans. The economical and environmental concepts are related to the statistical and graphical applications that will be used to study the behavior of and relationships between the natural resources and human populations. References to resources for data sets involving the economical and environmental concepts are provided in the individual lessons.
is measured or observed information. The two main
Quantitative data is numerical, while qualitative data is non-numerical. The four common
include nominal, ordinal, interval and ratio measurements.
is purely qualitative, and is essentially categorical, i.e. does not have order or magnitude.
can be qualitative or quantitative, has implicit order, but has no common difference.
are purely quantitative, but the difference between two data has meaning.
is interval data in which a zero implies a quantity of zero for some measurement or observation (Larson & Farber, 2000). Comparison of the characteristics of data types can be eased by use of a matrix
Data summary measures of central tendency include
. The mean is the sum of all data divided by the size of the data set. The median is the middle entry in an ordered data set of odd size n. If n is even, the median is the mean of the two middle data points of an ordered data set. The mode is the datum with the highest frequency.
Data summary measures of dispersion, spread, or variability include
. In an ordered data set, the lowest value is the
and the largest value is the
. The range is the difference of the maximum and minimum. The variance (of a sample of size n) is the sum of the squared difference of each data point from the mean, divided by one less than the size of the data set. The standard deviation is the square root of the variance
. The interquartile range is found by first separating the ordered data set into four
, each with the same size. The divisions of the quartiles are known as Q
, the median (Q
), and Q
. The interquartile range (IQR) is Q
. Taken with the maximum and minimum, the median and quartiles are collectively known as the
five number summary
, which is useful for creating box plots.
is a tabular representation of the
a finite number of categories, or
, where the classes are interval data.
is the number of data entries in a class. The types of distributions include
normal or symmetric, skewed right, skewed left,
uniform or flat
. In a normal distribution, all measures of center coincide. In a skewed-left distribution, mean median mode, and the “peak” of the data occurs on the right side of the distribution. In a skewed-right distribution, mode median mean, and the “peak” of the data occurs on the left side of the distribution
. When a frequency distribution is normal, probabilities or percentiles can be known using the standard deviation: 68% of the data lies within 1 standard deviation of the mean, 95% of the data is within two standard deviations from the mean, and virtually all (99.7%) of the data lies within three standard deviations from the mean
, for the purposes of this unit, is a phenomenon measured or recorded as quantitative data. In this way, a data set consisting of discrete nominal categories with frequencies is considered
. A data set with two corresponding sets of data (ordered pairs) is considered a
is the set of all outcomes or measurements. A
is a part of the population. A
is a measurement of some facet of a sample, while a
measures a characteristic of the population (Larson & Farber, 2000). One of the main goals of
is to use statistics to estimate parameters.
Missing data points within the range of a bivariate data set can be
using a linear regression formula
Graphical displays used to show frequency of univariate data that is classified nominally include dot plots, bar graphs, pie charts, and frequency tables. A stem and leaf plot and a histogram display frequency and distribution shape of data that is classified by interval data. Cumulative frequency plots or curves show the rate of increase in total frequency for all previous classes. Graphics lending themselves to display of distribution shape and variability include box plots and histograms. Graphical displays used to highlight the relationship between two interval or ratio variables include time series graphs and scatter plots. In both of these types of displays, data are considered as ordered pairs.
Economic and Environmental Concepts
Some of the economic and environmental concepts that students may explore include the basics for human existence on a large scale: climate, weather, freshwater, food, energy, health care, child care, housing, transportation, and taxes. Other data that may be explored includes human population of different areas, land area, production of waste and consumption of energy, carbon footprint, and water footprint. Preliminary data or introduction to the idea of natural resources may be acquired by viewing data maps and cartograms, which will reinforce geographical knowledge and practice spatial reasoning.
Economical concepts consist primarily of costs, which students will investigate and to which students will apply the statistics and graphics. Some basic cost categories that may be explored include household costs, such as the costs of water, food, electric and fuel (energy) costs, health care costs and childcare costs. The results of these costs or rates of consumption may be compared to the known quantities of resources that are available, and conclusions or predictions about the future availability of the resources will be explored