Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
31 Cards in this Set
- Front
- Back
Context
|
Who, What, When, Where, Why, How the data were collected.
|
|
Data
|
Systematically recorded information, whether numbers or labels, together with its context.
|
|
Categorical variable
|
A variable that names categories.
|
|
Quantitative variable
|
A variable (with units) in which the numbers act as numerical values.
|
|
Frequency table
|
Table that lists the categories in a categorical variable and gives count or percentage of observations.
|
|
Area principle
|
In a statistical display, each data value should be represented by the same amount of area.
|
|
Bar chart
|
Bars with areas representing counts or percentages.
|
|
Pie chart
|
A "whole" divided into categories.
|
|
Contingency table
|
Table displaying counts and sometimes percentages of individuals falling into named categories on two or more variables. The table categorizes the individuals on all variables at once to reveal patterns.
|
|
Marginal distribution
|
Distribution of either variable alone, in last column or row.
|
|
Conditional distribution
|
Distribution of a variable restricting the Who to consider only a smaller group of individuals.
|
|
Independence
|
When the conditional distribution of one variable is the same for each category of the other.
|
|
Simpson's paradox
|
When averages are take across different groups, they can appear to contradict the overall averages.
|
|
Histogram
|
Adjacent bars to show the distribution of a quantitative variable.
|
|
Shape
|
Single vs. multiple modes
Symmetry vs. skewness Outliers and gaps |
|
Center
|
The place in the distribution of a variable that you'd point to if you wanted to attempt to summarize the entire distribution with a single number.
|
|
Measures of center
|
Mean and median
|
|
Spread
|
Numerical summary of how tightly the values are clustered around the circle.
|
|
Measures of spread
|
IQR and standard deviation
|
|
Mode
|
High point in the shape of the distribution of a variable.
|
|
Unimodal
|
Having one mode. Used to describe the shape of a distribution.
|
|
Bimodal
|
Having two modes.
|
|
Multimodal
|
Having more than two modes.
|
|
Uniform
|
Distribution that's roughly flat.
|
|
Symmetric
|
Two halves on either side of the center look approximately like the mirror images of each other.
|
|
Median
|
Middle value
|
|
Range
|
Difference between lowest and highest values in a data set.
|
|
Interquartile range (IQR)
|
The difference between the first and third quartiles.
|
|
5-number summary
|
Summary reporting minimum value, Q1, the median, Q3, and maximum value.
|
|
Variance
|
Sum of squared deviations from the mean, divided by the count minus 1.
s² = √ E(y-Y)² / n-1 |
|
Standard deviation
|
Square root of the variance
s = √ E(y-Y)² / n-1 |