• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/85

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

85 Cards in this Set

  • Front
  • Back
alternative hypothesis
The hypothesis that the researcher usually wants to prove or verify, a statement about the value of a parameter that is either less than, greater than, or not equal to.
association
Values of one variable tend to occur with certain values of another variable, detected when the conditional distributions differ from the marginal distribution and from each other.
bias
A condition where the mean of the statistic values differs from the parameter that the statistic estimates.
bivariate data
Data collected on two variables for each individual in a study.
C
The symbol for level of confidence.
Central Limit Theorem
The name of the statement telling us that the sampling distribution of x bar is
approximately normal whenever the sample is large and random.
conditional distribution
The distribution of the values in a single row (or a single column) of a two-way table.
conditions required for the sampling distribution of x bar to be approximately normal
Sample size is large and sample is random (Central Limit Theorem application).
conditions required for the sampling distribution of x to be normal
Original population is normal and sample is random.
confidence interval
An estimate of the value of a parameter in interval form with an associated level of confidence, in other words, a list of reasonable values for the parameter based on a value of the statistic.
control chart
A statistical tool for monitoring the input or output of a process.
control limits
m – 3s / √n and m + 3s / √n , used to detect out-of-control signals in a control chart.
correlation coefficient
A measure of the strength of the linear relationship between two quantitative variables.
decreased
What happens to the width of a confidence interval when the sample size is increased.
disjoint events
Events that cannot occur simultaneously.
distribution of a variable
A list of the possible values of a variable together with the frequency of each value. (Note
event
A single outcome or a combination of outcomes from a random phenomenon.
extrapolation
Predicting a Y value using a value of X that is outside of the range of X values used to obtain the regression equation. This prediction could be very far off.
garbage
Results from statistical analyses performed on non-random samples.
increased
What happens to the width of a confidence interval when the level of confidence is increased.
inference
Using results from a sample statistic value to draw conclusions about the population parameter.
influential observation
An observation that substantially alters the values of slope and y-intercept in
the regression equation when it is included in the computations.
law of large numbers
The fact that the average ( x ) of observed values in a sample will get closer and closer to m as the sample size increases.
laws of probability
The basis for hypothesis testing and confidence interval estimation.
least squares
A method for finding the equation of a line that minimizes the sum of squared residuals.
least squares regression line
The line with the smallest sum of squared residuals.
level of confidence
The percentage of the time that the confidence interval procedure will produce confidence intervals that contain the parameter being estimated, the probability that the confidence interval to be computed (before sample is taken and data are gathered) will contain the value of the parameter. (NOTE
lurking variable
A variable that is not measured but explains association between two variables that are measured.
marginal distribution
The distribution of the values in the “total” row (or the “total” column) of a two-way table.
margin of error
The maximum amount that a statistic value will differ from the parameter value for the middle (1-C)´ 100% of the distribution of all possible statistics. (Note: (1-C)´ 100% is a generic way of expressing level of confidence.)
mean of the sampling distribution of x bar
the mean of all the sample means (x bar's) from all possible samples of size n from a population, equals m.
m
The mean of the population
no association
A condition where values of one variable occur independent of values of another variable, detected when the conditionals of a two-way table equal the marginal distribution (and each other.)
null hypothesis
The hypothesis that the researcher assumes to be true until sample results indicate otherwise, the hypothesis of no difference or no change, hypothesis researcher usually wants to disprove.
observed effect
The difference between the observed statistic and the claimed parameter value.
one-sided or one-tailed
A type of alternative hypothesis where the researcher is interested in deviations in only one direction. (”<” or ”>” is in Ha.)
out-of-control process
One sample mean outside three standard deviations of x or nine sample means in a row above or below the center line.
outlier
An observation that falls outside the overall pattern of the data set.
P-value
The probability of getting a test statistic as extreme or more extreme than the value observed assuming H0 is true.
parameter
A characteristic of a population that is usually unknown, this could be mean, median, proportion, standard deviation computed on all the data from the population., a parameter does not have variability.
parameter symbols
u, o, and p (mean of population, standard deviation of population, proportion of a population, respectively)
positive association
High values of one variable tend to associate with high values of another variable.
probability of an outcome
A measure of the proportion of times an outcome occurs in a very long series of repetitions that gives us an indication of the likelihood of the outcome.
process
Sequence of operations used in production, manufacturing, etc.
process in statistical control
A process whose inputs and outputs exhibit natural variation when observed over time.
quality control chart
A chart plotting the means x of regular samples of size n against time, this chart is used to access whether the process is in control.
quantitative bivariate
The type of data required for regression analysis.
r
The symbol for correlation coefficient.
r2
The percentage of total variation in the response variable, Y, that is explained by the regression equation, in other words, the percentage of total variation in the response variable, Y, that is explained by the explanatory variable, X.
random
A phenomenon that describes the uncertainty of individual outcomes but gives a regular distribution of the outcomes in the long run.
regression equation
A formula for a line that models a linear relationship between two quantitative variables.
residual
The observed y minus the predicted y, denoted: y  yˆ
residual plot
A diagnostic plot of the explanatory variable versus the residuals used to access how well the regression line fits the data, complete scatter in a shoebox pattern is good whereas a megaphone pattern denotes unequal variance in Ys across all levels of X and curvature in the form of a smile or a frown denotes that the linear model is not best for that data.
sample mean, x bar
The random variable of the sampling distribution of x .
sample space
The list of all possible outcomes of a random phenomenon.
sampling distribution
A distribution of a statistic, a list of all the possible values of a statistic together with the frequency (or probability) of each value.
sampling distribution of x
A list of all the possible values for x together with the frequency (or
probability) of each value, in other words, the distribution of all x ’s from all possible samples.
sampling variability
The variability of sample results from one sample to the next, something we must measure in order to effectively do inference.
scatterplot
A two dimensional plot used to examine strength of relationship between two variables as well as direction and type of relationship.
significance level (denoted a)
The risk (probability) of incorrectly rejecting a true null hypothesis, used as a comparison for P-value—when P-value is less than a, we reject the null hypothesis and conclude that the alternative hypothesis is correct.
simple conditions for inference
SRS, population normally distributed and s known.
Simpsons paradox
A condition where the percentages reverse when a third (lurking) variable is ignored, in other words, a condition leading to misinterpretation of the direction of association between two variables caused by ignoring a third variable that is associated with both of the reported variables.
simulation
Using random numbers to imitate chance behavior.
slope
A measure of the average change in the response variable for every one unit increase in the explanatory or independent variable.
standard deviation (s)
A measure of the variability of data in a sample about x .
standard deviation of x (also called the standard deviation of the sampling distribution of x )
\A measure of the variability of the values of the statistic x about , a measure of the variability of the sampling distribution of x bar, in other words, the average amount that the statistic, x bar, deviates from its associated parameter. Computed as n / s.
statistic
A number computed from sample data (without any knowledge of the value of a parameter) used to estimate the value of the parameter.
statistic symbols
x bar, s, pˆ (mean of sample, standard deviation of sample, proportion of sample, respectively)
statistical process control
A procedure used to check a process at regular intervals to detect problems and correct them before they become serious.
statistically significant
Results of a study that differ too much from what we expected (if H0 were true) because of randomization to attribute to chance variation, the difference between the observed statistic and the claimed parameter value (i.e., observed effect) is too large to be due to chance. Results are classified as statistically significant when P-value < a.
sum of squared residuals (or error)
the residuals are squared and added, denoted SSE.
test of significance
Procedure used to assess the evidence against a claim (hypothesis) about the value of a parameter.
test statistic
A number that summarizes the data for a test of significance, usually used to obtain P-value.
total variation in Y
The sum of the squared deviations of the Y observations about their mean, y .
two-sided alternative
A type of alternative hypothesis where the researcher is interested in deviations in both directions. (≠ is in Ha.)
two-way table
A table containing counts for two categorical variables. It has r rows and c columns.
unbiased
A condition where the mean of the statistic values equals the parameter that the statistic estimates.
unexplained variation
The sum of squared residuals
X
The symbol for explanatory variable.
x -chart
A plot of sample means over time used to assess whether a process is in control.
Y
The symbol for response variable.
The symbol for predicted y.
z*
The z-value obtained from table C corresponding to a desired level of confidence, used in computing a confidence interval.
z-score
A measure of the number of standard deviations of a value or observation from the mean.
zero or one
The probability that the value of the parameter  is in the confidence interval (26.7, 29.3). OR the probability that the null hypothesis is true.