Set the Language

We weren't able to detect the audio language on your flashcards. Please select the correct language below.

Front

Back

Related Flashcards

Flashcards
»
Statistics 221 Exam 2

Statistics 221 Exam 2

by rachelle.harrison12, Oct. 2008

Subjects: statistics

Favorite

Add to folder

Flag

Related Essays

â€¢ What Is An Alternative Hypothesis In A Courtroom Trial?
Define the terms null hypothesis and alternative hypothesis. The term null hypothesis refers to a general statement or a default position in which there i...
Pearson Correlation Coefficient Essay
The alternative hypothesis is the existing hypothesis that has already been proven or maintained as the true outcome. For instance, water is wet. This is the...
Math Standards
The student applies mathematical process standards to solve problems by collecting, organizing, displaying, and interpreting data. The student is expected to...
BUS308: Statistics For Managers
It includes estimation of population parameter and hypothesis testing. In inferential statistics null hypothesis is a generic statement or certain default ...
Assess The Difference Between Null Hypothesis And Alternative Hypothesis
Certainly, the reading this week highlights one of the most important use of a hypothesis when conducting research. A hypothesis is an educated guess that ...
Graded Test Reflection
It was used to represent the entire population. c) What is the difference between x-bar and mu?
Null And Alternative Hypothesis In Research
Set type 1 (alpha) and type 2 (beta) error. Use the appropriate table to to look for the corresponding sample size. Basic Concepts Hypotheses: Null and...
Descriptive Statistics: Analysis Of Numerical Data
Introduction Statistics is the practice or science of collecting and analyzing numerical data in large quantities, especially for the purpose of inferring pr...
Malcolm Gladwell's Outliers: The Stories Of Success
“A statistical observation that is markedly different in value from the others of the sample” (Gladwell,
Emory Healthcare Case Study
Inferential statistics use a random sample of data taken from a population to describe and make inferences about the whole population. In order for the resea...

Shuffle
Toggle On

Toggle Off
Alphabetize
Toggle On

Toggle Off
Front First
Toggle On

Toggle Off
Both Sides
Toggle On

Toggle Off
Read
Toggle On

Toggle Off

Reading...

Front

Card Range To Study

through

Play button

Progress

1/85

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

85 Cards in this Set

Front
Back

	alternative hypothesis	The hypothesis that the researcher usually wants to prove or verify, a statement about the value of a parameter that is either less than, greater than, or not equal to.
	association	Values of one variable tend to occur with certain values of another variable, detected when the conditional distributions differ from the marginal distribution and from each other.
	bias	A condition where the mean of the statistic values differs from the parameter that the statistic estimates.
	bivariate data	Data collected on two variables for each individual in a study.
	C	The symbol for level of confidence.
	Central Limit Theorem	The name of the statement telling us that the sampling distribution of x bar is approximately normal whenever the sample is large and random.
	conditional distribution	The distribution of the values in a single row (or a single column) of a two-way table.
	conditions required for the sampling distribution of x bar to be approximately normal	Sample size is large and sample is random (Central Limit Theorem application).
	conditions required for the sampling distribution of x to be normal	Original population is normal and sample is random.
	confidence interval	An estimate of the value of a parameter in interval form with an associated level of confidence, in other words, a list of reasonable values for the parameter based on a value of the statistic.
	control chart	A statistical tool for monitoring the input or output of a process.
	control limits	m – 3s / √n and m + 3s / √n , used to detect out-of-control signals in a control chart.
	correlation coefficient	A measure of the strength of the linear relationship between two quantitative variables.
	decreased	What happens to the width of a confidence interval when the sample size is increased.
	disjoint events	Events that cannot occur simultaneously.
	distribution of a variable	A list of the possible values of a variable together with the frequency of each value. (Note
	event	A single outcome or a combination of outcomes from a random phenomenon.
	extrapolation	Predicting a Y value using a value of X that is outside of the range of X values used to obtain the regression equation. This prediction could be very far off.
	garbage	Results from statistical analyses performed on non-random samples.
	increased	What happens to the width of a confidence interval when the level of confidence is increased.
	inference	Using results from a sample statistic value to draw conclusions about the population parameter.
	influential observation	An observation that substantially alters the values of slope and y-intercept in the regression equation when it is included in the computations.
	law of large numbers	The fact that the average ( x ) of observed values in a sample will get closer and closer to m as the sample size increases.
	laws of probability	The basis for hypothesis testing and confidence interval estimation.
	least squares	A method for finding the equation of a line that minimizes the sum of squared residuals.
	least squares regression line	The line with the smallest sum of squared residuals.
	level of confidence	The percentage of the time that the confidence interval procedure will produce confidence intervals that contain the parameter being estimated, the probability that the confidence interval to be computed (before sample is taken and data are gathered) will contain the value of the parameter. (NOTE
	lurking variable	A variable that is not measured but explains association between two variables that are measured.
	marginal distribution	The distribution of the values in the “total” row (or the “total” column) of a two-way table.
	margin of error	The maximum amount that a statistic value will differ from the parameter value for the middle (1-C)´ 100% of the distribution of all possible statistics. (Note: (1-C)´ 100% is a generic way of expressing level of confidence.)
	mean of the sampling distribution of x bar	the mean of all the sample means (x bar's) from all possible samples of size n from a population, equals m.
	m	The mean of the population
	no association	A condition where values of one variable occur independent of values of another variable, detected when the conditionals of a two-way table equal the marginal distribution (and each other.)
	null hypothesis	The hypothesis that the researcher assumes to be true until sample results indicate otherwise, the hypothesis of no difference or no change, hypothesis researcher usually wants to disprove.
	observed effect	The difference between the observed statistic and the claimed parameter value.
	one-sided or one-tailed	A type of alternative hypothesis where the researcher is interested in deviations in only one direction. (”<” or ”>” is in Ha.)
	out-of-control process	One sample mean outside three standard deviations of x or nine sample means in a row above or below the center line.
	outlier	An observation that falls outside the overall pattern of the data set.
	P-value	The probability of getting a test statistic as extreme or more extreme than the value observed assuming H0 is true.
	parameter	A characteristic of a population that is usually unknown, this could be mean, median, proportion, standard deviation computed on all the data from the population., a parameter does not have variability.
	parameter symbols	u, o, and p (mean of population, standard deviation of population, proportion of a population, respectively)
	positive association	High values of one variable tend to associate with high values of another variable.
	probability of an outcome	A measure of the proportion of times an outcome occurs in a very long series of repetitions that gives us an indication of the likelihood of the outcome.
	process	Sequence of operations used in production, manufacturing, etc.
	process in statistical control	A process whose inputs and outputs exhibit natural variation when observed over time.
	quality control chart	A chart plotting the means x of regular samples of size n against time, this chart is used to access whether the process is in control.
	quantitative bivariate	The type of data required for regression analysis.
	r	The symbol for correlation coefficient.
	r2	The percentage of total variation in the response variable, Y, that is explained by the regression equation, in other words, the percentage of total variation in the response variable, Y, that is explained by the explanatory variable, X.
	random	A phenomenon that describes the uncertainty of individual outcomes but gives a regular distribution of the outcomes in the long run.
	regression equation	A formula for a line that models a linear relationship between two quantitative variables.
	residual	The observed y minus the predicted y, denoted: y yˆ
	residual plot	A diagnostic plot of the explanatory variable versus the residuals used to access how well the regression line fits the data, complete scatter in a shoebox pattern is good whereas a megaphone pattern denotes unequal variance in Ys across all levels of X and curvature in the form of a smile or a frown denotes that the linear model is not best for that data.
	sample mean, x bar	The random variable of the sampling distribution of x .
	sample space	The list of all possible outcomes of a random phenomenon.
	sampling distribution	A distribution of a statistic, a list of all the possible values of a statistic together with the frequency (or probability) of each value.
	sampling distribution of x	A list of all the possible values for x together with the frequency (or probability) of each value, in other words, the distribution of all x ’s from all possible samples.
	sampling variability	The variability of sample results from one sample to the next, something we must measure in order to effectively do inference.
	scatterplot	A two dimensional plot used to examine strength of relationship between two variables as well as direction and type of relationship.
	significance level (denoted a)	The risk (probability) of incorrectly rejecting a true null hypothesis, used as a comparison for P-value—when P-value is less than a, we reject the null hypothesis and conclude that the alternative hypothesis is correct.
	simple conditions for inference	SRS, population normally distributed and s known.
	Simpsons paradox	A condition where the percentages reverse when a third (lurking) variable is ignored, in other words, a condition leading to misinterpretation of the direction of association between two variables caused by ignoring a third variable that is associated with both of the reported variables.
	simulation	Using random numbers to imitate chance behavior.
	slope	A measure of the average change in the response variable for every one unit increase in the explanatory or independent variable.
	standard deviation (s)	A measure of the variability of data in a sample about x .
	standard deviation of x (also called the standard deviation of the sampling distribution of x )	\A measure of the variability of the values of the statistic x about , a measure of the variability of the sampling distribution of x bar, in other words, the average amount that the statistic, x bar, deviates from its associated parameter. Computed as n / s.
	statistic	A number computed from sample data (without any knowledge of the value of a parameter) used to estimate the value of the parameter.
	statistic symbols	x bar, s, pˆ (mean of sample, standard deviation of sample, proportion of sample, respectively)
	statistical process control	A procedure used to check a process at regular intervals to detect problems and correct them before they become serious.
	statistically significant	Results of a study that differ too much from what we expected (if H0 were true) because of randomization to attribute to chance variation, the difference between the observed statistic and the claimed parameter value (i.e., observed effect) is too large to be due to chance. Results are classified as statistically significant when P-value < a.
	sum of squared residuals (or error)	the residuals are squared and added, denoted SSE.
	test of significance	Procedure used to assess the evidence against a claim (hypothesis) about the value of a parameter.
	test statistic	A number that summarizes the data for a test of significance, usually used to obtain P-value.
	total variation in Y	The sum of the squared deviations of the Y observations about their mean, y .
	two-sided alternative	A type of alternative hypothesis where the researcher is interested in deviations in both directions. (≠ is in Ha.)
	two-way table	A table containing counts for two categorical variables. It has r rows and c columns.
	unbiased	A condition where the mean of the statistic values equals the parameter that the statistic estimates.
	unexplained variation	The sum of squared residuals
	X	The symbol for explanatory variable.
	x -chart	A plot of sample means over time used to assess whether a process is in control.
	Y	The symbol for response variable.
	yˆ	The symbol for predicted y.
	z*	The z-value obtained from table C corresponding to a desired level of confidence, used in computing a confidence interval.
	z-score	A measure of the number of standard deviations of a value or observation from the mean.
	zero or one	The probability that the value of the parameter is in the confidence interval (26.7, 29.3). OR the probability that the null hypothesis is true.

Share This Flashcard Set