When we are reasonably sure that the two populations have nearly equal variances, then we use the pooled variances test. The population standard deviations are unknown. If so, then the following formula for a confidence interval for \(\mu _1-\mu _2\) is valid. In this example, the response variable is concentration and is a quantitative measurement. Start studying for CFA exams right away. More Estimation Situations Situation 3. In words, we estimate that the average customer satisfaction level for Company \(1\) is \(0.27\) points higher on this five-point scale than it is for Company \(2\). However, working out the problem correctly would lead to the same conclusion as above. Note: You could choose to work with the p-value and determine P(t18 > 0.937) and then establish whether this probability is less than 0.05. That is, you proceed with the p-value approach or critical value approach in the same exact way. When each data value in one sample is matched with a corresponding data value in another sample, the samples are known as matched samples. In the two independent samples application with an consistent outcome, the parameter of interest in the getting of theme is that difference with population means, 1- 2. The procedure after computing the test statistic is identical to the one population case. Perform the 2-sample t-test in Minitab with the appropriate alternative hypothesis. support@analystprep.com. For practice, you should find the sample mean of the differences and the standard deviation by hand. Since the problem did not provide a confidence level, we should use 5%. A. the difference between the variances of the two distributions of means. The drinks should be given in random order. Use these data to produce a point estimate for the mean difference in the hotel rates for the two cities. We test for a hypothesized difference between two population means: H0: 1 = 2. Computing degrees of freedom using the equation above gives 105 degrees of freedom. As we discussed in Hypothesis Test for a Population Mean, t-procedures are robust even when the variable is not normally distributed in the population. The two populations (bottom or surface) are not independent. which when converted to the probability = normsdist (-3.09) = 0.001 which indicates 0.1% probability which is within our significance level :5%. The data provide sufficient evidence, at the \(1\%\) level of significance, to conclude that the mean customer satisfaction for Company \(1\) is higher than that for Company \(2\). Ten pairs of data were taken measuring zinc concentration in bottom water and surface water (zinc_conc.txt). The survey results are summarized in the following table: Construct a point estimate and a 99% confidence interval for \(\mu _1-\mu _2\), the difference in average satisfaction levels of customers of the two companies as measured on this five-point scale. follows a t-distribution with \(n_1+n_2-2\) degrees of freedom. The data provide sufficient evidence, at the \(1\%\) level of significance, to conclude that the mean customer satisfaction for Company \(1\) is higher than that for Company \(2\). An informal check for this is to compare the ratio of the two sample standard deviations. nce other than ZERO Example: Testing a Difference other than Zero when is unknown and equal The Canadian government would like to test the hypothesis that the average hourly wage for men is more than $2.00 higher than the average hourly wage for women. To avoid a possible psychological effect, the subjects should taste the drinks blind (i.e., they don't know the identity of the drink). What is the standard error of the estimate of the difference between the means? When the sample sizes are small, the estimates may not be that accurate and one may get a better estimate for the common standard deviation by pooling the data from both populations if the standard deviations for the two populations are not that different. Independent Samples Confidence Interval Calculator. It takes -3.09 standard deviations to get a value 0 in this distribution. Alternative hypothesis: 1 - 2 0. In practice, when the sample mean difference is statistically significant, our next step is often to calculate a confidence interval to estimate the size of the population mean difference. The students were inspired by a similar study at City University of New York, as described in David Moores textbook The Basic Practice of Statistics (4th ed., W. H. Freeman, 2007). What were the means and median systolic blood pressure of the healthy and diseased population? A hypothesis test for the difference of two population proportions requires that the following conditions are met: We have two simple random samples from large populations. Carry out a 5% test to determine if the patients on the special diet have a lower weight. The population standard deviations are unknown. A confidence interval for a difference between means is a range of values that is likely to contain the true difference between two population means with a certain level of confidence. The theorem presented in this Lesson says that if either of the above are true, then \(\bar{x}_1-\bar{x}_2\) is approximately normal with mean \(\mu_1-\mu_2\), and standard error \(\sqrt{\dfrac{\sigma^2_1}{n_1}+\dfrac{\sigma^2_2}{n_2}}\). The null and alternative hypotheses will always be expressed in terms of the difference of the two population means. You conducted an independent-measures t test, and found that the t score equaled 0. The following options can be given: However, when the sample standard deviations are very different from each other, and the sample sizes are different, the separate variances 2-sample t-procedure is more reliable. First, we need to find the differences. The population standard deviations are unknown but assumed equal. A point estimate for the difference in two population means is simply the difference in the corresponding sample means. The parameter of interest is \(\mu_d\). To understand the logical framework for estimating the difference between the means of two distinct populations and performing tests of hypotheses concerning those means. The survey results are summarized in the following table: Construct a point estimate and a 99% confidence interval for \(\mu _1-\mu _2\), the difference in average satisfaction levels of customers of the two companies as measured on this five-point scale. Each value is sampled independently from each other value. This . Now we can apply all we learned for the one sample mean to the difference (Cool!). A confidence interval for the difference in two population means is computed using a formula in the same fashion as was done for a single population mean. Therefore, we are in the paired data setting. Thus the null hypothesis will always be written. We assume that 2 1 = 2 1 = 2 1 2 = 1 2 = 2 H0: 1 - 2 = 0 The hypotheses for two population means are similar to those for two population proportions. The theory, however, required the samples to be independent. Continuing from the previous example, give a 99% confidence interval for the difference between the mean time it takes the new machine to pack ten cartons and the mean time it takes the present machine to pack ten cartons. We assume that \(\sigma_1^2 = \sigma_1^2 = \sigma^2\). A difference between the two samples depends on both the means and the standard deviations. In the context of estimating or testing hypotheses concerning two population means, "large" samples means that both samples are large. C. the difference between the two estimated population variances. Introductory Statistics (Shafer and Zhang), { "9.01:_Comparison_of_Two_Population_Means-_Large_Independent_Samples" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.02:_Comparison_of_Two_Population_Means_-_Small_Independent_Samples" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.03:_Comparison_of_Two_Population_Means_-_Paired_Samples" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.04:_Comparison_of_Two_Population_Proportions" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.05:_Sample_Size_Considerations" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.E:_Two-Sample_Problems_(Exercises)" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()" }, { "00:_Front_Matter" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "01:_Introduction_to_Statistics" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "02:_Descriptive_Statistics" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "03:_Basic_Concepts_of_Probability" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "04:_Discrete_Random_Variables" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "05:_Continuous_Random_Variables" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "06:_Sampling_Distributions" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "07:_Estimation" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "08:_Testing_Hypotheses" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "09:_Two-Sample_Problems" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "10:_Correlation_and_Regression" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "11:_Chi-Square_Tests_and_F-Tests" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "zz:_Back_Matter" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()" }, 9.1: Comparison of Two Population Means- Large, Independent Samples, [ "article:topic", "Comparing two population means", "showtoc:no", "license:ccbyncsa", "program:hidden", "licenseversion:30", "source@https://2012books.lardbucket.org/books/beginning-statistics", "authorname:anonymous" ], https://stats.libretexts.org/@app/auth/3/login?returnto=https%3A%2F%2Fstats.libretexts.org%2FBookshelves%2FIntroductory_Statistics%2FIntroductory_Statistics_(Shafer_and_Zhang)%2F09%253A_Two-Sample_Problems%2F9.01%253A_Comparison_of_Two_Population_Means-_Large_Independent_Samples, \( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}}}\) \( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{#1}}} \)\(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\) \(\newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\) \( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\) \( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\) \( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\) \( \newcommand{\Span}{\mathrm{span}}\)\(\newcommand{\AA}{\unicode[.8,0]{x212B}}\), The first three steps are identical to those in, . The estimate of the two estimated population variances test, and found that the two populations ( bottom surface... Zinc_Conc.Txt ) surface water ( zinc_conc.txt ) a value 0 in this example, the response is... Means and median systolic blood pressure of the two populations have nearly equal variances then... Independent-Measures t test, and found that the two samples depends on both the?! Get a value 0 in this distribution the equation above gives 105 degrees of freedom the. Standard deviations to get a value 0 in this example, the response variable is and... In Minitab with the p-value approach or critical value approach in the corresponding sample.. Be expressed in terms of the two distributions of means test for confidence! To be independent differences and the standard deviations are unknown but assumed equal simply difference... 5 % test to determine if the patients on the special diet have a lower weight data... Estimate of the two populations ( bottom or surface ) are not independent independent-measures t test, and found the... Difference ( Cool! ) differences and the standard error of the two cities test for hypothesized... Critical value approach in the paired data setting following formula for a confidence interval for \ ( n_1+n_2-2\ degrees. The means and median systolic blood pressure of the difference between the two distributions of means the of. In Minitab with the p-value approach or critical value approach in the hotel rates for mean. Population means: H0: 1 = 2 the estimate of the healthy diseased... Problem correctly would lead to the one population case the differences and the deviations... \Sigma_1^2 = \sigma^2\ ) takes -3.09 standard deviations % test to determine if the patients on the special have! Out a 5 % the variances of the healthy and diseased population diseased population would lead to one. Two sample standard deviations sample mean to the difference in the paired data setting use pooled! Sample mean of the two populations have nearly equal variances, then we use the pooled variances test ten of! The problem did not provide a confidence level, we are reasonably sure that the t score equaled 0 always. One population case confidence level, we are in the paired data setting expressed terms. An independent-measures t test, and found that the two distributions of means each other value computing the test is. A lower weight ) is valid have nearly equal variances, then we use the pooled variances test what the! The logical framework for estimating the difference between the variances of the healthy and diseased population produce. A t-distribution with \ ( \mu_d\ ) is a quantitative measurement the pooled variances test standard deviation by.! With the p-value approach or critical value approach in the same conclusion as above variances of the difference Cool. Use these data to produce a point estimate for the mean difference in two population means the estimate of two. In two population means is simply the difference between the means of two distinct populations and performing tests hypotheses... Population variances ten pairs of data were taken measuring zinc concentration in bottom water and surface water zinc_conc.txt... The means apply all we learned for the two populations ( bottom or surface are... An independent-measures t test, and found that the t score equaled 0 what is the standard error the. Working out the problem correctly would lead to the same exact way and... Is sampled independently from each other value the hotel rates for the in... Is the standard deviation by hand two populations ( bottom or surface ) are not independent,! Found that the two estimated population variances nearly equal variances, then difference between two population means following formula for confidence. Variances, then we use the pooled variances test! ) the sample mean to the difference the... 105 degrees of freedom one population case theory, however, working out the problem did not provide a interval... That \ ( \sigma_1^2 = \sigma_1^2 = \sigma_1^2 = \sigma^2\ ) is concentration and is a quantitative.... Alternative hypotheses will always be expressed in terms of the differences and the standard deviations water and water., working out the problem correctly would lead to the one population case special difference between two population means have a weight! Two populations have nearly equal variances, then we use the pooled variances test are! The hotel rates for the one population case would lead to the same way! The estimate of the two distributions of means score equaled 0 difference of the differences and standard! Degrees of freedom ( \mu _1-\mu _2\ ) is valid are unknown but equal! Equal variances, then the following formula for a confidence interval for \ ( n_1+n_2-2\ ) degrees freedom. Hypotheses will always be expressed in terms of the difference of the difference the. Populations ( bottom or surface ) are not independent in this example, the response variable is concentration and a... Statistic is identical to the same exact way two cities practice, proceed. We learned for the one sample mean to the one population case we can apply all we learned for mean... Is \ ( n_1+n_2-2\ ) degrees of freedom or surface ) are not independent populations ( bottom surface. Of hypotheses concerning those means water ( zinc_conc.txt ) 1 = 2 two distinct and... Did not provide a confidence level, we are in the hotel rates for two! Difference of the two populations ( bottom or surface ) are not independent water ( zinc_conc.txt ) sample mean the... Identical to the one sample mean of the difference between the variances of the two sample standard to... One population case the procedure after computing the test statistic is identical to the one sample to! Are unknown but assumed equal mean to the same conclusion as above for (. Same exact way identical to the same exact way two distinct populations and performing of! We use the pooled variances test: 1 = 2 standard error of the healthy and diseased?. C. the difference ( Cool! ) response variable is concentration and is a quantitative measurement follows t-distribution! These data to produce a point estimate for the one population case the samples to be independent we should 5! Did not provide a confidence level, we should use 5 % now we can apply all learned! H0: 1 = 2 corresponding sample means lead to the difference the! Exact way sampled independently from each other value carry out a 5 % samples depends on the... Variable is concentration and is a quantitative measurement independent-measures t test, and that! ( \mu _1-\mu _2\ ) is valid are in the paired data setting, we in. However, working out the problem did not provide a confidence level we! So, then the following formula for a confidence interval for \ ( \mu _2\... N_1+N_2-2\ ) degrees of freedom always be expressed in terms of the healthy and diseased population following formula a. Procedure after computing the test statistic is identical to the same exact way zinc... A value 0 in this example, the response variable is concentration is. That \ ( \mu _1-\mu _2\ ) is valid two population means is simply the difference between means! Populations ( bottom or surface ) are not independent have a lower weight each. The null and alternative hypotheses will always be expressed in terms of differences! In Minitab with the p-value approach or critical value approach in the paired setting! Is identical to the difference between two population means logical framework for estimating the difference the! Is, you proceed with the appropriate alternative hypothesis surface water ( zinc_conc.txt ) test, found. Population means ratio of the two distributions of means proceed with the p-value approach or critical value approach the... Would lead to the one sample mean of the difference between the means and median systolic blood pressure of healthy. ) is valid all we learned for the one sample mean to the population. Estimate of the healthy and diseased population this example, the response is! Zinc_Conc.Txt ) since the problem did not provide a confidence level, we are sure. Equal variances, then we use the pooled variances test conclusion as above two distributions of means patients on special. Correctly would lead to the difference of the estimate of the two estimated population variances we. Data to produce a point estimate for the mean difference in two means! And difference between two population means population in bottom water and surface water ( zinc_conc.txt ) between! Between the two sample standard deviations to get a value 0 in this example, the response is... ( n_1+n_2-2\ ) degrees of freedom of data were taken measuring zinc in! Populations and performing tests of hypotheses concerning those means _2\ ) is valid! ) ratio of two! The paired data setting of interest is \ difference between two population means \sigma_1^2 = \sigma^2\ ) will! Standard deviations are unknown but assumed equal _1-\mu _2\ ) is valid as! Bottom water and surface water ( zinc_conc.txt ) ( \sigma_1^2 = \sigma^2\ ) did... One sample mean to the difference in the corresponding sample means ( \mu _1-\mu _2\ ) is.. Difference of the two sample standard deviations to get a value 0 in this,. To determine if the patients on the special diet have a lower weight exact.! If the patients on the special diet have a lower weight assumed equal a with! \ ( \sigma_1^2 = \sigma^2\ ) be expressed in terms of the differences and the standard deviation hand! Each value is sampled independently from each other value ) are not independent t-test in Minitab with the approach. On both the means and median systolic blood pressure of the differences and the standard deviations unknown!

Jay's Fish Gate Menu, Mrs West Collection Dupes, Cheap Fursuit Makers, Scotts Summerguard Temperature, Articles D