One-way analysis of variance

Читайте также:

Chapter 5

Analysis of variance (ANOVA)

Introduction

In Chapter 1 we discussed how to test whether or not the means of two populations are equal. Recall that the test involved the selection of an independent random sample from each of the populations. In this chapter we will discuss a statistical procedure for determining whether or not the means of more than two populations are equal. The technique that we will be introduced is called the analysis of variance (ANOVA) procedure.

One-way analysis of variance

This section discusses the one-way analysis of variance procedure to make tests comparing means often is called one-way analysis, because we will analyze only one factor. Sometimes we may analyze the effects of two factors. This is called two-way analysis of variance.

Suppose that we have independent samples of sizes

observations selected randomly from K populations. Assume that population means are . The one-way analysis of variance procedure designated to test the null hypothesis

(All population means are equal)

against the alternative hypothesis

At least one of the population has different mean

The application of one way analysis of variance requires that the following assumptions hold:

1. The populations from which the samples are selected are (approximately) normal.

2. The populations from which the samples are selected have the same variance (or standard deviation).

3. The samples selected from different populations are independent.

Suppose that from K populations samples of sizes are selected (Figure 5.1)

Figure 5.1

BlockPOPULATION(GROUP)

1 2 …. K

….

…. …. …. ….

…. …. …. …..

….

1) The first step is to calculate the sample mean for the K groups of observations. These sample means will be denoted as .

In general

where denotes the number of observations in group.

2) The second step is to find overall mean of the all sample observations, denoted , and defined as

where n denotes the total number of sample observations

An equivalent expression for overall mean is

3) In third step, we consider variability within-groups. To measure variability in the any group, we calculate the sum of squared deviations of the observations about their sample means. Within-groups variability will be denoted by SS. For example, for the first group the sum of squared deviations of the observations about their sample mean is

For the second group, whose sample mean is , we calculate

and so on.

4) In fourth step, we find total within-groups variability, denoted SSW. That is

5) Now we need a measure of variability between groups. It is based on the discrepancies between the individual group means and the overall mean. Total between-groups sum of squares denoted , and defined as

6) As a last step, we calculate the sum of squared discrepancies of all the sample observations about their overall mean. This is called the total sum of squares, denoted , expressed as

It can be shown, that the total sum of squares is the sum of the within-groups and between-groups sum of squares, that is

Testing the equality of population means is based on the assumption that K populations have equal variances (or standard deviations).

is true, each of the SSW and SSG can be used as the basis for estimate of the common population variance. To obtain these estimates, the sums of squares must be divided by the corresponding numbers of degrees of freedom.

SSW divided by results estimate called the within-groups mean square, denoted MSW, so that

SSG divided by results estimate called the between-groups mean square, denoted MSG, so that

The test of null hypothesis is based on the ratio of the mean squares

If this ratio is close to 1, there would be little cause to doubt the null hypothesis of equality of population means.

Дата добавления: 2015-08-05; просмотров: 123 | Нарушение авторских прав

<== предыдущая страница	\|	следующая страница ==>
Exercises	\|	Summary

mybiblioteka.su - 2015-2025 год. (0.008 сек.)