Six Sigma Quality Resources for European Companies, in association with Valeocon Management Consulting

Using ANOVA to Find Differences in Population Means


    By Chew Jian Chieh

    Three methods used to dissolve a powder in water are compared by the time (in minutes) it takes until the powder is fully dissolved. The results are summarized in the following table:

    Method Results

    It is thought that the population means of the three methods, μ1, μ2 and μ3, are not all equal, i.e., at least one μ is different from the others. How can this be tested?

    One way is to use multiple two-sample t-tests and compare Method 1 with Method 2, Method 1 with Method 3 and Method 2 with Method 3 (comparing all the pairs). But if α for each test is 0.05, the probability of making at least one Type I error across the three tests would be greater than 0.05.
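To give a rough sense of how quickly the error rate grows, here is a minimal Python sketch of the familywise Type I error rate. It assumes the three tests are independent, which is only an approximation for pairwise t-tests that share samples; the variable names are illustrative.

```python
# Familywise Type I error rate for m tests, each run at level alpha.
# Assumption: the tests are treated as independent (an approximation here).
alpha = 0.05  # per-test significance level, as in the text
m = 3         # number of pairwise comparisons among three methods

# Probability of at least one false positive across all m tests.
familywise_error = 1 - (1 - alpha) ** m
print(f"{familywise_error:.4f}")  # 0.1426
```

Under this assumption, the chance of at least one false alarm is roughly 14 percent rather than 5 percent.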

    A better method is called Analysis of Variance, or ANOVA, which is a statistical technique for determining the existence of differences among several population means. The technique requires the analysis of different forms of variances – hence the name. But note: ANOVA is not used to show that variances are different (that is a different test); it is used to show that means are different.

    How ANOVA Works

    Basically, ANOVA compares two types of variances: the variance within each sample and the variance between different samples. The following figure displays the data per method and helps to show how ANOVA works. The black dotted arrows show the per-sample variation of the individual data points around the sample mean (the variance within). The red arrows show the variation of the sample means around the grand mean (the variance between).

    Comparing Variances Using ANOVA

    The reasoning is as follows: if the population means are different, then the variance within the samples will be small compared to the variance between the samples. Hence, if the variance between divided by the variance within is large enough, we conclude that the means are different.

    Steps for Using ANOVA

    Step 1: Compute the Variance Between

    First, the sum of squares (SS) between is computed:

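    \[ SS_{\text{between}} = \sum_{i=1}^{k} n_i \, (\bar{x}_i - \bar{\bar{x}})^2 \]

    Where \bar{x}_i (x-bar) is the mean of sample i, n_i is the number of observations in sample i, k is the number of samples and \bar{\bar{x}} (x-double-bar) is the overall mean or grand mean. This can be easily found using spreadsheet software.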


    Now, the variance between or mean square between (ANOVA terminology for variance) can be computed.

    The formula for sample variance is:

    \[ s^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1} \]


    Since there are three sample means and a grand mean, however, this is modified to:

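    \[ MS_{\text{between}} = \frac{\sum_{i=1}^{k} n_i \, (\bar{x}_i - \bar{\bar{x}})^2}{k - 1} \]

    Where k is the number of distinct samples. In other words, the variance between is the SS between divided by k – 1. In this example, SS between is 40 and k is 3, so the variance between is 40 / 2 = 20.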

    (This example uses Microsoft Excel software. In Minitab software, SS between is called SS factor, variance between is called MS factor and k – 1 is listed as the degrees of freedom, DF.)
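Step 1 can be sketched in a few lines of Python. The function name and the toy samples below are illustrative assumptions, not the article's spreadsheet or its measurements; the last lines plug in the article's own figures (SS between = 40, k = 3).

```python
def mean_square_between(groups):
    """Return (SS between, variance between) for a list of samples."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    # Squared distance of each sample mean from the grand mean, weighted by sample size.
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    return ss_between, ss_between / (k - 1)

# Hypothetical toy data, NOT the measurements from the article.
toy_groups = [[1.0, 2.0, 3.0], [2.0, 3.0, 4.0], [3.0, 4.0, 5.0]]
ss_b, ms_b = mean_square_between(toy_groups)
print(ss_b, ms_b)  # 6.0 3.0

# The article's figures: SS between = 40 with k = 3 samples.
article_ms_between = 40 / (3 - 1)
print(article_ms_between)  # 20.0
```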

    Step 2: Compute the Variance Within

    Again, first compute the sum of squares within. This is:

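    \[ SS_{\text{within}} = \sum_{i=1}^{k} \sum_{j=1}^{n_i} (x_{ij} - \bar{x}_i)^2 \]

    Where x_{ij} is the j-th observation in sample i, \bar{x}_i is the mean of sample i and n_i is the number of observations in sample i. The squared deviations are summed within each sample, and the per-sample sums are then added together.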

    SS within is 70 + 62 + 60 = 192.

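    To obtain the variance within, divide the SS within by N – k, where N is the total number of observations and k is the number of samples:

    \[ MS_{\text{within}} = \frac{SS_{\text{within}}}{N - k} = \frac{192}{16 - 3} \approx 14.77 \]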
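Step 2 can be sketched the same way. Again, the function name and toy samples are illustrative assumptions rather than the article's data; the final lines use the article's figures (SS within = 192, N = 16, k = 3).

```python
def mean_square_within(groups):
    """Return (SS within, variance within) for a list of samples."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    ss_within = 0.0
    for g in groups:
        sample_mean = sum(g) / len(g)
        # Squared deviations of each observation from its own sample mean.
        ss_within += sum((x - sample_mean) ** 2 for x in g)
    return ss_within, ss_within / (n_total - k)

# Hypothetical toy data, NOT the measurements from the article.
toy_groups = [[1.0, 2.0, 3.0], [2.0, 3.0, 4.0], [3.0, 4.0, 5.0]]
ss_w, ms_w = mean_square_within(toy_groups)
print(ss_w, ms_w)  # 6.0 1.0

# The article's figures: SS within = 70 + 62 + 60 = 192, N = 16, k = 3.
article_ms_within = (70 + 62 + 60) / (16 - 3)
print(round(article_ms_within, 2))  # 14.77
```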

    Step 3: Compute the Ratio of Variance Between and Variance Within

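    This is called the F-ratio. How can this be interpreted? If the null hypothesis is true, meaning μ1, μ2 and μ3 are all equal, then the sample means differ only because of random sampling variation. In that case the variance between and the variance within both estimate the same population variance, and the F-ratio is expected to be close to 1.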

    If the null hypothesis is not true, then this F-ratio will become larger, and the larger it gets, the more likely it is that the null hypothesis will be rejected.

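    Using the F-distribution with k – 1 = 2 and N – k = 13 degrees of freedom (k = 3 samples and N = 16 observations in total), one gets a p-value of 0.292 for the observed F-ratio of 1.354 (the easiest way is to use the FDIST function in Excel). This means that, if the null hypothesis were true, an F-ratio of 1.354 or larger would still occur by chance 29.2 percent of the time.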

    Hence, if one sets α = 0.05, one cannot reject the null hypothesis that there is no difference in the population means, because the p-value of 0.292 is greater than 0.05.
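The F-ratio and p-value can be checked from the article's summary figures with a short Python sketch. It relies on a closed form for the upper tail of the F-distribution that holds only when the numerator has 2 degrees of freedom, as it does here; for other cases a statistics library or Excel's FDIST would be needed. The variable names are illustrative.

```python
# Summary figures taken from the article.
ss_between, ss_within = 40.0, 192.0
k, n_total = 3, 16

df_between = k - 1        # 2
df_within = n_total - k   # 13

f_ratio = (ss_between / df_between) / (ss_within / df_within)

# Upper-tail probability of F(2, df_within).
# This closed form is valid ONLY for numerator df = 2.
assert df_between == 2
p_value = (1 + 2 * f_ratio / df_within) ** (-df_within / 2)

alpha = 0.05
print(f"F = {f_ratio:.3f}, p = {p_value:.3f}")  # F = 1.354, p = 0.292
print("reject H0" if p_value < alpha else "fail to reject H0")  # fail to reject H0
```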

    In Minitab, the results for the same data are displayed in the session window like this:

    If there had been a significant difference between the samples, the p-value would have been smaller than α, and at least one confidence interval for a mean would have had little or no overlap with the other confidence intervals. This would have indicated a significant difference between that population mean and the other population means.

    Minitab also computes an R-squared value (R-Sq) as SS factor / SS total × 100 = 40 / 232 × 100 = 17.24. This shows the percentage of the total variation that is explained by the factor. Here, the factor explains only 17.24 percent of the total variation; hence, the choice of method does not explain the dissolving times well.
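The R-Sq arithmetic can be reproduced directly from the two sums of squares. This is a minimal sketch using the article's figures; the variable names are illustrative and are not Minitab output fields.

```python
ss_factor = 40.0            # SS between (Minitab: SS factor)
ss_within = 192.0           # SS within
ss_total = ss_factor + ss_within  # 232.0, the total variation

# Share of the total variation explained by the factor, in percent.
r_sq = ss_factor / ss_total * 100
print(round(r_sq, 2))  # 17.24
```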

    About the Author: Chew Jian Chieh is a senior consultant and Master Black Belt with Valeocon Management Consulting and supports clients across Asia and China. He has extensive experience in implementing process and organization improvements for various industries. He specializes in Lean Six Sigma, Strategy Development/Deployment and Change Management. Chew JC is a Singapore national. He can be reached at jian-chieh.chew@valeocon.com.

     
    Copyright © 2000-2009 iSixSigma – All Rights Reserved
    Reproduction Without Permission Is Strictly Prohibited

