Chi-Squared confidence intervals

Everything You Need in One Place

Homework problems? Exam preparation? Trying to grasp a concept or just brushing up the basics? Our extensive help & practice library have got you covered.

Learn and Practise With Ease

Our proven video lessons ease you through problems quickly, and you get tonnes of friendly practise on questions that trip students up on tests and finals.

Instant and Unlimited Help

Our personalized learning platform enables you to instantly find the exact walkthrough to your specific type of question. Activate unlimited help now!

0/1
?
Intros
Lessons
  1. What are Chi-Squared Confidence Intervals?
0/6
?
Examples
Lessons
  1. Determining Degrees of Freedom
    How many degrees of freedom does a sample of size,
    1. 7 have?
    2. 20 have?
  2. Determining the Critical Value for a Chi-Square Distribution (XR2(X_R^2 and XL2)X_L^2)
    If a Chi-Squared distribution has 8 degrees of freedom find XR2X_R^2 and XL2X_L^2, with a
    1. 95% confidence level
    2. 99% confidence level
  3. Determining the Confidence Interval for Variance
    Road and racing bicycles have an average wheel diameter of 622mm. From a sample of 15 bicycles it was found that the wheel diameters have a variance of 10mm. With a 90% confidence level give a range where the variance of all road and racing bicycle wheels lie.
    1. Determining the Confidence Interval for Standard Deviation
      A Soda-pop company "Jim's Old Fashion Soda" is designing their bottling machine. After making 41 bottles they find that their bottles have an average of 335mL of liquid with a standard deviation of 3mL. With a 99% confidence level what is the range of standard deviation that this machine will output per bottle?
      Topic Notes
      ?

      Chi - Squared confidence intervals


      In todays lesson we will learn what is chi square in statistics and how do we use it when constructing confidence intervals.

      What is chi square and what is a chi squared distribution?


      A chi squared distribution comes from the chi square statistic, which measures how different are observed values from the expected ones from a true hypothesis in a statistical test. The chi square symbol is x2x^{2}, and this statistic is also called the goodness-of-fit statistic.

      When talking in terms of the standard normal distribution, if we take a random sample of the values from such a distribution we would obtain a standard normal deviate. Squaring standard normal deviates and adding them together we would obtain a chi squared distribution, where the degrees of freedom are equal to the number of standard normal deviates that were added.

      A Chi-Square with one degree of freedom is written as x2x^{2}(1), is simply the distribution of a single normal deviate squared. The area of a Chi Square distribution below 4 is the same as the area of a standard normal distribution below 2, since 4 is 22. The mean of a Chi Square distribution is its degrees of freedom. Chi Square distributions are positively skewed, as the degrees of freedom increase, the Chi Square distribution approaches a normal distribution.

      The concept of the chi squared statistic can be a bit complicated so let us answer the following questions to have a little bit more insight on the matter:

      • What is chi square distribution in statistics
      Chi square goodness of fit test statistic is a measure of the difference between the observed values and the expected values squared from a sample obtained from a standard normal distribution.

      • Does chi square assume normal distribution
      No, is chi square distribution symmetric then? No, the curve for the chi square graph is skewed towards the right. A chi-square curve has no area associated with negative values and is asymmetric, with a longer tail on the right. There are actually many chi-square distributions, each one identified with a different number of degrees of freedom. Curves corresponding to several chi-square distributions are shown in Figure 1:

      Chi - Squared confidence intervals
      Figure 1: Chi-square curves

      Notice the shape as the degrees of freedom increase, the more degrees of freedom present the more the chi-square curve looks almost symmetric like a normal distribution!

      • What is the shape of the chi-square distribution?
      A chi square distribution graph looks similar to a normal distribution but is skewed towards the right and it has all positive numbers in its horizontal axis. A general shape for a chi square is shown below:

      Chi - Squared confidence intervals
      Figure 2: A typical look for a chi-square curve

      • What is chi square critical value?
      Just as for the normal distribution, when using a chi square distribution it is important to recognize some of its characteristics.

      For example, when looking at what the area under the graph means we will find a confidence levell delimited by a confidence interval, and then a resulting significance level left on the side. Here is where the critical values come to place.

      In any distribution, we know that a critical value is a point in the horizontal axis of the graph which marks the limit between an area (or region) and another. Usually, delimiting in between the confidence level (nαn - \alpha) and the significance level (α\alpha). The chi square distribution follows the same rules and since the units of its horizontal axis is x2x^{2} itself, we call these critical values: chi-square critical values.

      Notice in the graph below how the confidence level, the significance level and the critical values XR2X_{R} ^{2} and XL2X_{L} ^{2} are shown. The R and L subindexes denote the sides right and leftsince we are looking at a two tailed distribution, and notice the significance level is still evenly divided in halves (one for each side) even if the distribution is not symmetrical.

      Chi - Squared confidence intervals
      Figure 3: Important parts of a chi-square distribution

      To find an area to the right of any critical value in the horizontal axis of this distribution such as XR2X_{R} ^{2} and XL2X_{L} ^{2} shown above, we use a chi-square distribution table. This table contains 3 types of values: degrees of freedom, area to the right of a critical value, and the chi square critical value itself and it looks like this:

      Chi-square Table


      Chi - Squared confidence intervals
      Figure 4: Chi distribution table

      At the very left column of this chi squared table are the degrees, the top row contains the area values and the numbers inside this left-column/top-row frame are the chi square critical values. We will learn how this is used in a later section of the lesson.

      An even bigger table (and more complete) can be found in the following link, and we recommend using it in conjunction with a chi square calculator (found at the end of this lesson) when solving multiple problems during studying time on your own.

      When is chi square distribution used?


      To estimate a population variance a Chi-Squared distribution is used,

      Chi-Squared:

      X2=(n1)s2σ2 \large X^{2} = \frac{(n-1)s^{2}}{\sigma^{2}}
      Equation 1: Chi Square Formula

      Where:
      X2X^{2} = chi square
      nn = sample size
      ss = sample standard deviation
      σ\sigma = population standard deviation
      σ2\sigma^{2} = population variance
      n1n-1 = degrees of freedom

      From the chi square formula we can solve for the value of the variance (the square of the populations standard deviation).

      With this we can calculate the confidence interval for that variance too, similarly to how we have used the t-distribution and the margin of error before to calculate the confidence interval for a population mean. The confidence interval, in this case for the variance or standard deviation squared, is given by:

      (n1)s2XR2<  σ2  <(n1)s2XL2 \large \frac{(n-1)s^{2}}{X_{R} ^{2}} < \; \sigma^{2} \; < \frac{(n-1)s^{2}}{X_{L} ^{2}}
      Equation 2: Formula for the confidence interval of the variance

      In this last formula, the chi square values in the denominators of each side refers to the chi square critical values defining the edges between the confidence level and the significance level on the distribution graph, and denoting that we have a two tailed significance level because we have a right side (XR2X_{R} ^{2} ) and a left side (XL2X_{L} ^{2}) as shown below:

      Chi - Squared confidence intervals
      Figure 5: Using the chi square distribution

      Another use for the chi-square distribution in statistics is when we are doing hypothesis testing, in this case, with the chi square test. We will talk about the chi-square test in a lesson later on.

      How to use the chi square distribution table


      The Chi-Square table gives critical value area to the right:

      Chi-square Table


      Chi - Squared confidence intervals
      Figure 6: The Chi Square Table

      Let me explain the chi square distribution table in a little more detail:
      The first column to the left expresses the degrees of freedom for a particular situation, while the top row expresses the area to the right of any specific chi square critical value we are looking at.

      Therefore, to find a value of chi square we look at the point in the table where the specific degrees of freedom row intersects with the column for the specific chi squared critical value. This process can be seen easily in example problem 2 on both parts a and b, so if you have any questions about how the chi square critical value table works, we recommend you to skip into that part of the lesson where you will also be reading about how to calculate chi square related confidence intervals.

      How to solve chi square distribution


      In order to understand better what chi square is and how we use it, let us take a look into a few problem examples.

      Example 1:

      Determining Degrees of Freedom
      How many degrees of freedom does a sample of size,

      a. \quad 7 have?

      The sample size is defined as n=7n=7, and the degrees of freedom arise from simply subtracting n1n-1, therefore for this case the degrees of freedom are n1=71=6n-1=7-1=6.

      And so we have 6 degrees of freedom in this case.

      b. \quad 20 have?

      In this case n=20n = 20.
      Thus n1=201=19n-1=20-1=19.
      19 degrees of freedom in this case.

      Example 2:

      Determining the Critical Value for a Chi-SquareDistribution (XR2X_{R} ^{2} and XL2X_{L} ^{2})
      If a Chi-Squared distribution has 8 degrees of freedom find XR2X_{R} ^{2} and XL2X_{L} ^{2}, with a

      a. \quad 95% confidence level

      If we have 8 degrees of freedom, then this means that n1=8n-1=8, thus the sample size n is equal to 9.
      A confidence level of 95% percent means 1α=0.951 - \alpha = 0.95, therefore the significance level is α=10.95=0.05 \alpha = 1 - 0.95 = 0.05.
      This significance level is spread in two equal parts in two tails since we are asked for both XR2X_{R} ^{2} and XL2X_{L} ^{2}, with this in mind, let us express that information in a chi squared distribution graph as follows:

      Chi - Squared confidence intervals
      Figure 7: The Chi Squared Distribution for this case

      Using the information about the 8 degrees of freedom and the area value of half significance level we use the chi squared table to find the value of both critical values XR2X_{R} ^{2} and XL2X_{L} ^{2} .
      Remember that the area value in the chi square table top row is the area to the right of a critical value, and so, we start with XR2X_{R} ^{2} since the area to its right is simply α2\large \frac{\alpha}{2} :
      Chi - Squared confidence intervals
      Figure 8: The Chi Squared critical value to the right

      Therefore the right critical value is XR2X_{R} ^{2} =17.535.
      Now, the area to the right of the left critical value is equal to the confidence level plus half alfa since it includes both the confidence level area and the right tail.
      Area to the right of the left critical value:

      1 α+- \alpha + α2\large \frac{\alpha}{2} = 0.95 + 0.025 = 0.975
      Equation 3: Area to the right of the left critical value

      We use this to obtain XL2X_{L} ^{2} :
      Chi - Squared confidence intervals
      Figure 9: The Chi Squared critical value to the left

      Therefore the right critical value is XL2X_{L} ^{2} = 2.180 .

      b. \quad 99% confidence level

      Remember we have 8 degrees of freedom, and in this case, a confidence level of 99% percent which means 1α=0.991 - \alpha = 0.99, therefore the significance level is α=10.99=0.01 \alpha = 1 - 0.99 = 0.01.

      This significance level is spread in two equal parts in two tails: XR2X_{R} ^{2} and XL2X_{L} ^{2}, and so we express that information in a chi squared distribution graph as follows:

      Chi - Squared confidence intervals
      Figure 10: The Chi Squared Distribution for this case

      Using the information about the 8 degrees of freedom and the area value of half significance level we use the chi squared table to find the value of both critical values XR2X_{R} ^{2} and XL2X_{L} ^{2}.
      We start with XR2X_{R} ^{2} since the area to its right is simply α2\large \frac{\alpha}{2}:

      Chi - Squared confidence intervals
      Figure 11: The Chi Squared critical value to the right

      Therefore the right critical value is XR2X_{R} ^{2} = 21.955 .
      Now, the area to the right of the left critical value is equal to the confidence level plus half alfa since it includes both the confidence level area and the right tail.
      Area to the right of the left critical value:

      1α+1 - \alpha + α2\large \frac{\alpha}{2} =0.99+0.005=0.995= 0.99+0.005=0.995
      Equation 3: Area to the right of the left critical value

      We use this to obtain XL2X_{L} ^{2}:

      Chi - Squared confidence intervals
      Figure 12: The Chi Squared critical value to the left

      Therefore the right critical value is XL2X_{L} ^{2} = 1.344 .

      Example 3:

      Determining the Confidence Interval for Variance
      Road and racing bicycles have an average wheel diameter of 622mm. From a sample of 15 bicycles it was found that the wheel diameters have a variance of 10mm. With a 90% confidence level give a range where the variance of all road and racing bicycle wheels lie.

      Let us gather all of the information of the problem:

      n\quad n = sample size = 15 bicycles
      s\quad s = sample standard deviation thus s2s^{2} = sample variance = 10 mm
      σ\quad \sigma = population standard deviation thus σ2\sigma^{2} = population variance
      (n1)\quad (n1) = degrees of freedom = 15 - 1 = 14 degrees of freedom.

      Remember that the confidence interval of the variance is given by:

      (n1)s2XR2<  σ2  (n1)s2XL2 \large \frac{(n-1)s^{2}}{X_{R} ^{2}} < \; \sigma^{2} \; \frac{(n-1)s^{2}}{X_{L} ^{2}}
      Equation 4: Formula for the confidence interval of the variance

      And so we need to find the critical values XR2X_{R} ^{2} and XL2X_{L} ^{2}.
      For that we use the 14 degrees of freedom and the confidence level of 90% given.
      Since the confidence level is equal to 1 - α \alpha = 0.90, we know that the significance level is α \alpha = 0.1 then, and α2\large \frac{\alpha}{2} = 0.05.

      We start with XR2X_{R} ^{2} since the area to its right is simply α2\large \frac{\alpha}{2} :

      Chi - Squared confidence intervals
      Figure 13: The Chi Squared critical value to the right

      Therefore the right critical value is XR2X_{R} ^{2} = 23.685 .
      Now, the area to the right of the left critical value is equal to the confidence level plus half alfa since it includes both the confidence level area and the right tail.
      Area to the right of the left critical value:

      1 -α\alpha + α2\large \frac{\alpha}{2} = 0.90 + 0.05 = 0.95
      Equation 5: Area to the right of the left critical value

      We use this to obtain XL2X_{L} ^{2}:

      Chi - Squared confidence intervals
      Figure 14: The Chi Squared critical value to the left

      Therefore the right critical value is XL2X_{L} ^{2} = 6.571 .
      And so finally we have everything we need to calculate the confidence interval for the variance:

      (n1)s2XR2<  σ2  <(n1)s2XL2 \large \frac{(n-1)s^{2}}{X_{R} ^{2}} < \; \sigma^{2} \; < \frac{(n-1)s^{2}}{X_{L} ^{2}}

      Where: (n1)s2XR2=(14)(10)23.685 \large \frac{(n-1)s^{2}}{X_{R} ^{2}} = \frac{(14)(10)}{23.685} \approx 5.9115.911

      And: (n1)s2XL2=(14)(10)6.571 \large \frac{(n-1)s^{2}}{X_{L} ^{2}} = \frac{(14)(10)}{6.571} \approx 21.30621.306

      Therefore:

      5.911<  σ2  <21.306 5.911 < \; \sigma^{2} \; < 21.306
      Equation 6: Confidence interval of the variance


      Example 4:

      Determining the Confidence Interval for Standard Deviation
      A Soda-pop company "Jim's Old Fashion Soda" is designing their bottling machine. After making 41 bottles they find that their bottles have an average of 335mL of liquid with a standard deviation of 3mL. With a 99% confidence level what is the range of standard deviation that this machine will output per bottle?

      Let us gather all of the information of the problem:
      n\quad n = sample size = 41 bottles
      x\quad \overline{x} = sample mean = 335 mL
      s\quad s = sample standard deviation = 3 mL
      s2\quad s^{2} = sample variance = 9 mL2
      σ\quad \sigma = population standard deviation = thus σ2\sigma^{2} = population variance
      (n1)\quad (n-1) = degrees of freedom = 41 - 1 = 40 degrees of freedom.

      With all of this we are asked to find the confidence interval of the standard deviation of the population, but we only have the formula for the confidence interval of the variance which is given by:

      (n1)s2XR2<  σ2  <(n1)s2XL2 \large \frac{(n-1)s^{2}}{X_{R} ^{2}} < \; \sigma^{2} \; < \frac{(n-1)s^{2}}{X_{L} ^{2}}
      Equation 7: Formula for the confidence interval of the variance

      In order to obtain the formula needed, we use the fact that the population standard deviation is the square root of the variance, and so we have that:

      (n1)s2XR2<σ2<(n1)s2XL2  \large \sqrt{\frac{(n-1)s^{2}}{X_{R}^{2}}} \,< \, \sqrt{\sigma^{2}} \,< \, \sqrt{\frac{(n-1)s^{2}}{X_{L}^{2}}} \;   (n1)    sXR  <  σ  <(n1)    sXL \; \frac{\sqrt{(n-1) \; \cdot \; s }}{X_{R}} \; < \; \sigma \; < \frac{\sqrt{(n-1) \; \cdot \; s }}{X_{L}}
      Equation 8: Deriving the confidence interval of the population standard deviation from the formula using variance

      Where we have the degrees of freedom and the sample standard deviation, so we only need to find the critical values XR2X_{R} ^{2} and XL2X_{L} ^{2}.

      In this case we have 40 degrees of freedom and a confidence level of 99%, which is 1 - α\alpha = 0.99 and so the significance level is α\alpha = 0.01 and α2\large \frac{\alpha}{2} = 0.005
      We start with XR2X_{R} ^{2} since the area to its right is simply α2\large \frac{\alpha}{2} :

      Chi - Squared confidence intervals
      Figure 15: The Chi Squared critical value to the right

      Therefore the right critical value is XR2X_{R} ^{2} = 66.766 .
      For the left critical value we add the area for the confidence level plus half the significance level in the following manner:

      1 - α\alpha + α2\large \frac{\alpha}{2} = 0.99 + 0.005 = 0.995
      Equation 9: Area to the right of the left critical value

      We use this to obtain XL2X_{L} ^{2}:

      Chi - Squared confidence intervals
      Figure 16: The Chi Squared critical value to the left

      Therefore the right critical value is XL2X_{L} ^{2} = 20.707.
      And so finally we have everything we need to calculate the confidence interval for the standard deviation:

      (n1)    sXR<  σ  <(n1)    sXL \large \frac{(n-1) \; \cdot \; s}{X_{R}} < \; \sigma \; < \frac{(n-1) \; \cdot \; s }{X_{L} }

      Where: (n1)    sXR=40(3)60.766 \large \frac{(n-1) \; \cdot \; s}{X_{R}} = \frac{\sqrt{40(3)}}{\sqrt{60.766}} \approx 2.3222.322

      Where: (n1)    sXL=40(3)20.707 \large \frac{(n-1) \; \cdot \; s}{X_{L}} = \frac{\sqrt{40(3)}}{\sqrt{20.707}} \approx 4.1694.169

      Therefore:

      2.322<  σ  <4.169 2.322 < \; \sigma \; < 4.169
      Equation 10: Confidence interval of the population standard deviation

      And so, here ends our lesson but this is not the last time we talk about chi square. The question that comes to mind next is: Is chi square a test statistic? The answer lies in our hypothesis testing section of this course, where you will learn how you can perform a chi squared test.

      Before you go, we would like to recommend the following chi square critical value calculator, so you can have a fast way to check your answers when using the methodology we have shown in this lesson.
      To estimate a population variance a Chi-Squared distribution is used,
      • Chi-Squared: X2=(n1)s2σ2X^2=\frac{(n-1)s^2}{\sigma ^2}
      nn: sample size
      ss: sample standard deviation
      σ\sigma: population standard deviation
      (n1)(n-1): is also called "degrees of freedom"
      • Chi-Square table gives critical value area to the right

      The Confidence interval for the variance is given by:
      (n1)s2XR2\frac{(n-1)s^2}{X_R^2} < σ2\sigma ^2 < (n1)s2XL2\frac{(n-1)s^2}{X_L^2}