next up previous contents
Next: Example 1: Confidence Interval Up: Parameter estimation Previous: Sampling errors   Contents

Confidence intervals

An error bar is a simple example of what statisticians refer to as an interval estimate. Instead of estimating a point value for a parameter, the sample data is used to estimate a range of estimates that are likely to cover the true population parameter with a prespecified probability known as the confidence level.

Figure: Sampling density function of statistic $ t$ showing the $ t_L$ and $ t_U$ lower and upper confidence limits.

A $ 100(1-\alpha)\%$ confidence interval (C.I.) contains the true value of the population parameter $ \theta$ with probability $ 1-\alpha$ (the confidence level). The interval $ [t_L, t_U]$ is defined by lower and upper confidence limits $ t_L$ and $ t_U$, which are functions of the data. In other words, if C.I.s were calculated for many different samples drawn from the full population then a $ (1-\alpha)$ fraction of the C.I.s would cover the true population value. These intervals are shown schematically in Fig. 5.1. To be precise, if $ F_T(t \mid \theta) = \Pr(T(X) \leq t)$ is the sampling cumulative distribution function that depends on $ \theta$ then $ \Pr\{F_T^{-1}(\alpha/2 \mid \theta) \leq T(X) \leq
F_T^{-1}(1-\alpha/2 \mid \theta)\} = 1 - \alpha$ and the two inequalities can be rearranged to give $ \Pr(t_L \leq \theta \leq t_U)
= 1 - \alpha$ for some $ t_L$ and $ t_U$. In classical (but not Bayesian) statistics, the true population parameter is considered to be a fixed constant and not a random variable, hence it is the C.I.s that randomly overlap the population parameter rather than the population parameter that falls randomly in the C.I.

Statisticians most often quote 95% confidence intervals, which should cover the true value in all but 5% of repeated samples. For normally distributed sample statistics, the 95% confidence interval is about twice as wide as the $ \pm 1$ error bar used by physicists (see example below). The $ \pm 1$ error bar corresponds to the 68.3% confidence interval for normally distributed sample statistics. In addition to its more precise probabilistic definition, another advantage of the C.I. over the error bar is that it is easily extended to skewed statistics such as sample variance.



Subsections
next up previous contents
Next: Example 1: Confidence Interval Up: Parameter estimation Previous: Sampling errors   Contents
David Stephenson 2005-09-30