An error bar is a simple example of what statisticians refer to as an interval estimate. Instead of estimating a single point value for a parameter, the sample data are used to construct a range of values that is likely to cover the true population parameter with a prespecified probability known as the confidence level.
A confidence interval (C.I.) $[\theta_{\rm lo}, \theta_{\rm up}]$ contains the true value $\theta$ of the population parameter with probability $1-\alpha$ (the confidence level). The interval is defined by lower and upper confidence limits $\theta_{\rm lo}$ and $\theta_{\rm up}$, which are functions of the data. In other words, if C.I.s were calculated for many different samples drawn from the full population, then a fraction $1-\alpha$ of the C.I.s would cover the true population value. These intervals are shown schematically in Fig. 5.1. To be precise, if $F(\hat{\theta}; \theta)$ is the sampling cumulative distribution function of the estimator $\hat{\theta}$, which depends on $\theta$, then
$$P(a \le \hat{\theta} \le b) = F(b; \theta) - F(a; \theta) = 1 - \alpha,$$
and the two inequalities $a \le \hat{\theta}$ and $\hat{\theta} \le b$ can be rearranged to give
$$P(\theta_{\rm lo} \le \theta \le \theta_{\rm up}) = 1 - \alpha$$
for some $\theta_{\rm lo}$ and $\theta_{\rm up}$. In classical (but not
Bayesian) statistics, the true population parameter is considered to
be a fixed constant and not a random variable, hence it is the C.I.s
that randomly overlap the population parameter rather than the
population parameter that falls randomly in the C.I.
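This frequentist coverage property can be checked numerically. The sketch below is a minimal illustration (the population parameters, sample size, and trial count are arbitrary choices, not taken from the text): it draws many samples from a normal population with known standard deviation, builds the standard 95% interval $\bar{x} \pm 1.96\,\sigma/\sqrt{n}$ for each, and counts how often the interval covers the fixed true mean.

```python
import random
from statistics import NormalDist, mean

# Arbitrary illustrative choices: normal population with known sigma.
mu, sigma, n, trials, alpha = 10.0, 2.0, 25, 10_000, 0.05
z = NormalDist().inv_cdf(1 - alpha / 2)   # ~1.96 for a 95% C.I.

rng = random.Random(42)
covered = 0
for _ in range(trials):
    sample = [rng.gauss(mu, sigma) for _ in range(n)]
    xbar = mean(sample)
    half = z * sigma / n ** 0.5           # half-width of the interval
    lo, up = xbar - half, xbar + half     # the confidence limits
    covered += lo <= mu <= up             # does this C.I. cover the truth?

print(f"empirical coverage: {covered / trials:.3f}")  # close to 0.95
```

Note that it is the interval $[\theta_{\rm lo}, \theta_{\rm up}]$ that changes from sample to sample while `mu` stays fixed, exactly as in the classical interpretation above.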
Statisticians most often quote 95% confidence intervals,
which should cover the true value in all but 5% of repeated
samples. For normally distributed sample statistics,
the 95% confidence interval is about twice as wide
as the error bar used by physicists (see example below).
The $\pm 1\sigma$ error bar corresponds to the 68.3% confidence interval for normally distributed sample statistics.
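The correspondence between multiples of $\sigma$ and confidence levels can be read directly off the normal cumulative distribution function. A small check, using Python's standard-library `statistics.NormalDist` (the multipliers 1.00 and 1.96 are the standard ones, not taken from the text):

```python
from statistics import NormalDist

Z = NormalDist()  # standard normal distribution

def coverage(k):
    """Probability that a normal variate lies within k sigma of its mean."""
    return Z.cdf(k) - Z.cdf(-k)

print(f"+/- 1.00 sigma -> {coverage(1.0):.4f}")   # the physicist's error bar
print(f"+/- 1.96 sigma -> {coverage(1.96):.4f}")  # the 95% confidence interval
```

The first line gives 0.6827 and the second 0.9500, so the 95% interval is indeed about twice ($1.96\times$) as wide as the $\pm 1\sigma$ error bar.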
In addition to its more precise probabilistic definition, another advantage of the C.I. over the error bar is that it is easily extended to statistics with skewed sampling distributions, such as the sample variance.
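For small samples the sampling distribution of the sample variance is strongly right-skewed, so its confidence interval is asymmetric about the point estimate, which a symmetric error bar cannot express. The sketch below is illustrative only: it estimates the quantiles of the pivot $(n-1)s^2/\sigma^2$ (chi-square with $n-1$ degrees of freedom for normal data) by Monte Carlo rather than from tables, and all numerical choices are arbitrary.

```python
import random
from statistics import variance

rng = random.Random(0)
n, alpha = 10, 0.05

# Monte Carlo estimate of the pivot distribution Q = (n-1) s^2 / sigma^2,
# which is chi-square with n-1 degrees of freedom for normal samples.
pivots = sorted(
    (n - 1) * variance([rng.gauss(0, 1) for _ in range(n)])
    for _ in range(50_000)
)
q_lo = pivots[int(len(pivots) * alpha / 2)]        # ~2.5% quantile (about 2.70)
q_up = pivots[int(len(pivots) * (1 - alpha / 2))]  # ~97.5% quantile (about 19.0)

# One observed sample (arbitrary illustrative data) and its variance estimate:
data = [rng.gauss(5.0, 2.0) for _ in range(n)]
s2 = variance(data)

# Inverting q_lo <= (n-1) s^2 / sigma^2 <= q_up gives the C.I. for sigma^2:
lo, up = (n - 1) * s2 / q_up, (n - 1) * s2 / q_lo
print(f"s^2 = {s2:.2f}, 95% C.I. = [{lo:.2f}, {up:.2f}]")
print("asymmetric:", up - s2 > s2 - lo)  # prints True: the upper arm is longer
```

Because the pivot quantiles straddle $n-1$ unevenly, the upper arm of the interval is several times longer than the lower arm, whereas a single error bar would force the two arms to be equal.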