23  Central Limit Theorem

Example 23.1 In a previous example, we consider a discrete population with a population mean of \(\mu = 72\) and a population standard deviation of \(\sigma = 69.4\). Now we’ll consider several other populations with a population mean of 72 and a population standard deviation of 69.4. We’ll use an applet to simulate the sampling distribution of the sample mean \(\bar{X}\) for samples of size \(n\) from several different populations. Enter 72 in the box for population mean, and 69.4 in the box for population SD.

  1. Regardless of the shape of the population, what will \(\textrm{E}(\bar{X})\) be? How we do interpret this?




  2. Regardless of the shape of the population, what will \(\textrm{SD}(\bar{X})\) be? How do we interpret this?




  3. Choose a super-small sample size, like \(n=2\). Choose a population shape, and then run the simulation to generate many values of the sample mean. Then repeat for different population shapes. When the sample size is small, does the shape of the sample-to-sample distribution of sample means (plot on right) depend on the shape of the population (plot on the left)?




  4. Now increase the sample size, and repeat for the several populations. What happens to the shape of the sample-to-sample distribution of sample means as the sample size increases?




  5. Now choose a not-super-small sample size, like \(n=500\). Choose a population, and simulate a single sample. Look at the simulated sample (the middle plot), then repeat to simulate a few samples. Then repeat for the different populations. Does the distribution of individual values within the sample depend on the shape of the population?




  6. When the sample size is large, does the sample-to-sample distribution of sample means depend on the shape of the population?




Example 23.2 Recall the population where every individual has an income of 10, 70, or 200 with probability 0.4, 0.4, 0.2. Now suppose that every individual has an income of 10, 70, 200, or 2000 with probability 0.4, 0.4, 0.19, 0.01. Simulate sample means for many samples of size \(n=30\) from this population. Is the distribution of sample means approximately Normal? What if \(n=100\)?