What is the sampling distribution when population is not normally distributed?

Before illustrating the use of the Central Limit Theorem (CLT) we will first illustrate the result. In order for the result of the CLT to hold, the sample must be sufficiently large (n > 30). Again, there are two exceptions to this. If the population is normal, then the result holds for samples of any size (i..e, the sampling distribution of the sample means will be approximately normal even for samples of size less than 30).

Central Limit Theorem with a Normal Population

The figure below illustrates a normally distributed characteristic, X, in a population in which the population mean is 75 with a standard deviation of 8.

What is the sampling distribution when population is not normally distributed?

If we take simple random samples (with replacement)

What is the sampling distribution when population is not normally distributed?
of size n=10 from the population and compute the mean for each of the samples, the distribution of sample means should be approximately normal according to the Central Limit Theorem. Note that the sample size (n=10) is less than 30, but the source population is normally distributed, so this is not a problem. The distribution of the sample means is illustrated below. Note that the horizontal axis is different from the previous illustration, and that the range is narrower.

What is the sampling distribution when population is not normally distributed?

The mean of the sample means is 75 and the standard deviation of the sample means is 2.5, with the standard deviation of the sample means computed as follows:

What is the sampling distribution when population is not normally distributed?
What is the sampling distribution when population is not normally distributed?

If we were to take samples of n=5 instead of n=10, we would get a similar distribution, but the variation among the sample means would be larger. In fact, when we did this we got a sample mean = 75 and a sample standard deviation = 3.6.

Central Limit Theorem with a Dichotomous Outcome

Now suppose we measure a characteristic, X, in a population and that this characteristic is dichotomous (e.g., success of a medical procedure: yes or no) with 30% of the population classified as a success (i.e., p=0.30) as shown below.

What is the sampling distribution when population is not normally distributed?

The Central Limit Theorem applies even to binomial populations like this provided that the minimum of np and n(1-p) is at least 5, where "n" refers to the sample size, and "p" is the probability of "success" on any given trial. In this case, we will take samples of n=20 with replacement, so min(np, n(1-p)) = min(20(0.3), 20(0.7)) = min(6, 14) = 6. Therefore, the criterion is met.

We saw previously that the population mean and standard deviation for a binomial distribution are:

Mean binomial probability:

What is the sampling distribution when population is not normally distributed?
What is the sampling distribution when population is not normally distributed?

Standard deviation:

What is the sampling distribution when population is not normally distributed?
What is the sampling distribution when population is not normally distributed?

The distribution of sample means based on samples of size n=20 is shown below.

What is the sampling distribution when population is not normally distributed?

The mean of the sample means is

What is the sampling distribution when population is not normally distributed?
What is the sampling distribution when population is not normally distributed?

and the standard deviation of the sample means is:

What is the sampling distribution when population is not normally distributed?
What is the sampling distribution when population is not normally distributed?

What is the sampling distribution when population is not normally distributed?
What is the sampling distribution when population is not normally distributed?

Now, instead of taking samples of n=20, suppose we take simple random samples (with replacement) of size n=10. Note that in this scenario we do not meet the sample size requirement for the Central Limit Theorem (i.e., min(np, n(1-p)) = min(10(0.3), 10(0.7)) = min(3, 7) = 3).The distribution of sample means based on samples of size n=10 is shown on the right, and you can see that it is not quite normally distributed. The sample size must be larger in order for the distribution to approach normality.

Central Limit Theorem with a Skewed Distribution

The Poisson distribution is another probability model that is useful for modeling discrete variables such as the number of events occurring during a given time interval. For example, suppose you typically receive about 4 spam emails per day, but the number varies from day to day. Today you happened to receive 5 spam emails. What is the probability of that happening, given that the typical rate is 4 per day? The Poisson probability is:

What is the sampling distribution when population is not normally distributed?
What is the sampling distribution when population is not normally distributed?

Mean = μ

Standard deviation =

What is the sampling distribution when population is not normally distributed?
What is the sampling distribution when population is not normally distributed?

The mean for the distribution is μ (the average or typical rate), "X" is the actual number of events that occur ("successes"), and "e" is the constant approximately equal to 2.71828. So, in the example above

What is the sampling distribution when population is not normally distributed?
What is the sampling distribution when population is not normally distributed?

Now let's consider another Poisson distribution. with μ=3 and σ=1.73. The distribution is shown in the figure below.

Can a sampling distribution be found for a population that isn't normal?

The central limit theorem (CLT) is a theorem that gives us a way to turn a non-normal distribution into a normal distribution. It tells us that, even if a population distribution is non-normal, its sampling distribution of the sample mean will be normal for a large number of samples (at least 3 0 30 30).

What happens if data is not normally distributed?

Non-normal distributions may lack symmetry, may have extreme values, or may have a flatter or steeper “dome” than a typical bell. There is nothing inherently wrong with non-normal data; some traits simply do not follow a bell curve. For example, data about coffee and alcohol consumption are rarely bell shaped.

What are the 3 types of sampling distributions?

There are three standard types of sampling distributions in statistics:.
Sampling distribution of mean. The most common type of sampling distribution is the mean. ... .
Sampling distribution of proportion. This sampling distribution focuses on proportions in a population. ... .
T-distribution..

What makes a sampling distribution not normal?

If the population is skewed and sample size small, then the sample mean won't be normal.