{"id":64,"date":"2021-01-12T22:19:37","date_gmt":"2021-01-12T22:19:37","guid":{"rendered":"https:\/\/textbooks.jaykesler.net\/introstats\/chapter\/using-the-central-limit-theorem\/"},"modified":"2024-03-15T13:16:44","modified_gmt":"2024-03-15T13:16:44","slug":"using-the-central-limit-theorem","status":"publish","type":"chapter","link":"https:\/\/textbooks.jaykesler.net\/introstats\/chapter\/using-the-central-limit-theorem\/","title":{"rendered":"Using the Central Limit Theorem"},"content":{"raw":"<span style=\"display: none;\">\r\n[latexpage]\r\n<\/span>\r\n<div id=\"45c3f6e5-1eab-41c3-91a9-e0c42a78c970\" class=\"chapter-content-module\" data-type=\"page\" data-cnxml-to-html-ver=\"2.1.0\">\r\n<p id=\"delete_me\">Before we begin this chapter, it is important for you to understand when to use the <span id=\"term136\" data-type=\"term\">central limit theorem (CLT)<\/span>. If you are being asked to find the probability of the mean, use the CLT for the mean. If you are being asked to find the probability of a sum or total, use the CLT for sums. This also applies to percentiles for means and sums.<\/p>\r\n\r\n<div id=\"id11030776\" class=\"ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\"><header>\r\n<h3 class=\"os-title\" data-type=\"title\"><span id=\"1\" class=\"os-title-label\" data-type=\"\">NOTE<\/span><\/h3>\r\n<\/header><section>\r\n<div class=\"os-note-body\">\r\n<p id=\"fs-idm90223456\">If you are being asked to find the probability of an <strong>individual<\/strong> value, do <strong>not<\/strong> use the Central Limit Theorem. <strong>Use the distribution of its random variable.<\/strong><\/p>\r\n\r\n<\/div>\r\n<\/section><\/div>\r\n<section id=\"element-733\" data-depth=\"1\">\r\n<h3 data-type=\"title\">Examples of the Central Limit Theorem<\/h3>\r\n<section id=\"fs-idp5962992\" data-depth=\"2\">\r\n<h4 data-type=\"title\">Law of Large Numbers<\/h4>\r\nThe <span id=\"term137\" data-type=\"term\">law of large numbers<\/span> says that if you take samples of larger and larger size from any population, then the mean $\\bar x$ of the sample tends to get closer and closer to <em data-effect=\"italics\">\u03bc<\/em>. From the central limit theorem, we know that as <em data-effect=\"italics\">n<\/em> gets larger and larger, the sample means follow a normal distribution. The larger <em data-effect=\"italics\">n<\/em> gets, the smaller the standard deviation gets. (Remember that the standard deviation for $\\bar X$ is $\\frac{\\sigma}{\\sqrt{n}}$. ) This means that the sample mean $\\bar x$ must be close to the population mean <em data-effect=\"italics\">\u03bc<\/em>. We can say that <em data-effect=\"italics\">\u03bc<\/em> is the value that the sample means approach as <em data-effect=\"italics\">n<\/em> gets larger. The central limit theorem illustrates the law of large numbers.\r\n\r\n<\/section><section id=\"fs-idm1089056\" data-depth=\"2\">\r\n<h4 data-type=\"title\">Central Limit Theorem for the Mean and Sum Examples<\/h4>\r\n<div id=\"element-482\" class=\"ui-has-child-title\" data-type=\"example\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.8<\/span><\/h3>\r\n<\/header><section>\r\n<div class=\"body\">\r\n<p id=\"element-361\">A study involving stress is conducted among the students on a college campus. <strong>The stress scores follow a <span id=\"term138\" data-type=\"term\">uniform distribution<\/span><\/strong> with the lowest stress score equal to one and the highest equal to five. Using a sample of 75 students, find:<\/p>\r\n\r\n<ol id=\"fs-idp18838512\" type=\"a\">\r\n \t<li>The probability that the <strong>mean stress score<\/strong> for the 75 students is less than two.<\/li>\r\n \t<li>The 90<sup>th<\/sup> percentile for the <strong>mean stress score<\/strong> for the 75 students.<\/li>\r\n \t<li>The probability that the <strong>total of the 75 stress scores<\/strong> is less than 200.<\/li>\r\n \t<li>The 90<sup>th<\/sup> percentile for the <strong>total stress score<\/strong> for the 75 students.<\/li>\r\n<\/ol>\r\n<p id=\"element-294\">Let <em data-effect=\"italics\">X<\/em> = one stress score.<\/p>\r\n<p id=\"element-632\">Problems a and b ask you to find a probability or a percentile for a <span id=\"term139\" data-type=\"term\">mean<\/span>. Problems c and d ask you to find a probability or a percentile for a <strong>total or sum<\/strong>. The sample size, <em data-effect=\"italics\">n<\/em>, is equal to 75.<\/p>\r\n<p id=\"element-494\">Since the individual stress scores follow a uniform distribution, <em data-effect=\"italics\">X<\/em> ~ <em data-effect=\"italics\">U<\/em>(1, 5) where <em data-effect=\"italics\">a<\/em> = 1 and <em data-effect=\"italics\">b<\/em> = 5 (In other words, the individual stress scores are <strong>uniformly distributed<\/strong> between 1 and 5. See <a href=\"\/introstats\/part\/continuous-random-variables\/\" target=\"_blank\" rel=\"noopener\">Continuous Random Variables<\/a> for an explanation on the uniform distribution).<\/p>\r\n$\\mu_x = \\frac{a+b}{2}=\\frac{1+5}{2}=3$\r\n\r\n$\\sigma_x = \\sqrt{\\frac{(b-a)^2}{12}}=\\sqrt{\\frac{(5-1)^2}{12}}=1.15$\r\n\r\nFor problems a. and b., let $\\bar X =$ the mean stress score for the 75 students. Then,\r\n$\\bar X \\sim N\\left(3, \\frac{1.15}{\\sqrt{75}} \\right)$\r\n\r\n<\/div>\r\n<div><\/div>\r\nFind $P(\\bar x &lt;2)$. Draw the graph.\r\n<div id=\"element-197\" class=\" unnumbered\" data-type=\"exercise\"><section>\r\n<div id=\"id10355911\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\r\n<div class=\"ui-toggle-wrapper\"><\/div>\r\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\r\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.8<\/span><\/h4>\r\n<div class=\"os-solution-container\">\r\n<ol type=\"a\">\r\n \t<li style=\"list-style-type: none;\">\r\n<ol type=\"a\">\r\n \t<li id=\"element-146\">Find $P(\\bar x &lt;2)$.\r\nDraw the graph.\r\nThe probability that the mean stress score is less than two is about zero.<span id=\"id10559440\" data-type=\"media\" data-alt=\"This is a normal distribution curve over a horizontal axis. The peak of the curve coincides with the point 3 on the horizontal axis. A point, 2, is marked at the left edge of the curve.\"><\/span><span id=\"id10559440\" data-type=\"media\" data-alt=\"This is a normal distribution curve over a horizontal axis. The peak of the curve coincides with the point 3 on the horizontal axis. A point, 2, is marked at the left edge of the curve.\"><img id=\"2\" src=\"\/introstats\/wp-content\/uploads\/sites\/2\/2023\/07\/b5dad0da9c024b3da9b9fef4478aa2a82388f089.jpg\" alt=\"This is a normal distribution curve over a horizontal axis. The peak of the curve coincides with the point 3 on the horizontal axis. A point, 2, is marked at the left edge of the curve.\" width=\"380\" data-media-type=\"image\/jpg\" \/><\/span>\r\n<div id=\"fs-idm147247216\" class=\"os-figure\">\r\n<div class=\"os-caption-container\"><span class=\"os-title-label\">Figure <\/span><span class=\"os-number\">6.4<\/span><\/div>\r\n<\/div>\r\n$P(\\bar x &lt;2)$=\r\n<p id=\"fs-idm29706432\"><code id=\"3\">=NORM.DIST(2,3,1.15\/SQRT(75),TRUE)-NORM.DIST(1,3,1.15\/SQRT(75),TRUE)\r\n<\/code>\r\n=0\r\n<div id=\"note-1\" class=\"ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\" data-element-type=\"Reminder\"><header>\r\n<h3 class=\"os-title\" data-type=\"title\"><span id=\"4\" class=\"os-title-label\" data-type=\"\">Reminder<\/span><\/h3>\r\n<\/header><section>\r\n<div class=\"os-note-body\">\r\n<p id=\"fs-idm90269648\">The smallest stress score is one.<\/p>\r\n\r\n<\/div>\r\n<\/section><\/div><\/li>\r\n \t<li>Find the 90<sup>th<\/sup> percentile for the mean of 75 stress scores. Draw a graph.\r\n<span data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 3 on the horizontal axis. A point, k, is labeled to the right of 3. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(x-bar &lt; k) = 0.90.\"><span data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 3 on the horizontal axis. A point, k, is labeled to the right of 3. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(x-bar &lt; k) = 0.90.\"><img id=\"5\" src=\"\/introstats\/wp-content\/uploads\/sites\/2\/2023\/07\/823d909d6d3687a2742a9bcaef0bd4f912e29b88.jpg\" alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 3 on the horizontal axis. A point, k, is labeled to the right of 3. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(x-bar &lt; k) = 0.90.\" width=\"300\" data-media-type=\"image\/jpg\" data-print-width=\"3in\" \/><\/span><\/span><span id=\"id11181476\" data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 3 on the horizontal axis. A point, k, is labeled to the right of 3. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(x-bar &lt; k) = 0.90.\"> <\/span>\r\n<div class=\"os-caption-container\"><span class=\"os-title-label\">Figure <\/span><span class=\"os-number\">6.5<\/span><\/div>\r\n<p id=\"element-179\">Let <em data-effect=\"italics\">k<\/em> = the 90<sup>th<\/sup> percentile.\r\nFind <em data-effect=\"italics\">k<\/em>, where $P(\\bar x &lt; k)=0.9$<\/p>\r\nThe 90<sup>th<\/sup> percentile for the mean of 75 scores is about 3.2. This tells us that 90% of all the means of 75 stress scores are at most 3.2, and that 10% are at least 3.2.\r\n\r\n<code>=NORM.INV(0.9, 3, 1.15\/SQRT(75))<\/code>\r\n\r\n$k=3.2$<\/li>\r\n \t<li>For problems c and d, let <em data-effect=\"italics\">\u03a3X<\/em> = the sum of the 75 stress scores. Then, $\\sum X \\sim N\\left( 75\\cdot 3, \\sqrt{75}\\cdot 1.15 \\right)$Find <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">\u03a3x<\/em> &lt; 200). Draw the graph.\r\n<span data-type=\"media\" data-alt=\"This is a normal distribution curve over a horizontal axis. The peak of the curve coincides with the point 225 on the horizontal axis. A point, 200, is marked at the left edge of the curve.\"><img id=\"7\" src=\"\/introstats\/wp-content\/uploads\/sites\/2\/2023\/07\/3bcb7bce3ce73c09aa0a4ff2c4204407674c6e55.jpg\" alt=\"This is a normal distribution curve over a horizontal axis. The peak of the curve coincides with the point 225 on the horizontal axis. A point, 200, is marked at the left edge of the curve.\" width=\"380\" data-media-type=\"image\/jpg\" \/><\/span><span id=\"id10985583\" data-type=\"media\" data-alt=\"This is a normal distribution curve over a horizontal axis. The peak of the curve coincides with the point 225 on the horizontal axis. A point, 200, is marked at the left edge of the curve.\"> <\/span>\r\n<div class=\"os-caption-container\"><span class=\"os-title-label\">Figure <\/span><span class=\"os-number\"><span class=\"os-number\">6.6\r\n<\/span><\/span><\/div>\r\n<p id=\"fs-idp52789136\">The mean of the sum of 75 stress scores is (75)(3) = 225<\/p>\r\nThe standard deviation of the sum of 75 stress scores is $\\sqrt{75}\\cdot 1.15 = 9.96$\r\n\r\n<code>=NORM.DIST(200, 75*3, SQRT(75)*1.15, TRUE)-NORM.DIST(75, 75*3, SQRT(75)*1.15)\r\n<\/code>\r\n<div id=\"element-888\" class=\" unnumbered\" data-type=\"exercise\"><section>\r\n<div id=\"id11181711\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\"><section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\r\n<div class=\"os-solution-container\">\r\n<div id=\"note-2\" class=\"ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\" data-element-type=\"Reminder\"><header>\r\n<h3 class=\"os-title\" data-type=\"title\"><span id=\"9\" class=\"os-title-label\" data-type=\"\">Reminder<\/span><\/h3>\r\n<\/header><section>\r\n<div class=\"os-note-body\">\r\n<p id=\"fs-idm105410608\">The smallest total of 75 stress scores is 75, because the smallest single score is one.<\/p>\r\n\r\n<\/div>\r\n<\/section><\/div>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n$P(\\sum x &lt; 200) = 0$<\/li>\r\n \t<li>Find the 90<sup>th<\/sup> percentile for the total of 75 stress scores. Draw a graph.\r\n<p id=\"element-308\">Let <em data-effect=\"italics\">k<\/em> = the 90<sup>th<\/sup> percentile.<\/p>\r\nFind <em data-effect=\"italics\">k<\/em> where <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">\u03a3x<\/em> &lt; <em data-effect=\"italics\">k<\/em>) = 0.90.<span data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 225 on the horizontal axis. A point, k, is labeled to the right of 225. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(sum of x &lt; k) = 0.90.\"><img id=\"10\" src=\"\/introstats\/wp-content\/uploads\/sites\/2\/2023\/07\/34f66e086d52d6f7bcd9843020078ccfabe3d92c.jpg\" alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 225 on the horizontal axis. A point, k, is labeled to the right of 225. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(sum of x &lt; k) = 0.90.\" width=\"380\" data-media-type=\"image\/jpg\" \/><\/span><span id=\"id10985827\" data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 225 on the horizontal axis. A point, k, is labeled to the right of 225. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(sum of x &lt; k) = 0.90.\"> <\/span>\r\n<div class=\"os-caption-container\"><span class=\"os-title-label\">Figure <\/span><span class=\"os-number\">6.7<\/span><\/div>\r\n<p id=\"element-780\">=NORM.INV(0.9, 75*3, SQRT(75)*1.15)<\/p>\r\n$k = 237.8$<\/li>\r\n<\/ol>\r\n<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<\/section><\/section><\/div>\r\n<div id=\"element-681\" class=\" unnumbered\" data-type=\"exercise\"><section>\r\n<div id=\"id10985728\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\"><\/div>\r\n<\/section><\/div>\r\n<div id=\"fs-idm56169616\" class=\"statistics try ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Try It <\/span><span class=\"os-number\">6.8<\/span><\/h3>\r\n<\/header><section>\r\n<div id=\"fs-idm127013136\" class=\" unnumbered\" data-type=\"exercise\"><header><\/header><section>\r\n<div id=\"fs-idm53684704\" data-type=\"problem\">\r\n<div class=\"os-problem-container \">\r\n<p id=\"fs-idm54565472\">Use the information in <a class=\"autogenerated-content\" href=\"#element-482\">Example 6.8<\/a>, but use a sample size of 55 to answer the following questions.<\/p>\r\n\r\n<ol id=\"fs-idm68188624\" type=\"a\">\r\n \t<li>Find <em>$P(\\bar x&lt;7)$.<\/em><\/li>\r\n \t<li>Find <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">\u03a3x<\/em> &gt; 170).<\/li>\r\n \t<li>Find the 80<sup>th<\/sup> percentile for the mean of 55 scores.<\/li>\r\n \t<li>Find the 85<sup>th<\/sup> percentile for the sum of 55 scores.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<div id=\"example2\" class=\"ui-has-child-title\" data-type=\"example\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.9<\/span><\/h3>\r\n<\/header><section>\r\n<div class=\"body\">\r\n<p id=\"eip-id1171180958925\">Suppose that a market research analyst for a cell phone company conducts a study of their customers who exceed the time allowance included on their basic cell phone contract; the analyst finds that for those people who exceed the time included in their basic contract, the <strong>excess time used<\/strong> follows an <span id=\"term140\" data-type=\"term\">exponential distribution<\/span> with a mean of 22 minutes. (Note: you do not need to know anything about the exponential distribution to solve this problem, but if you are curious, you can read about it at <a href=\"https:\/\/en.wikipedia.org\/wiki\/Exponential_distribution\" target=\"_blank\" rel=\"noopener noreferrer\">Wikipedia<\/a>).<\/p>\r\n<p id=\"eip-id1171181431405\">Consider a random sample of 80 customers who exceed the time allowance included in their basic cell phone contract.<\/p>\r\n<p id=\"eip-id1171181123857\">Let <em data-effect=\"italics\">X<\/em> = the excess time used by one INDIVIDUAL cell phone customer who exceeds his contracted time allowance.<\/p>\r\n<p id=\"eip-id1171182604485\"><em data-effect=\"italics\">$X\\sim Exp\\left( \\frac{1}{22} \\right)$. <\/em>From previous chapters, we know that <em data-effect=\"italics\">\u03bc<\/em> = 22 and <em data-effect=\"italics\">\u03c3<\/em> = 22.<\/p>\r\nLet $\\bar X=$ the mean excess time used by a sample of <em data-effect=\"italics\">n<\/em> = 80 customers who exceed their contracted time allowance.\r\n\r\n$\\bar X \\sim N\\left( 22, \\frac{22}{\\sqrt{80}} \\right)$ by the central limit theorem for sample means\r\n<div id=\"eip-id8136792\" class=\" unnumbered\" data-type=\"exercise\"><header><\/header><section>\r\n<div id=\"eip-id8760885\" data-type=\"problem\">\r\n<div class=\"os-problem-container \">\r\n<div id=\"eip-id1171180795181\" data-type=\"list\">\r\n<div id=\"13\" data-type=\"title\"><strong>Using the CLT to find probability<\/strong><\/div>\r\n<ol type=\"a\">\r\n \t<li>Find the probability that the mean excess time used by the 80 customers in the sample is longer than 20 minutes. This is asking us to find $P(\\bar x &gt; 20)$<em data-effect=\"italics\">.<\/em> Draw the graph.<\/li>\r\n \t<li>Suppose that one customer who exceeds the time limit for his cell phone contract is randomly selected. Find the probability that this individual customer's excess time is longer than 20 minutes. This is asking us to find <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">x<\/em> &gt; 20).<\/li>\r\n \t<li>Explain why the probabilities in parts a and b are different.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<\/div>\r\n<div id=\"eip-id1171181279568\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\r\n<div class=\"ui-toggle-wrapper\"><\/div>\r\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\r\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.9<\/span><\/h4>\r\n<div class=\"os-solution-container\">\r\n<p id=\"fs-idm154380928\">Find: $P(\\bar x&gt;20)<\/p>\r\n$P(\\bar x &gt; 20) = 0.7919$ using\r\n<code>=1-NORM.DIST(20,22, 22\/SQRT(80), TRUE)<\/code>\r\n\r\nThe probability is 0.7919 that the mean excess time used is more than 20 minutes, for a sample of 80 customers who exceed their contracted time allowance.\r\n<div id=\"fs-idm52706528\" class=\"os-figure\">\r\n<figure data-id=\"fs-idm52706528\"><span id=\"id1164895099196\" data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 22 on the horizontal axis. A point, 20, is labeled to the left of 22. A vertical line extends from 20 to the curve. The area under the curve to the right of k is shaded. The shaded area shows that P(x-bar &gt; 20).\">\r\n<img id=\"15\" src=\"\/introstats\/wp-content\/uploads\/sites\/2\/2023\/07\/832dc3bab2cd88641ded60380533d6e556f97ba7.jpg\" alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 22 on the horizontal axis. A point, 20, is labeled to the left of 22. A vertical line extends from 20 to the curve. The area under the curve to the right of k is shaded. The shaded area shows that P(x-bar &gt; 20).\" width=\"380\" data-media-type=\"image\/jpg\" \/><\/span><\/figure>\r\n<div class=\"os-caption-container\"><span class=\"os-title-label\">Figure <\/span><span class=\"os-number\">6.8<\/span><\/div>\r\n<\/div>\r\n<p id=\"element-260\">Find<em data-effect=\"italics\">P<\/em>(x &gt; 20). Remember to use the exponential distribution for an <strong>individual:<\/strong> $X\\sim Exp\\left( \\frac{1}{22} \\right)$<\/p>\r\n$P(x&gt;20) = e^{-\\frac{1}{22}\\cdot 20} = 0.4029$\r\n<ol>\r\n \t<li style=\"list-style-type: none;\">\r\n<ol id=\"eip-id1171181135515\" type=\"1\">\r\n \t<li><em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">x<\/em> &gt; 20) = 0.4029 but <em data-effect=\"italics\">$P(\\bar x &gt; 20)=0.7917$\r\n<\/em><\/li>\r\n \t<li>The probabilities are not equal because we use different distributions to calculate the probability for individuals and for means.<\/li>\r\n \t<li>When asked to find the probability of an individual value, use the stated distribution of its random variable; do not use the . Use the CLT with the normal distribution when you are being asked to find the probability for a mean.<\/li>\r\n<\/ol>\r\n<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<div id=\"eip-id1171180910907\" class=\" unnumbered\" data-type=\"exercise\"><header><\/header><section>\r\n<div id=\"eip-id1171181151372\" data-type=\"problem\">\r\n<div class=\"os-problem-container \">\r\n<h4 data-type=\"title\"><strong>Using the CLT to find percentiles<\/strong><\/h4>\r\nFind the 95<sup>th<\/sup> percentile for the <strong>sample mean excess time<\/strong> for samples of 80 customers who exceed their basic contract time allowances. Draw a graph.\r\n\r\n<\/div>\r\n<\/div>\r\n<div id=\"eip-id1171181126592\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\r\n<div class=\"ui-toggle-wrapper\"><\/div>\r\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\r\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.9<\/span><\/h4>\r\n<div class=\"os-solution-container\">\r\n<p id=\"eip-id1171181699771\">Let <em data-effect=\"italics\">k<\/em> = the 95<sup>th<\/sup> percentile. Find <em data-effect=\"italics\">k<\/em> where$P(\\bar x &lt;k)=0.95$<\/p>\r\n<p id=\"eip-id1171180867272\" class=\"finger\"><em data-effect=\"italics\">k<\/em> = 26.0 using<\/p>\r\n<p class=\"finger\"><code>=NORM.INV(0.95, 22, 22\/SQRT(80))<\/code><code id=\"18\"><\/code><\/p>\r\n\r\n<div id=\"fs-idm61807792\" class=\"os-figure\">\r\n<figure data-id=\"fs-idm61807792\"><span id=\"id1164881078203\" data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 22 on the horizontal axis. A point, k, is labeled to the right of 22. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(x-bar &lt; k) = 0.95.\">\r\n<img id=\"19\" src=\"https:\/\/openstax.org\/resources\/2bf126a04004910fb6ccb9dbfa68769e790e1a60\" alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 22 on the horizontal axis. A point, k, is labeled to the right of 22. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(x-bar &lt; k) = 0.95.\" width=\"380\" data-media-type=\"image\/jpg\" \/>\r\n<\/span><\/figure>\r\n<div class=\"os-caption-container\"><span class=\"os-title-label\">Figure <\/span><span class=\"os-number\">6.9<\/span><\/div>\r\n<\/div>\r\n<p id=\"element-987\">The 95<sup>th<\/sup> percentile for the <strong>sample mean excess time used<\/strong> is about 26.0 minutes for random samples of 80 customers who exceed their contractual allowed time.<\/p>\r\n<p id=\"eip-id1171181013130\">Ninety five percent of such samples would have means under 26 minutes; only five percent of such samples would have means above 26 minutes.<\/p>\r\n\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<\/div>\r\n<\/section><\/div>\r\n<div id=\"fs-idm50063488\" class=\"statistics try ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Try It <\/span><span class=\"os-number\">6.9<\/span><\/h3>\r\n<\/header><section>\r\n<div id=\"fs-idm113638352\" class=\" unnumbered\" data-type=\"exercise\"><header><\/header><section>\r\n<div id=\"fs-idm60469088\" data-type=\"problem\">\r\n<div class=\"os-problem-container \">\r\n<p id=\"eip-823\">Use the information in <a class=\"autogenerated-content\" href=\"#example2\">Example 6.9<\/a>, but change the sample size to 144.<\/p>\r\n\r\n<ol id=\"fs-idm72056816\" type=\"a\">\r\n \t<li>Find <em data-effect=\"italics\">$P(20&lt;\\bar x &lt; 30)$.\r\n<\/em><\/li>\r\n \t<li>Find <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">\u03a3x<\/em> is at least 3,000).<\/li>\r\n \t<li>Find the 75<sup>th<\/sup> percentile for the sample mean excess time of 144 customers.<\/li>\r\n \t<li>Find the 85<sup>th<\/sup> percentile for the sum of 144 excess times used by customers.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<div id=\"fs-idm153949776\" class=\"ui-has-child-title\" data-type=\"example\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.10<\/span><\/h3>\r\n<\/header><section>\r\n<div class=\"body\">\r\n<p id=\"fs-idp5363584\">In the United States, someone is sexually assaulted every two minutes, on average, according to a number of studies. Suppose the standard deviation is 0.5 minutes and the sample size is 100.<\/p>\r\n\r\n<div id=\"fs-idm67039584\" class=\" unnumbered\" data-type=\"exercise\"><header><\/header><section>\r\n<div id=\"fs-idm9526400\" data-type=\"problem\">\r\n<div class=\"os-problem-container \">\r\n<ol id=\"fs-idm184397680\" type=\"a\">\r\n \t<li>Find the median, the first quartile, and the third quartile for the sample mean time of sexual assaults in the United States.<\/li>\r\n \t<li>Find the median, the first quartile, and the third quartile for the sum of sample times of sexual assaults in the United States.<\/li>\r\n \t<li>Find the probability that a sexual assault occurs on the average between 1.75 and 1.85 minutes.<\/li>\r\n \t<li>Find the value that is two standard deviations above the sample mean.<\/li>\r\n \t<li>Find the <em data-effect=\"italics\">IQR<\/em> for the sum of the sample times.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm110029216\" class=\"finger\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\r\n<div class=\"ui-toggle-wrapper\"><\/div>\r\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\r\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.10<\/span><\/h4>\r\n<div class=\"os-solution-container\">\r\n<ol id=\"fs-idm113256736\" type=\"a\">\r\n \t<li>We have, <em data-effect=\"italics\">\u03bc<sub>x<\/sub><\/em> = <em data-effect=\"italics\">\u03bc<\/em> = 2 and <em data-effect=\"italics\">\u03c3<sub>x<\/sub><\/em> =$\\frac{\\sigma}{\\sqrt{n}}=\\frac{0.5}{10}=0.05$ Therefore:\r\n<ol id=\"fs-idm103882928\" type=\"1\">\r\n \t<li>50<sup>th<\/sup> percentile = <em data-effect=\"italics\">\u03bc<sub>x<\/sub><\/em> = <em data-effect=\"italics\">\u03bc<\/em> = 2<\/li>\r\n \t<li>25<sup>th<\/sup> percentile <code>= <em data-effect=\"italics\">NORM.INV<\/em>(0.25,2,0.05)<\/code> = 1.97<\/li>\r\n \t<li>75<sup>th<\/sup> percentile <code>= <em data-effect=\"italics\">NORM.INV<\/em>(0.75,2,0.05)<\/code> = 2.03<\/li>\r\n<\/ol>\r\n<\/li>\r\n \t<li>We have <em data-effect=\"italics\">\u03bc<sub>\u03a3x<\/sub><\/em> = <em data-effect=\"italics\">n<\/em>(<em data-effect=\"italics\">\u03bc<sub>x<\/sub><\/em>) = 100(2) = 200 and <em data-effect=\"italics\">\u03c3<sub>\u03bcx<\/sub><\/em> =$\\sqrt{n}(\\sigma_x)$= 10(0.5) = 5. Therefore\r\n<ol id=\"fs-idm36416432\" type=\"1\">\r\n \t<li>50<sup>th<\/sup> percentile = <em data-effect=\"italics\">\u03bc<sub>\u03a3x<\/sub><\/em> = <em data-effect=\"italics\">n<\/em>(<em data-effect=\"italics\">\u03bc<sub>x<\/sub><\/em>) = 100(2) = 200<\/li>\r\n \t<li>25<sup>th<\/sup> percentile <code>= NORM.INV(0.25,200,5)<\/code> = 196.63<\/li>\r\n \t<li>75<sup>th<\/sup> percentile <code>= NORM.INV(0.75,200,5)<\/code> = 203.37<\/li>\r\n<\/ol>\r\n<\/li>\r\n \t<li><em data-effect=\"italics\">P<\/em>(1.75 &lt;$\\bar x$&lt; 1.85) <code>= NORM.DIST(1.75,1.85,2,0.05,TRUE)<\/code> = 0.0013<\/li>\r\n \t<li>Using the <em data-effect=\"italics\">z<\/em>-score equation, $z=\\frac{\\bar x - \\mu_{\\bar x}}{\\sigma_{\\bar x}}$, and solving for <em data-effect=\"italics\">x<\/em>, we have <em data-effect=\"italics\">x<\/em> = 2(0.05) + 2 = 2.1<\/li>\r\n \t<li>The <em data-effect=\"italics\">IQR<\/em> is 75<sup>th<\/sup> percentile \u2013 25<sup>th<\/sup> percentile = 203.37 \u2013 196.63 = 6.74<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<\/div>\r\n<\/section><\/div>\r\n<div id=\"fs-idm67893584\" class=\"statistics try ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Try It <\/span><span class=\"os-number\">6.10<\/span><\/h3>\r\n<\/header><section>\r\n<div id=\"fs-idm59759104\" class=\" unnumbered\" data-type=\"exercise\"><header><\/header><section>\r\n<div id=\"fs-idm113132656\" data-type=\"problem\">\r\n<div class=\"os-problem-container \">\r\n<p id=\"fs-idm69144496\">Based on data from the National Health Survey, women between the ages of 18 and 24 have an average systolic blood pressures (in mm Hg) of 114.8 with a standard deviation of 13.1. Systolic blood pressure for women between the ages of 18 to 24 follow a normal distribution.<\/p>\r\n\r\n<ol id=\"fs-idm115059504\" type=\"a\">\r\n \t<li>If one woman from this population is randomly selected, find the probability that her systolic blood pressure is greater than 120.<\/li>\r\n \t<li>If 40 women from this population are randomly selected, find the probability that their mean systolic blood pressure is greater than 120.<\/li>\r\n \t<li>If the sample were four women between the ages of 18 to 24 and we did not know the original distribution, could the central limit theorem be used?<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<div id=\"fs-idm112138992\" class=\"ui-has-child-title\" data-type=\"example\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.11<\/span><\/h3>\r\n<\/header><section>\r\n<div class=\"body\">\r\n<div id=\"fs-idm108732608\" class=\" unnumbered\" data-type=\"exercise\"><header><\/header><section>\r\n<div id=\"fs-idm110483328\" data-type=\"problem\">\r\n<div class=\"os-problem-container \">\r\n<p id=\"fs-idm753456\">A study was done about violence against prostitutes and the symptoms of the posttraumatic stress that they developed. The age range of the prostitutes was 14 to 61. The mean age was 30.9 years with a standard deviation of nine years.<\/p>\r\n\r\n<ol id=\"fs-idm45985568\" type=\"a\">\r\n \t<li>In a sample of 25 prostitutes, what is the probability that the mean age of the prostitutes is less than 35?<\/li>\r\n \t<li>Is it likely that the mean age of the sample group could be more than 50 years? Interpret the results.<\/li>\r\n \t<li>In a sample of 49 prostitutes, what is the probability that the sum of the ages is no less than 1,600?<\/li>\r\n \t<li>Is it likely that the sum of the ages of the 49 prostitutes is at most 1,595? Interpret the results.<\/li>\r\n \t<li>Find the 95<sup>th<\/sup> percentile for the sample mean age of 65 prostitutes. Interpret the results.<\/li>\r\n \t<li>Find the 90<sup>th<\/sup> percentile for the sum of the ages of 65 prostitutes. Interpret the results.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idm138994048\" class=\"finger\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\r\n<div class=\"ui-toggle-wrapper\"><\/div>\r\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\r\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.11<\/span><\/h4>\r\n<div class=\"os-solution-container\">\r\n<ol id=\"fs-idm86323200\" type=\"a\">\r\n \t<li>The standard deviation for the mean of a sample of 25 is $\\sigma_{\\bar x} = \\frac{\\sigma}{\\sqrt{n}}\\frac{9}{\\sqrt{25}}=1.8$<em data-effect=\"italics\">\r\nP<\/em>($\\bar x$ &lt; 35) <code>= NORM.DIST(35,30.9,1.8,TRUE)<\/code> = 0.9886<\/li>\r\n \t<li><em data-effect=\"italics\">P<\/em>($\\bar x$&gt; 50) = <code>1-NORM.DIST(50,30.9,1.8,TRUE)<\/code> \u2248 0. For this sample group, it is almost impossible for the group\u2019s average age to be more than 50. However, it is still possible for an individual in this group to have an age greater than 50.<\/li>\r\n \t<li>The mean for the sum of ages of 49 people is $\\mu_{\\sum x} = n\\cdot \\mu = 49\\cdot 30.9 = 1514.1$\r\nThe standard deviation for the sum of ages of 49 people is $\\sigma_{\\sum x} = n\\cdot \\frac{\\sigma}{\\sqrt{n}}=49\\cdot \\frac{9}{\\sqrt{49}}=63$<em data-effect=\"italics\">\r\nP<\/em>(<em data-effect=\"italics\">\u03a3x<\/em> \u2265 1,600) <code>= 1-NORM.DIST(1600<\/code>,1514.10,63,TRUE) = 0.0864<\/li>\r\n \t<li><em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">\u03a3x<\/em> \u2264 1,595) <code>= NORM.DIST(1595,1514.10,63,TRUE)<\/code> = 0.9005. This means that there is a 90% chance that the sum of the ages for the sample group <em data-effect=\"italics\">n<\/em> = 49 is at most 1595.<\/li>\r\n \t<li>The standard deviation for the mean of a sample of 65 is $\\sigma_{\\bar x} = \\frac{\\sigma}{\\sqrt{n}}\\frac{9}{\\sqrt{65}}=1.1$\r\nThe 95th percentile <code>= NORM.INV(0.95,30.9,1.1)<\/code> = 32.7. This indicates that 95% of the prostitutes in the sample of 65 are younger than 32.7 years, on average.<\/li>\r\n \t<li>The mean for the sum of ages of 65 people is $\\mu_{\\sum x} = n\\cdot \\mu = 65\\cdot 30.9 = 2008.5$\r\nThe standard deviation for the sum of ages of 65 people is $\\sigma_{\\sum x} = n\\cdot \\frac{\\sigma}{\\sqrt{n}}=65\\cdot \\frac{9}{\\sqrt{65}}=72.56$\r\nThe 90th percentile <code>= NORM.INV(0.90,2008.5,72.56)<\/code> = 2101.5. This indicates that 90% of the prostitutes in the sample of 65 have a sum of ages less than 2,101.5 years.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<\/div>\r\n<\/section><\/div>\r\n<div id=\"fs-idm102859984\" class=\"statistics try ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Try It <\/span><span class=\"os-number\">6.11<\/span><\/h3>\r\n<\/header><section>\r\n<div id=\"fs-idm123966224\" class=\" unnumbered\" data-type=\"exercise\"><header><\/header><section>\r\n<div id=\"fs-idm146706672\" data-type=\"problem\">\r\n<div class=\"os-problem-container \">\r\n<p id=\"fs-idm107433712\">According to Boeing data, the 757 airliner carries 200 passengers and has doors with a height of 72 inches. Assume for a certain population of men we have a mean height of 69.0 inches and a standard deviation of 2.8 inches.<\/p>\r\n\r\n<ol id=\"fs-idm131956368\" type=\"a\">\r\n \t<li>What doorway height would allow 95% of men to enter the aircraft without bending?<\/li>\r\n \t<li>Assume that half of the 200 passengers are men. What mean doorway height satisfies the condition that there is a 0.95 probability that this height is greater than the mean height of 100 men?<\/li>\r\n \t<li>For engineers designing the 757, which result is more relevant: the height from part a or part b? Why?<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<div id=\"eip-713\" class=\"ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\"><header>\r\n<h3 class=\"os-title\" data-type=\"title\"><span id=\"30\" class=\"os-title-label\" data-type=\"\">HISTORICAL NOTE<\/span><\/h3>\r\n<\/header><section>\r\n<div class=\"os-note-body\">\r\n<p id=\"fs-idm36895584\"><strong>: Normal Approximation to the Binomial<\/strong><\/p>\r\n<p id=\"eip-614\">Historically, being able to compute binomial probabilities was one of the most important applications of the central limit theorem. Binomial probabilities with a small value for <em data-effect=\"italics\">n<\/em>(say, 20) were displayed in a table in a book. To calculate the probabilities with large values of <em data-effect=\"italics\">n<\/em>, you had to use the binomial formula, which could be very complicated. Using the <span id=\"term141\" data-type=\"term\">normal approximation to the binomial<\/span> distribution simplified the process. To compute the normal approximation to the binomial distribution, take a simple random sample from a population. You must meet the conditions for a <span id=\"term142\" data-type=\"term\">binomial distribution<\/span>:<\/p>\r\n\r\n<ul id=\"norm_bi_approx\" data-bullet-style=\"bullet\">\r\n \t<li>there are a certain number <em data-effect=\"italics\">n<\/em> of independent trials<\/li>\r\n \t<li>the outcomes of any trial are success or failure<\/li>\r\n \t<li>each trial has the same probability of a success <em data-effect=\"italics\">p<\/em><\/li>\r\n<\/ul>\r\nRecall that if <em data-effect=\"italics\">X<\/em> is the binomial random variable, then <em data-effect=\"italics\">X<\/em> ~ <em data-effect=\"italics\">B<\/em>(<em data-effect=\"italics\">n, p<\/em>). The shape of the binomial distribution needs to be\r\nsimilar to the shape of the normal distribution. To ensure this, the quantities <em data-effect=\"italics\">np<\/em>\r\nand <em data-effect=\"italics\">nq<\/em> must both be greater than five (<em data-effect=\"italics\">np<\/em> &gt; 5 and <em data-effect=\"italics\">nq<\/em> &gt; 5; the approximation is better if they are both greater than or equal to 10). Then the binomial can be approximated by the normal distribution with mean <em data-effect=\"italics\">\u03bc<\/em> = <em data-effect=\"italics\">np<\/em> and standard deviation $\\sigma = \\sqrt{npq}$. Remember that <em data-effect=\"italics\">q<\/em> = 1 \u2013 <em data-effect=\"italics\">p<\/em>. In order to get the best approximation, add 0.5 to <em data-effect=\"italics\">x<\/em> or subtract 0.5 from <em data-effect=\"italics\">x<\/em> (use <em data-effect=\"italics\">x<\/em> + 0.5 or <em data-effect=\"italics\">x<\/em> \u2013 0.5). The number 0.5 is called the <span id=\"term143\" data-type=\"term\">continuity correction factor<\/span> and is used in the following example.\r\n\r\n<\/div>\r\n<\/section><\/div>\r\n<div id=\"eip-831\" class=\"ui-has-child-title\" data-type=\"example\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.12<\/span><\/h3>\r\n<\/header><section>\r\n<div class=\"body\">\r\n<p id=\"eip-926\">Suppose in a local Kindergarten through 12<sup>th<\/sup> grade (K - 12) school district, 53 percent of the population favor a charter school for grades K through 5. A simple random sample of 300 is surveyed.<\/p>\r\n\r\n<ol id=\"eip-306\" type=\"a\">\r\n \t<li>Find the probability that <strong>at least 150<\/strong> favor a charter school.<\/li>\r\n \t<li>Find the probability that <strong>at most 160<\/strong> favor a charter school.<\/li>\r\n \t<li>Find the probability that <strong>more than 155<\/strong> favor a charter school.<\/li>\r\n \t<li>Find the probability that <strong>fewer than 147<\/strong> favor a charter school.<\/li>\r\n \t<li>Find the probability that <strong>exactly 175<\/strong> favor a charter school.<\/li>\r\n<\/ol>\r\nLet <em data-effect=\"italics\">X<\/em> = the number that favor a charter school for grades K trough 5. <em data-effect=\"italics\">X<\/em> ~ <em data-effect=\"italics\">B<\/em>(<em data-effect=\"italics\">n, p<\/em>) where <em data-effect=\"italics\">n<\/em> = 300 and <em data-effect=\"italics\">p<\/em> = 0.53. Since <em data-effect=\"italics\">np<\/em> &gt; 5 and <em data-effect=\"italics\">nq<\/em> &gt; 5, use the normal approximation to the binomial. The formulas for the mean and standard deviation are <em data-effect=\"italics\">\u03bc<\/em> = <em data-effect=\"italics\">np<\/em> and $\\sigma = \\sqrt{npq}$. The mean is 159 and the standard deviation is 8.6447. The random variable for the normal distribution is <em data-effect=\"italics\">Y<\/em>. <em data-effect=\"italics\">Y<\/em> ~ <em data-effect=\"italics\">N<\/em>(159, 8.6447).\r\n<p id=\"eip-214\" class=\"finger\">For part a, you <strong>include 150<\/strong> so <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> \u2265 150) has normal approximation <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">Y<\/em> \u2265 149.5) = 0.8641.<\/p>\r\n<p id=\"eip-554\"><code id=\"31\">1-NORM.DIST<\/code>(149.5,159,8.6447,TRUE) = 0.8641.<\/p>\r\n<p id=\"eip-323\">For part b, you <strong>include 160<\/strong> so <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> \u2264 160) has normal appraximation <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">Y<\/em> \u2264 160.5) = 0.5689.<\/p>\r\n<p id=\"eip-162\"><code id=\"31\">NORM.DIST<\/code>(160.5,159,8.6447,TRUE) = 0.5689<\/p>\r\n<p id=\"eip-630\">For part c, you <strong>exclude 155<\/strong> so <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> &gt; 155) has normal approximation <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">y<\/em> &gt; 155.5) = 0.6572.<\/p>\r\n<p id=\"eip-763\"><code id=\"31\">1-NORM.DIST<\/code>(155.5,159,8.6447,TRUE) = 0.6572.<\/p>\r\n<p id=\"eip-955\">For part d, you <strong>exclude 147<\/strong> so <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> &lt; 147) has normal approximation <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">Y<\/em> &lt; 146.5) = 0.0741.<\/p>\r\n<p id=\"eip-555\"><code id=\"31\">NORM.DIST<\/code>(146.5,159,8.6447,TRUE) = 0.0741<\/p>\r\n<p id=\"eip-788\">For part e,<em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> = 175) has normal approximation <em data-effect=\"italics\">P<\/em>(174.5 &lt; <em data-effect=\"italics\">Y<\/em> &lt; 175.5) = 0.0083.<\/p>\r\n<p id=\"eip-680\"><code id=\"35\"><code id=\"31\">NORM.DIST<\/code><\/code>(175.5,159,8.6447,TRUE)- <code id=\"35\"><code id=\"31\">NORM.DIST<\/code><\/code>(174.5,159,8.6447,TRUE)= 0.0083<\/p>\r\n<p id=\"eip-863\" class=\"finger\"><strong>Because of calculators and computer software<\/strong> that let you calculate binomial probabilities for large values of <em data-effect=\"italics\">n<\/em> easily, it is not necessary to use the the normal approximation to the binomial distribution, provided that you have access to these technology tools. In a spreadsheet application (like Google Sheets), you can use the <a href=\"https:\/\/support.google.com\/docs\/answer\/3093987?hl=en\" target=\"_blank\" rel=\"noopener noreferrer\">BINOM.DIST()<\/a> function to calculate binomial distribution probabilities.<\/p>\r\n<p id=\"eip-251\">For <a class=\"autogenerated-content\" href=\"#fs-idm153949776\">Example 6.10<\/a>, the probabilities are calculated using the following binomial distribution: (<em data-effect=\"italics\">n<\/em> = 300 and <em data-effect=\"italics\">p<\/em> = 0.53). Compare the binomial and normal distribution answers.<\/p>\r\n<p id=\"eip-67\"><em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> \u2265 150) :<code id=\"36\">1 - BINOM.DIST<\/code>(149, 300,0.53,TRUE) = 0.8641<\/p>\r\n<p id=\"eip-560\"><em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> \u2264 160) :<code id=\"36\">B INOM.DIST<\/code>(160,300,0.53,TRUE) = 0.5684<\/p>\r\n<p id=\"eip-727\"><em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> &gt; 155) :<code id=\"38\">1 - <\/code><code id=\"36\">BINOM.DIST<\/code>(155,300,0.53,TRUE) = 0.6576<\/p>\r\n<p id=\"eip-122\"><em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> &lt; 147) :<code id=\"36\"> BINOM.DIST<\/code>(146,300,0.53,TRUE) = 0.0742<\/p>\r\n<p id=\"eip-267\"><em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> = 175) : <code id=\"36\">BINOM.DIST<\/code>(175,300,0.53,FALSE) = 0.0083 (You use the FALSE argument since we don't want the cumulative probabilities)<\/p>\r\n\r\n<\/div>\r\n<\/section><\/div>\r\n<div id=\"fs-idm31488240\" class=\"statistics try ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Try It <\/span><span class=\"os-number\">6.12<\/span><\/h3>\r\n<\/header><section>\r\n<div id=\"fs-idm78234832\" class=\" unnumbered\" data-type=\"exercise\"><header><\/header><section>\r\n<div id=\"fs-idm12123632\" data-type=\"problem\">\r\n<div class=\"os-problem-container \">\r\n<p id=\"fs-idm152181632\">In a city, 46 percent of the population favor the incumbent, Dawn Morgan, for mayor. A simple random sample of 500 is taken. Using the continuity correction factor, find the probability that at least 250 favor Dawn Morgan for mayor.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>","rendered":"<p><span style=\"display: none;\"><br \/>\n[latexpage]<br \/>\n<\/span><\/p>\n<div id=\"45c3f6e5-1eab-41c3-91a9-e0c42a78c970\" class=\"chapter-content-module\" data-type=\"page\" data-cnxml-to-html-ver=\"2.1.0\">\n<p id=\"delete_me\">Before we begin this chapter, it is important for you to understand when to use the <span id=\"term136\" data-type=\"term\">central limit theorem (CLT)<\/span>. If you are being asked to find the probability of the mean, use the CLT for the mean. If you are being asked to find the probability of a sum or total, use the CLT for sums. This also applies to percentiles for means and sums.<\/p>\n<div id=\"id11030776\" class=\"ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<header>\n<h3 class=\"os-title\" data-type=\"title\"><span id=\"1\" class=\"os-title-label\" data-type=\"\">NOTE<\/span><\/h3>\n<\/header>\n<section>\n<div class=\"os-note-body\">\n<p id=\"fs-idm90223456\">If you are being asked to find the probability of an <strong>individual<\/strong> value, do <strong>not<\/strong> use the Central Limit Theorem. <strong>Use the distribution of its random variable.<\/strong><\/p>\n<\/div>\n<\/section>\n<\/div>\n<section id=\"element-733\" data-depth=\"1\">\n<h3 data-type=\"title\">Examples of the Central Limit Theorem<\/h3>\n<section id=\"fs-idp5962992\" data-depth=\"2\">\n<h4 data-type=\"title\">Law of Large Numbers<\/h4>\n<p>The <span id=\"term137\" data-type=\"term\">law of large numbers<\/span> says that if you take samples of larger and larger size from any population, then the mean $\\bar x$ of the sample tends to get closer and closer to <em data-effect=\"italics\">\u03bc<\/em>. From the central limit theorem, we know that as <em data-effect=\"italics\">n<\/em> gets larger and larger, the sample means follow a normal distribution. The larger <em data-effect=\"italics\">n<\/em> gets, the smaller the standard deviation gets. (Remember that the standard deviation for $\\bar X$ is $\\frac{\\sigma}{\\sqrt{n}}$. ) This means that the sample mean $\\bar x$ must be close to the population mean <em data-effect=\"italics\">\u03bc<\/em>. We can say that <em data-effect=\"italics\">\u03bc<\/em> is the value that the sample means approach as <em data-effect=\"italics\">n<\/em> gets larger. The central limit theorem illustrates the law of large numbers.<\/p>\n<\/section>\n<section id=\"fs-idm1089056\" data-depth=\"2\">\n<h4 data-type=\"title\">Central Limit Theorem for the Mean and Sum Examples<\/h4>\n<div id=\"element-482\" class=\"ui-has-child-title\" data-type=\"example\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.8<\/span><\/h3>\n<\/header>\n<section>\n<div class=\"body\">\n<p id=\"element-361\">A study involving stress is conducted among the students on a college campus. <strong>The stress scores follow a <span id=\"term138\" data-type=\"term\">uniform distribution<\/span><\/strong> with the lowest stress score equal to one and the highest equal to five. Using a sample of 75 students, find:<\/p>\n<ol id=\"fs-idp18838512\" type=\"a\">\n<li>The probability that the <strong>mean stress score<\/strong> for the 75 students is less than two.<\/li>\n<li>The 90<sup>th<\/sup> percentile for the <strong>mean stress score<\/strong> for the 75 students.<\/li>\n<li>The probability that the <strong>total of the 75 stress scores<\/strong> is less than 200.<\/li>\n<li>The 90<sup>th<\/sup> percentile for the <strong>total stress score<\/strong> for the 75 students.<\/li>\n<\/ol>\n<p id=\"element-294\">Let <em data-effect=\"italics\">X<\/em> = one stress score.<\/p>\n<p id=\"element-632\">Problems a and b ask you to find a probability or a percentile for a <span id=\"term139\" data-type=\"term\">mean<\/span>. Problems c and d ask you to find a probability or a percentile for a <strong>total or sum<\/strong>. The sample size, <em data-effect=\"italics\">n<\/em>, is equal to 75.<\/p>\n<p id=\"element-494\">Since the individual stress scores follow a uniform distribution, <em data-effect=\"italics\">X<\/em> ~ <em data-effect=\"italics\">U<\/em>(1, 5) where <em data-effect=\"italics\">a<\/em> = 1 and <em data-effect=\"italics\">b<\/em> = 5 (In other words, the individual stress scores are <strong>uniformly distributed<\/strong> between 1 and 5. See <a href=\"\/introstats\/part\/continuous-random-variables\/\" target=\"_blank\" rel=\"noopener\">Continuous Random Variables<\/a> for an explanation on the uniform distribution).<\/p>\n<p>$\\mu_x = \\frac{a+b}{2}=\\frac{1+5}{2}=3$<\/p>\n<p>$\\sigma_x = \\sqrt{\\frac{(b-a)^2}{12}}=\\sqrt{\\frac{(5-1)^2}{12}}=1.15$<\/p>\n<p>For problems a. and b., let $\\bar X =$ the mean stress score for the 75 students. Then,<br \/>\n$\\bar X \\sim N\\left(3, \\frac{1.15}{\\sqrt{75}} \\right)$<\/p>\n<\/div>\n<div><\/div>\n<p>Find $P(\\bar x &lt;2)$. Draw the graph.<\/p>\n<div id=\"element-197\" class=\"unnumbered\" data-type=\"exercise\">\n<section>\n<div id=\"id10355911\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\n<div class=\"ui-toggle-wrapper\"><\/div>\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.8<\/span><\/h4>\n<div class=\"os-solution-container\">\n<ol type=\"a\">\n<li style=\"list-style-type: none;\">\n<ol type=\"a\">\n<li id=\"element-146\">Find $P(\\bar x &lt;2)$.<br \/>\nDraw the graph.<br \/>\nThe probability that the mean stress score is less than two is about zero.<span id=\"id10559440\" data-type=\"media\" data-alt=\"This is a normal distribution curve over a horizontal axis. The peak of the curve coincides with the point 3 on the horizontal axis. A point, 2, is marked at the left edge of the curve.\"><\/span><span data-type=\"media\" data-alt=\"This is a normal distribution curve over a horizontal axis. The peak of the curve coincides with the point 3 on the horizontal axis. A point, 2, is marked at the left edge of the curve.\"><img decoding=\"async\" id=\"2\" src=\"\/introstats\/wp-content\/uploads\/sites\/2\/2023\/07\/b5dad0da9c024b3da9b9fef4478aa2a82388f089.jpg\" alt=\"This is a normal distribution curve over a horizontal axis. The peak of the curve coincides with the point 3 on the horizontal axis. A point, 2, is marked at the left edge of the curve.\" width=\"380\" data-media-type=\"image\/jpg\" \/><\/span><\/p>\n<div id=\"fs-idm147247216\" class=\"os-figure\">\n<div class=\"os-caption-container\"><span class=\"os-title-label\">Figure <\/span><span class=\"os-number\">6.4<\/span><\/div>\n<\/div>\n<p>$P(\\bar x &lt;2)$=<\/p>\n<p id=\"fs-idm29706432\"><code id=\"3\">=NORM.DIST(2,3,1.15\/SQRT(75),TRUE)-NORM.DIST(1,3,1.15\/SQRT(75),TRUE)<br \/>\n<\/code><br \/>\n=0\n<\/p>\n<div id=\"note-1\" class=\"ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\" data-element-type=\"Reminder\">\n<header>\n<h3 class=\"os-title\" data-type=\"title\"><span id=\"4\" class=\"os-title-label\" data-type=\"\">Reminder<\/span><\/h3>\n<\/header>\n<section>\n<div class=\"os-note-body\">\n<p id=\"fs-idm90269648\">The smallest stress score is one.<\/p>\n<\/div>\n<\/section>\n<\/div>\n<\/li>\n<li>Find the 90<sup>th<\/sup> percentile for the mean of 75 stress scores. Draw a graph.<br \/>\n<span data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 3 on the horizontal axis. A point, k, is labeled to the right of 3. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(x-bar &lt; k) = 0.90.\"><span data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 3 on the horizontal axis. A point, k, is labeled to the right of 3. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(x-bar &lt; k) = 0.90.\"><img decoding=\"async\" id=\"5\" src=\"\/introstats\/wp-content\/uploads\/sites\/2\/2023\/07\/823d909d6d3687a2742a9bcaef0bd4f912e29b88.jpg\" alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 3 on the horizontal axis. A point, k, is labeled to the right of 3. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(x-bar &lt; k) = 0.90.\" width=\"300\" data-media-type=\"image\/jpg\" data-print-width=\"3in\" \/><\/span><\/span><span id=\"id11181476\" data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 3 on the horizontal axis. A point, k, is labeled to the right of 3. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(x-bar &lt; k) = 0.90.\"> <\/span><\/p>\n<div class=\"os-caption-container\"><span class=\"os-title-label\">Figure <\/span><span class=\"os-number\">6.5<\/span><\/div>\n<p id=\"element-179\">Let <em data-effect=\"italics\">k<\/em> = the 90<sup>th<\/sup> percentile.<br \/>\nFind <em data-effect=\"italics\">k<\/em>, where $P(\\bar x &lt; k)=0.9$<\/p>\n<p>The 90<sup>th<\/sup> percentile for the mean of 75 scores is about 3.2. This tells us that 90% of all the means of 75 stress scores are at most 3.2, and that 10% are at least 3.2.<\/p>\n<p><code>=NORM.INV(0.9, 3, 1.15\/SQRT(75))<\/code><\/p>\n<p>$k=3.2$<\/li>\n<li>For problems c and d, let <em data-effect=\"italics\">\u03a3X<\/em> = the sum of the 75 stress scores. Then, $\\sum X \\sim N\\left( 75\\cdot 3, \\sqrt{75}\\cdot 1.15 \\right)$Find <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">\u03a3x<\/em> &lt; 200). Draw the graph.<br \/>\n<span data-type=\"media\" data-alt=\"This is a normal distribution curve over a horizontal axis. The peak of the curve coincides with the point 225 on the horizontal axis. A point, 200, is marked at the left edge of the curve.\"><img decoding=\"async\" id=\"7\" src=\"\/introstats\/wp-content\/uploads\/sites\/2\/2023\/07\/3bcb7bce3ce73c09aa0a4ff2c4204407674c6e55.jpg\" alt=\"This is a normal distribution curve over a horizontal axis. The peak of the curve coincides with the point 225 on the horizontal axis. A point, 200, is marked at the left edge of the curve.\" width=\"380\" data-media-type=\"image\/jpg\" \/><\/span><span id=\"id10985583\" data-type=\"media\" data-alt=\"This is a normal distribution curve over a horizontal axis. The peak of the curve coincides with the point 225 on the horizontal axis. A point, 200, is marked at the left edge of the curve.\"> <\/span><\/p>\n<div class=\"os-caption-container\"><span class=\"os-title-label\">Figure <\/span><span class=\"os-number\"><span class=\"os-number\">6.6<br \/>\n<\/span><\/span><\/div>\n<p id=\"fs-idp52789136\">The mean of the sum of 75 stress scores is (75)(3) = 225<\/p>\n<p>The standard deviation of the sum of 75 stress scores is $\\sqrt{75}\\cdot 1.15 = 9.96$<\/p>\n<p><code>=NORM.DIST(200, 75*3, SQRT(75)*1.15, TRUE)-NORM.DIST(75, 75*3, SQRT(75)*1.15)<br \/>\n<\/code><\/p>\n<div id=\"element-888\" class=\"unnumbered\" data-type=\"exercise\">\n<section>\n<div id=\"id11181711\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\n<div class=\"os-solution-container\">\n<div id=\"note-2\" class=\"ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\" data-element-type=\"Reminder\">\n<header>\n<h3 class=\"os-title\" data-type=\"title\"><span id=\"9\" class=\"os-title-label\" data-type=\"\">Reminder<\/span><\/h3>\n<\/header>\n<section>\n<div class=\"os-note-body\">\n<p id=\"fs-idm105410608\">The smallest total of 75 stress scores is 75, because the smallest single score is one.<\/p>\n<\/div>\n<\/section>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<p>$P(\\sum x &lt; 200) = 0$<\/li>\n<li>Find the 90<sup>th<\/sup> percentile for the total of 75 stress scores. Draw a graph.\n<p id=\"element-308\">Let <em data-effect=\"italics\">k<\/em> = the 90<sup>th<\/sup> percentile.<\/p>\n<p>Find <em data-effect=\"italics\">k<\/em> where <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">\u03a3x<\/em> &lt; <em data-effect=\"italics\">k<\/em>) = 0.90.<span data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 225 on the horizontal axis. A point, k, is labeled to the right of 225. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(sum of x &lt; k) = 0.90.\"><img decoding=\"async\" id=\"10\" src=\"\/introstats\/wp-content\/uploads\/sites\/2\/2023\/07\/34f66e086d52d6f7bcd9843020078ccfabe3d92c.jpg\" alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 225 on the horizontal axis. A point, k, is labeled to the right of 225. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(sum of x &lt; k) = 0.90.\" width=\"380\" data-media-type=\"image\/jpg\" \/><\/span><span id=\"id10985827\" data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 225 on the horizontal axis. A point, k, is labeled to the right of 225. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(sum of x &lt; k) = 0.90.\"> <\/span><\/p>\n<div class=\"os-caption-container\"><span class=\"os-title-label\">Figure <\/span><span class=\"os-number\">6.7<\/span><\/div>\n<p id=\"element-780\">=NORM.INV(0.9, 75*3, SQRT(75)*1.15)<\/p>\n<p>$k = 237.8$<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/section>\n<\/div>\n<div id=\"element-681\" class=\"unnumbered\" data-type=\"exercise\">\n<section>\n<div id=\"id10985728\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\"><\/div>\n<\/section>\n<\/div>\n<div id=\"fs-idm56169616\" class=\"statistics try ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Try It <\/span><span class=\"os-number\">6.8<\/span><\/h3>\n<\/header>\n<section>\n<div id=\"fs-idm127013136\" class=\"unnumbered\" data-type=\"exercise\">\n<header><\/header>\n<section>\n<div id=\"fs-idm53684704\" data-type=\"problem\">\n<div class=\"os-problem-container\">\n<p id=\"fs-idm54565472\">Use the information in <a class=\"autogenerated-content\" href=\"#element-482\">Example 6.8<\/a>, but use a sample size of 55 to answer the following questions.<\/p>\n<ol id=\"fs-idm68188624\" type=\"a\">\n<li>Find <em>$P(\\bar x&lt;7)$.<\/em><\/li>\n<li>Find <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">\u03a3x<\/em> &gt; 170).<\/li>\n<li>Find the 80<sup>th<\/sup> percentile for the mean of 55 scores.<\/li>\n<li>Find the 85<sup>th<\/sup> percentile for the sum of 55 scores.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<div id=\"example2\" class=\"ui-has-child-title\" data-type=\"example\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.9<\/span><\/h3>\n<\/header>\n<section>\n<div class=\"body\">\n<p id=\"eip-id1171180958925\">Suppose that a market research analyst for a cell phone company conducts a study of their customers who exceed the time allowance included on their basic cell phone contract; the analyst finds that for those people who exceed the time included in their basic contract, the <strong>excess time used<\/strong> follows an <span id=\"term140\" data-type=\"term\">exponential distribution<\/span> with a mean of 22 minutes. (Note: you do not need to know anything about the exponential distribution to solve this problem, but if you are curious, you can read about it at <a href=\"https:\/\/en.wikipedia.org\/wiki\/Exponential_distribution\" target=\"_blank\" rel=\"noopener noreferrer\">Wikipedia<\/a>).<\/p>\n<p id=\"eip-id1171181431405\">Consider a random sample of 80 customers who exceed the time allowance included in their basic cell phone contract.<\/p>\n<p id=\"eip-id1171181123857\">Let <em data-effect=\"italics\">X<\/em> = the excess time used by one INDIVIDUAL cell phone customer who exceeds his contracted time allowance.<\/p>\n<p id=\"eip-id1171182604485\"><em data-effect=\"italics\">$X\\sim Exp\\left( \\frac{1}{22} \\right)$. <\/em>From previous chapters, we know that <em data-effect=\"italics\">\u03bc<\/em> = 22 and <em data-effect=\"italics\">\u03c3<\/em> = 22.<\/p>\n<p>Let $\\bar X=$ the mean excess time used by a sample of <em data-effect=\"italics\">n<\/em> = 80 customers who exceed their contracted time allowance.<\/p>\n<p>$\\bar X \\sim N\\left( 22, \\frac{22}{\\sqrt{80}} \\right)$ by the central limit theorem for sample means<\/p>\n<div id=\"eip-id8136792\" class=\"unnumbered\" data-type=\"exercise\">\n<header><\/header>\n<section>\n<div id=\"eip-id8760885\" data-type=\"problem\">\n<div class=\"os-problem-container\">\n<div id=\"eip-id1171180795181\" data-type=\"list\">\n<div id=\"13\" data-type=\"title\"><strong>Using the CLT to find probability<\/strong><\/div>\n<ol type=\"a\">\n<li>Find the probability that the mean excess time used by the 80 customers in the sample is longer than 20 minutes. This is asking us to find $P(\\bar x &gt; 20)$<em data-effect=\"italics\">.<\/em> Draw the graph.<\/li>\n<li>Suppose that one customer who exceeds the time limit for his cell phone contract is randomly selected. Find the probability that this individual customer&#8217;s excess time is longer than 20 minutes. This is asking us to find <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">x<\/em> &gt; 20).<\/li>\n<li>Explain why the probabilities in parts a and b are different.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/div>\n<div id=\"eip-id1171181279568\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\n<div class=\"ui-toggle-wrapper\"><\/div>\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.9<\/span><\/h4>\n<div class=\"os-solution-container\">\n<p id=\"fs-idm154380928\">Find: $P(\\bar x&gt;20)<\/p>\n<p>$P(\\bar x &gt; 20) = 0.7919$ using<br \/>\n<code>=1-NORM.DIST(20,22, 22\/SQRT(80), TRUE)<\/code><\/p>\n<p>The probability is 0.7919 that the mean excess time used is more than 20 minutes, for a sample of 80 customers who exceed their contracted time allowance.<\/p>\n<div id=\"fs-idm52706528\" class=\"os-figure\">\n<figure data-id=\"fs-idm52706528\"><span id=\"id1164895099196\" data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 22 on the horizontal axis. A point, 20, is labeled to the left of 22. A vertical line extends from 20 to the curve. The area under the curve to the right of k is shaded. The shaded area shows that P(x-bar &gt; 20).\"><br \/>\n<img decoding=\"async\" id=\"15\" src=\"\/introstats\/wp-content\/uploads\/sites\/2\/2023\/07\/832dc3bab2cd88641ded60380533d6e556f97ba7.jpg\" alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 22 on the horizontal axis. A point, 20, is labeled to the left of 22. A vertical line extends from 20 to the curve. The area under the curve to the right of k is shaded. The shaded area shows that P(x-bar &gt; 20).\" width=\"380\" data-media-type=\"image\/jpg\" \/><\/span><\/figure>\n<div class=\"os-caption-container\"><span class=\"os-title-label\">Figure <\/span><span class=\"os-number\">6.8<\/span><\/div>\n<\/div>\n<p id=\"element-260\">Find<em data-effect=\"italics\">P<\/em>(x &gt; 20). Remember to use the exponential distribution for an <strong>individual:<\/strong> $X\\sim Exp\\left( \\frac{1}{22} \\right)$<\/p>\n<p>$P(x&gt;20) = e^{-\\frac{1}{22}\\cdot 20} = 0.4029$<\/p>\n<ol>\n<li style=\"list-style-type: none;\">\n<ol id=\"eip-id1171181135515\" type=\"1\">\n<li><em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">x<\/em> &gt; 20) = 0.4029 but <em data-effect=\"italics\">$P(\\bar x &gt; 20)=0.7917$<br \/>\n<\/em><\/li>\n<li>The probabilities are not equal because we use different distributions to calculate the probability for individuals and for means.<\/li>\n<li>When asked to find the probability of an individual value, use the stated distribution of its random variable; do not use the . Use the CLT with the normal distribution when you are being asked to find the probability for a mean.<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<div id=\"eip-id1171180910907\" class=\"unnumbered\" data-type=\"exercise\">\n<header><\/header>\n<section>\n<div id=\"eip-id1171181151372\" data-type=\"problem\">\n<div class=\"os-problem-container\">\n<h4 data-type=\"title\"><strong>Using the CLT to find percentiles<\/strong><\/h4>\n<p>Find the 95<sup>th<\/sup> percentile for the <strong>sample mean excess time<\/strong> for samples of 80 customers who exceed their basic contract time allowances. Draw a graph.<\/p>\n<\/div>\n<\/div>\n<div id=\"eip-id1171181126592\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\n<div class=\"ui-toggle-wrapper\"><\/div>\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.9<\/span><\/h4>\n<div class=\"os-solution-container\">\n<p id=\"eip-id1171181699771\">Let <em data-effect=\"italics\">k<\/em> = the 95<sup>th<\/sup> percentile. Find <em data-effect=\"italics\">k<\/em> where$P(\\bar x &lt;k)=0.95$<\/p>\n<p id=\"eip-id1171180867272\" class=\"finger\"><em data-effect=\"italics\">k<\/em> = 26.0 using<\/p>\n<p class=\"finger\"><code>=NORM.INV(0.95, 22, 22\/SQRT(80))<\/code><code id=\"18\"><\/code><\/p>\n<div id=\"fs-idm61807792\" class=\"os-figure\">\n<figure data-id=\"fs-idm61807792\"><span id=\"id1164881078203\" data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 22 on the horizontal axis. A point, k, is labeled to the right of 22. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(x-bar &lt; k) = 0.95.\"><br \/>\n<img decoding=\"async\" id=\"19\" src=\"https:\/\/openstax.org\/resources\/2bf126a04004910fb6ccb9dbfa68769e790e1a60\" alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 22 on the horizontal axis. A point, k, is labeled to the right of 22. A vertical line extends from k to the curve. The area under the curve to the left of k is shaded. The shaded area shows that P(x-bar &lt; k) = 0.95.\" width=\"380\" data-media-type=\"image\/jpg\" \/><br \/>\n<\/span><\/figure>\n<div class=\"os-caption-container\"><span class=\"os-title-label\">Figure <\/span><span class=\"os-number\">6.9<\/span><\/div>\n<\/div>\n<p id=\"element-987\">The 95<sup>th<\/sup> percentile for the <strong>sample mean excess time used<\/strong> is about 26.0 minutes for random samples of 80 customers who exceed their contractual allowed time.<\/p>\n<p id=\"eip-id1171181013130\">Ninety five percent of such samples would have means under 26 minutes; only five percent of such samples would have means above 26 minutes.<\/p>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n<div id=\"fs-idm50063488\" class=\"statistics try ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Try It <\/span><span class=\"os-number\">6.9<\/span><\/h3>\n<\/header>\n<section>\n<div id=\"fs-idm113638352\" class=\"unnumbered\" data-type=\"exercise\">\n<header><\/header>\n<section>\n<div id=\"fs-idm60469088\" data-type=\"problem\">\n<div class=\"os-problem-container\">\n<p id=\"eip-823\">Use the information in <a class=\"autogenerated-content\" href=\"#example2\">Example 6.9<\/a>, but change the sample size to 144.<\/p>\n<ol id=\"fs-idm72056816\" type=\"a\">\n<li>Find <em data-effect=\"italics\">$P(20&lt;\\bar x &lt; 30)$.<br \/>\n<\/em><\/li>\n<li>Find <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">\u03a3x<\/em> is at least 3,000).<\/li>\n<li>Find the 75<sup>th<\/sup> percentile for the sample mean excess time of 144 customers.<\/li>\n<li>Find the 85<sup>th<\/sup> percentile for the sum of 144 excess times used by customers.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<div id=\"fs-idm153949776\" class=\"ui-has-child-title\" data-type=\"example\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.10<\/span><\/h3>\n<\/header>\n<section>\n<div class=\"body\">\n<p id=\"fs-idp5363584\">In the United States, someone is sexually assaulted every two minutes, on average, according to a number of studies. Suppose the standard deviation is 0.5 minutes and the sample size is 100.<\/p>\n<div id=\"fs-idm67039584\" class=\"unnumbered\" data-type=\"exercise\">\n<header><\/header>\n<section>\n<div id=\"fs-idm9526400\" data-type=\"problem\">\n<div class=\"os-problem-container\">\n<ol id=\"fs-idm184397680\" type=\"a\">\n<li>Find the median, the first quartile, and the third quartile for the sample mean time of sexual assaults in the United States.<\/li>\n<li>Find the median, the first quartile, and the third quartile for the sum of sample times of sexual assaults in the United States.<\/li>\n<li>Find the probability that a sexual assault occurs on the average between 1.75 and 1.85 minutes.<\/li>\n<li>Find the value that is two standard deviations above the sample mean.<\/li>\n<li>Find the <em data-effect=\"italics\">IQR<\/em> for the sum of the sample times.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div id=\"fs-idm110029216\" class=\"finger\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\n<div class=\"ui-toggle-wrapper\"><\/div>\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.10<\/span><\/h4>\n<div class=\"os-solution-container\">\n<ol id=\"fs-idm113256736\" type=\"a\">\n<li>We have, <em data-effect=\"italics\">\u03bc<sub>x<\/sub><\/em> = <em data-effect=\"italics\">\u03bc<\/em> = 2 and <em data-effect=\"italics\">\u03c3<sub>x<\/sub><\/em> =$\\frac{\\sigma}{\\sqrt{n}}=\\frac{0.5}{10}=0.05$ Therefore:\n<ol id=\"fs-idm103882928\" type=\"1\">\n<li>50<sup>th<\/sup> percentile = <em data-effect=\"italics\">\u03bc<sub>x<\/sub><\/em> = <em data-effect=\"italics\">\u03bc<\/em> = 2<\/li>\n<li>25<sup>th<\/sup> percentile <code>= <em data-effect=\"italics\">NORM.INV<\/em>(0.25,2,0.05)<\/code> = 1.97<\/li>\n<li>75<sup>th<\/sup> percentile <code>= <em data-effect=\"italics\">NORM.INV<\/em>(0.75,2,0.05)<\/code> = 2.03<\/li>\n<\/ol>\n<\/li>\n<li>We have <em data-effect=\"italics\">\u03bc<sub>\u03a3x<\/sub><\/em> = <em data-effect=\"italics\">n<\/em>(<em data-effect=\"italics\">\u03bc<sub>x<\/sub><\/em>) = 100(2) = 200 and <em data-effect=\"italics\">\u03c3<sub>\u03bcx<\/sub><\/em> =$\\sqrt{n}(\\sigma_x)$= 10(0.5) = 5. Therefore\n<ol id=\"fs-idm36416432\" type=\"1\">\n<li>50<sup>th<\/sup> percentile = <em data-effect=\"italics\">\u03bc<sub>\u03a3x<\/sub><\/em> = <em data-effect=\"italics\">n<\/em>(<em data-effect=\"italics\">\u03bc<sub>x<\/sub><\/em>) = 100(2) = 200<\/li>\n<li>25<sup>th<\/sup> percentile <code>= NORM.INV(0.25,200,5)<\/code> = 196.63<\/li>\n<li>75<sup>th<\/sup> percentile <code>= NORM.INV(0.75,200,5)<\/code> = 203.37<\/li>\n<\/ol>\n<\/li>\n<li><em data-effect=\"italics\">P<\/em>(1.75 &lt;$\\bar x$&lt; 1.85) <code>= NORM.DIST(1.75,1.85,2,0.05,TRUE)<\/code> = 0.0013<\/li>\n<li>Using the <em data-effect=\"italics\">z<\/em>-score equation, $z=\\frac{\\bar x &#8211; \\mu_{\\bar x}}{\\sigma_{\\bar x}}$, and solving for <em data-effect=\"italics\">x<\/em>, we have <em data-effect=\"italics\">x<\/em> = 2(0.05) + 2 = 2.1<\/li>\n<li>The <em data-effect=\"italics\">IQR<\/em> is 75<sup>th<\/sup> percentile \u2013 25<sup>th<\/sup> percentile = 203.37 \u2013 196.63 = 6.74<\/li>\n<\/ol>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n<div id=\"fs-idm67893584\" class=\"statistics try ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Try It <\/span><span class=\"os-number\">6.10<\/span><\/h3>\n<\/header>\n<section>\n<div id=\"fs-idm59759104\" class=\"unnumbered\" data-type=\"exercise\">\n<header><\/header>\n<section>\n<div id=\"fs-idm113132656\" data-type=\"problem\">\n<div class=\"os-problem-container\">\n<p id=\"fs-idm69144496\">Based on data from the National Health Survey, women between the ages of 18 and 24 have an average systolic blood pressures (in mm Hg) of 114.8 with a standard deviation of 13.1. Systolic blood pressure for women between the ages of 18 to 24 follow a normal distribution.<\/p>\n<ol id=\"fs-idm115059504\" type=\"a\">\n<li>If one woman from this population is randomly selected, find the probability that her systolic blood pressure is greater than 120.<\/li>\n<li>If 40 women from this population are randomly selected, find the probability that their mean systolic blood pressure is greater than 120.<\/li>\n<li>If the sample were four women between the ages of 18 to 24 and we did not know the original distribution, could the central limit theorem be used?<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<div id=\"fs-idm112138992\" class=\"ui-has-child-title\" data-type=\"example\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.11<\/span><\/h3>\n<\/header>\n<section>\n<div class=\"body\">\n<div id=\"fs-idm108732608\" class=\"unnumbered\" data-type=\"exercise\">\n<header><\/header>\n<section>\n<div id=\"fs-idm110483328\" data-type=\"problem\">\n<div class=\"os-problem-container\">\n<p id=\"fs-idm753456\">A study was done about violence against prostitutes and the symptoms of the posttraumatic stress that they developed. The age range of the prostitutes was 14 to 61. The mean age was 30.9 years with a standard deviation of nine years.<\/p>\n<ol id=\"fs-idm45985568\" type=\"a\">\n<li>In a sample of 25 prostitutes, what is the probability that the mean age of the prostitutes is less than 35?<\/li>\n<li>Is it likely that the mean age of the sample group could be more than 50 years? Interpret the results.<\/li>\n<li>In a sample of 49 prostitutes, what is the probability that the sum of the ages is no less than 1,600?<\/li>\n<li>Is it likely that the sum of the ages of the 49 prostitutes is at most 1,595? Interpret the results.<\/li>\n<li>Find the 95<sup>th<\/sup> percentile for the sample mean age of 65 prostitutes. Interpret the results.<\/li>\n<li>Find the 90<sup>th<\/sup> percentile for the sum of the ages of 65 prostitutes. Interpret the results.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div id=\"fs-idm138994048\" class=\"finger\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\n<div class=\"ui-toggle-wrapper\"><\/div>\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.11<\/span><\/h4>\n<div class=\"os-solution-container\">\n<ol id=\"fs-idm86323200\" type=\"a\">\n<li>The standard deviation for the mean of a sample of 25 is $\\sigma_{\\bar x} = \\frac{\\sigma}{\\sqrt{n}}\\frac{9}{\\sqrt{25}}=1.8$<em data-effect=\"italics\"><br \/>\nP<\/em>($\\bar x$ &lt; 35) <code>= NORM.DIST(35,30.9,1.8,TRUE)<\/code> = 0.9886<\/li>\n<li><em data-effect=\"italics\">P<\/em>($\\bar x$&gt; 50) = <code>1-NORM.DIST(50,30.9,1.8,TRUE)<\/code> \u2248 0. For this sample group, it is almost impossible for the group\u2019s average age to be more than 50. However, it is still possible for an individual in this group to have an age greater than 50.<\/li>\n<li>The mean for the sum of ages of 49 people is $\\mu_{\\sum x} = n\\cdot \\mu = 49\\cdot 30.9 = 1514.1$<br \/>\nThe standard deviation for the sum of ages of 49 people is $\\sigma_{\\sum x} = n\\cdot \\frac{\\sigma}{\\sqrt{n}}=49\\cdot \\frac{9}{\\sqrt{49}}=63$<em data-effect=\"italics\"><br \/>\nP<\/em>(<em data-effect=\"italics\">\u03a3x<\/em> \u2265 1,600) <code>= 1-NORM.DIST(1600<\/code>,1514.10,63,TRUE) = 0.0864<\/li>\n<li><em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">\u03a3x<\/em> \u2264 1,595) <code>= NORM.DIST(1595,1514.10,63,TRUE)<\/code> = 0.9005. This means that there is a 90% chance that the sum of the ages for the sample group <em data-effect=\"italics\">n<\/em> = 49 is at most 1595.<\/li>\n<li>The standard deviation for the mean of a sample of 65 is $\\sigma_{\\bar x} = \\frac{\\sigma}{\\sqrt{n}}\\frac{9}{\\sqrt{65}}=1.1$<br \/>\nThe 95th percentile <code>= NORM.INV(0.95,30.9,1.1)<\/code> = 32.7. This indicates that 95% of the prostitutes in the sample of 65 are younger than 32.7 years, on average.<\/li>\n<li>The mean for the sum of ages of 65 people is $\\mu_{\\sum x} = n\\cdot \\mu = 65\\cdot 30.9 = 2008.5$<br \/>\nThe standard deviation for the sum of ages of 65 people is $\\sigma_{\\sum x} = n\\cdot \\frac{\\sigma}{\\sqrt{n}}=65\\cdot \\frac{9}{\\sqrt{65}}=72.56$<br \/>\nThe 90th percentile <code>= NORM.INV(0.90,2008.5,72.56)<\/code> = 2101.5. This indicates that 90% of the prostitutes in the sample of 65 have a sum of ages less than 2,101.5 years.<\/li>\n<\/ol>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n<div id=\"fs-idm102859984\" class=\"statistics try ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Try It <\/span><span class=\"os-number\">6.11<\/span><\/h3>\n<\/header>\n<section>\n<div id=\"fs-idm123966224\" class=\"unnumbered\" data-type=\"exercise\">\n<header><\/header>\n<section>\n<div id=\"fs-idm146706672\" data-type=\"problem\">\n<div class=\"os-problem-container\">\n<p id=\"fs-idm107433712\">According to Boeing data, the 757 airliner carries 200 passengers and has doors with a height of 72 inches. Assume for a certain population of men we have a mean height of 69.0 inches and a standard deviation of 2.8 inches.<\/p>\n<ol id=\"fs-idm131956368\" type=\"a\">\n<li>What doorway height would allow 95% of men to enter the aircraft without bending?<\/li>\n<li>Assume that half of the 200 passengers are men. What mean doorway height satisfies the condition that there is a 0.95 probability that this height is greater than the mean height of 100 men?<\/li>\n<li>For engineers designing the 757, which result is more relevant: the height from part a or part b? Why?<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<div id=\"eip-713\" class=\"ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<header>\n<h3 class=\"os-title\" data-type=\"title\"><span id=\"30\" class=\"os-title-label\" data-type=\"\">HISTORICAL NOTE<\/span><\/h3>\n<\/header>\n<section>\n<div class=\"os-note-body\">\n<p id=\"fs-idm36895584\"><strong>: Normal Approximation to the Binomial<\/strong><\/p>\n<p id=\"eip-614\">Historically, being able to compute binomial probabilities was one of the most important applications of the central limit theorem. Binomial probabilities with a small value for <em data-effect=\"italics\">n<\/em>(say, 20) were displayed in a table in a book. To calculate the probabilities with large values of <em data-effect=\"italics\">n<\/em>, you had to use the binomial formula, which could be very complicated. Using the <span id=\"term141\" data-type=\"term\">normal approximation to the binomial<\/span> distribution simplified the process. To compute the normal approximation to the binomial distribution, take a simple random sample from a population. You must meet the conditions for a <span id=\"term142\" data-type=\"term\">binomial distribution<\/span>:<\/p>\n<ul id=\"norm_bi_approx\" data-bullet-style=\"bullet\">\n<li>there are a certain number <em data-effect=\"italics\">n<\/em> of independent trials<\/li>\n<li>the outcomes of any trial are success or failure<\/li>\n<li>each trial has the same probability of a success <em data-effect=\"italics\">p<\/em><\/li>\n<\/ul>\n<p>Recall that if <em data-effect=\"italics\">X<\/em> is the binomial random variable, then <em data-effect=\"italics\">X<\/em> ~ <em data-effect=\"italics\">B<\/em>(<em data-effect=\"italics\">n, p<\/em>). The shape of the binomial distribution needs to be<br \/>\nsimilar to the shape of the normal distribution. To ensure this, the quantities <em data-effect=\"italics\">np<\/em><br \/>\nand <em data-effect=\"italics\">nq<\/em> must both be greater than five (<em data-effect=\"italics\">np<\/em> &gt; 5 and <em data-effect=\"italics\">nq<\/em> &gt; 5; the approximation is better if they are both greater than or equal to 10). Then the binomial can be approximated by the normal distribution with mean <em data-effect=\"italics\">\u03bc<\/em> = <em data-effect=\"italics\">np<\/em> and standard deviation $\\sigma = \\sqrt{npq}$. Remember that <em data-effect=\"italics\">q<\/em> = 1 \u2013 <em data-effect=\"italics\">p<\/em>. In order to get the best approximation, add 0.5 to <em data-effect=\"italics\">x<\/em> or subtract 0.5 from <em data-effect=\"italics\">x<\/em> (use <em data-effect=\"italics\">x<\/em> + 0.5 or <em data-effect=\"italics\">x<\/em> \u2013 0.5). The number 0.5 is called the <span id=\"term143\" data-type=\"term\">continuity correction factor<\/span> and is used in the following example.<\/p>\n<\/div>\n<\/section>\n<\/div>\n<div id=\"eip-831\" class=\"ui-has-child-title\" data-type=\"example\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.12<\/span><\/h3>\n<\/header>\n<section>\n<div class=\"body\">\n<p id=\"eip-926\">Suppose in a local Kindergarten through 12<sup>th<\/sup> grade (K &#8211; 12) school district, 53 percent of the population favor a charter school for grades K through 5. A simple random sample of 300 is surveyed.<\/p>\n<ol id=\"eip-306\" type=\"a\">\n<li>Find the probability that <strong>at least 150<\/strong> favor a charter school.<\/li>\n<li>Find the probability that <strong>at most 160<\/strong> favor a charter school.<\/li>\n<li>Find the probability that <strong>more than 155<\/strong> favor a charter school.<\/li>\n<li>Find the probability that <strong>fewer than 147<\/strong> favor a charter school.<\/li>\n<li>Find the probability that <strong>exactly 175<\/strong> favor a charter school.<\/li>\n<\/ol>\n<p>Let <em data-effect=\"italics\">X<\/em> = the number that favor a charter school for grades K trough 5. <em data-effect=\"italics\">X<\/em> ~ <em data-effect=\"italics\">B<\/em>(<em data-effect=\"italics\">n, p<\/em>) where <em data-effect=\"italics\">n<\/em> = 300 and <em data-effect=\"italics\">p<\/em> = 0.53. Since <em data-effect=\"italics\">np<\/em> &gt; 5 and <em data-effect=\"italics\">nq<\/em> &gt; 5, use the normal approximation to the binomial. The formulas for the mean and standard deviation are <em data-effect=\"italics\">\u03bc<\/em> = <em data-effect=\"italics\">np<\/em> and $\\sigma = \\sqrt{npq}$. The mean is 159 and the standard deviation is 8.6447. The random variable for the normal distribution is <em data-effect=\"italics\">Y<\/em>. <em data-effect=\"italics\">Y<\/em> ~ <em data-effect=\"italics\">N<\/em>(159, 8.6447).<\/p>\n<p id=\"eip-214\" class=\"finger\">For part a, you <strong>include 150<\/strong> so <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> \u2265 150) has normal approximation <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">Y<\/em> \u2265 149.5) = 0.8641.<\/p>\n<p id=\"eip-554\"><code id=\"31\">1-NORM.DIST<\/code>(149.5,159,8.6447,TRUE) = 0.8641.<\/p>\n<p id=\"eip-323\">For part b, you <strong>include 160<\/strong> so <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> \u2264 160) has normal appraximation <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">Y<\/em> \u2264 160.5) = 0.5689.<\/p>\n<p id=\"eip-162\"><code>NORM.DIST<\/code>(160.5,159,8.6447,TRUE) = 0.5689<\/p>\n<p id=\"eip-630\">For part c, you <strong>exclude 155<\/strong> so <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> &gt; 155) has normal approximation <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">y<\/em> &gt; 155.5) = 0.6572.<\/p>\n<p id=\"eip-763\"><code>1-NORM.DIST<\/code>(155.5,159,8.6447,TRUE) = 0.6572.<\/p>\n<p id=\"eip-955\">For part d, you <strong>exclude 147<\/strong> so <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> &lt; 147) has normal approximation <em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">Y<\/em> &lt; 146.5) = 0.0741.<\/p>\n<p id=\"eip-555\"><code>NORM.DIST<\/code>(146.5,159,8.6447,TRUE) = 0.0741<\/p>\n<p id=\"eip-788\">For part e,<em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> = 175) has normal approximation <em data-effect=\"italics\">P<\/em>(174.5 &lt; <em data-effect=\"italics\">Y<\/em> &lt; 175.5) = 0.0083.<\/p>\n<p id=\"eip-680\"><code id=\"35\"><code>NORM.DIST<\/code><\/code>(175.5,159,8.6447,TRUE)- <code><code>NORM.DIST<\/code><\/code>(174.5,159,8.6447,TRUE)= 0.0083<\/p>\n<p id=\"eip-863\" class=\"finger\"><strong>Because of calculators and computer software<\/strong> that let you calculate binomial probabilities for large values of <em data-effect=\"italics\">n<\/em> easily, it is not necessary to use the the normal approximation to the binomial distribution, provided that you have access to these technology tools. In a spreadsheet application (like Google Sheets), you can use the <a href=\"https:\/\/support.google.com\/docs\/answer\/3093987?hl=en\" target=\"_blank\" rel=\"noopener noreferrer\">BINOM.DIST()<\/a> function to calculate binomial distribution probabilities.<\/p>\n<p id=\"eip-251\">For <a class=\"autogenerated-content\" href=\"#fs-idm153949776\">Example 6.10<\/a>, the probabilities are calculated using the following binomial distribution: (<em data-effect=\"italics\">n<\/em> = 300 and <em data-effect=\"italics\">p<\/em> = 0.53). Compare the binomial and normal distribution answers.<\/p>\n<p id=\"eip-67\"><em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> \u2265 150) :<code id=\"36\">1 - BINOM.DIST<\/code>(149, 300,0.53,TRUE) = 0.8641<\/p>\n<p id=\"eip-560\"><em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> \u2264 160) :<code>B INOM.DIST<\/code>(160,300,0.53,TRUE) = 0.5684<\/p>\n<p id=\"eip-727\"><em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> &gt; 155) :<code id=\"38\">1 - <\/code><code>BINOM.DIST<\/code>(155,300,0.53,TRUE) = 0.6576<\/p>\n<p id=\"eip-122\"><em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> &lt; 147) :<code> BINOM.DIST<\/code>(146,300,0.53,TRUE) = 0.0742<\/p>\n<p id=\"eip-267\"><em data-effect=\"italics\">P<\/em>(<em data-effect=\"italics\">X<\/em> = 175) : <code>BINOM.DIST<\/code>(175,300,0.53,FALSE) = 0.0083 (You use the FALSE argument since we don&#8217;t want the cumulative probabilities)<\/p>\n<\/div>\n<\/section>\n<\/div>\n<div id=\"fs-idm31488240\" class=\"statistics try ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Try It <\/span><span class=\"os-number\">6.12<\/span><\/h3>\n<\/header>\n<section>\n<div id=\"fs-idm78234832\" class=\"unnumbered\" data-type=\"exercise\">\n<header><\/header>\n<section>\n<div id=\"fs-idm12123632\" data-type=\"problem\">\n<div class=\"os-problem-container\">\n<p id=\"fs-idm152181632\">In a city, 46 percent of the population favor the incumbent, Dawn Morgan, for mayor. A simple random sample of 500 is taken. Using the continuity correction factor, find the probability that at least 250 favor Dawn Morgan for mayor.<\/p>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n","protected":false},"author":1,"menu_order":3,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-64","chapter","type-chapter","status-publish","hentry"],"part":57,"_links":{"self":[{"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/pressbooks\/v2\/chapters\/64","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/wp\/v2\/users\/1"}],"version-history":[{"count":13,"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/pressbooks\/v2\/chapters\/64\/revisions"}],"predecessor-version":[{"id":762,"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/pressbooks\/v2\/chapters\/64\/revisions\/762"}],"part":[{"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/pressbooks\/v2\/parts\/57"}],"metadata":[{"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/pressbooks\/v2\/chapters\/64\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/wp\/v2\/media?parent=64"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/pressbooks\/v2\/chapter-type?post=64"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/wp\/v2\/contributor?post=64"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/wp\/v2\/license?post=64"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}