{"id":63,"date":"2021-01-12T22:19:37","date_gmt":"2021-01-12T22:19:37","guid":{"rendered":"https:\/\/textbooks.jaykesler.net\/introstats\/chapter\/centrallimittheoremforsums\/"},"modified":"2024-03-31T22:10:01","modified_gmt":"2024-03-31T22:10:01","slug":"centrallimittheoremforsums","status":"publish","type":"chapter","link":"https:\/\/textbooks.jaykesler.net\/introstats\/chapter\/centrallimittheoremforsums\/","title":{"rendered":"Central Limit Theorem for Sums &amp; Proportions"},"content":{"raw":"<span style=\"display: none;\">\r\n[latexpage]\r\n<\/span>\r\n<div id=\"28eabf92-5a02-4a30-9158-a88d4cc3f7a8\" class=\"chapter-content-module\" data-type=\"page\" data-cnxml-to-html-ver=\"2.1.0\">\r\n<h2>Central Limit Theorem for Sums<\/h2>\r\n\r\n<hr \/>\r\n<p id=\"delete_me\">Suppose <em data-effect=\"italics\">X<\/em> is a random variable with a distribution that may be <strong>known or unknown<\/strong> (it can be any distribution) and suppose:<\/p>\r\n\r\n<ol id=\"list-1\" type=\"a\">\r\n \t<li><em data-effect=\"italics\">\u03bc<sub>X<\/sub><\/em> = the mean of <em data-effect=\"italics\">\u03a7<\/em><\/li>\r\n \t<li><em data-effect=\"italics\">\u03c3<sub>\u03a7<\/sub><\/em> = the standard deviation of <em data-effect=\"italics\">X<\/em><\/li>\r\n<\/ol>\r\n<p id=\"fs-idm29530720\">If you draw random samples of size $n$, then as $n$ increases, the random variable \u03a3<em data-effect=\"italics\">X<\/em> consisting of sums tends to be <span id=\"term132\" data-type=\"term\">normally distributed<\/span> and $\\sum X \\sim N \\left( n\\cdot \\mu_x, \\sqrt{n}\\cdot \\sigma_x \\right)$<\/p>\r\n<strong>The<\/strong> <span id=\"term133\" data-type=\"term\">central limit theorem for sums<\/span> says that if you repeatedly draw samples of a given size (such as repeatedly rolling ten dice) and calculate the sum of each sample, these sums tend to follow a normal distribution. As sample sizes increase, the distribution of means more closely follows the normal distribution. <strong>The normal distribution has a mean equal to the original mean multiplied by the sample size and a standard deviation equal to the original standard deviation multiplied by the square root of the sample size.<\/strong>\r\n<p id=\"element-7\">The random variable \u03a3<em data-effect=\"italics\">X<\/em> has the following <em data-effect=\"italics\">z<\/em>-score associated with it:<\/p>\r\n\r\n<ol id=\"element-434\" type=\"a\">\r\n \t<li>\u03a3<em data-effect=\"italics\">x<\/em> is one sum.<\/li>\r\n \t<li>$z = \\frac{\\sum x -n\\cdot \\mu_X}{\\sqrt{n}\\cdot \\sigma_x}$\r\n<ol id=\"element-434\" type=\"i\">\r\n \t<li>$n\\cdot \\mu_x = $ the mean of $\\sum X$<\/li>\r\n \t<li>$\\sqrt{n}\\cdot \\sigma_X = $ the standard deviation of $\\sum X$<\/li>\r\n<\/ol>\r\n<\/li>\r\n<\/ol>\r\n<div id=\"element-747\" class=\"ui-has-child-title\" data-type=\"example\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.5<\/span><\/h3>\r\n<\/header><section>\r\n<div class=\"body\">\r\n<p id=\"element-455\">An unknown distribution has a mean of 90 and a standard deviation of 15. A sample of size 80 is drawn randomly from the population.<span data-type=\"newline\">\r\n<\/span><\/p>\r\n\r\n<div id=\"element-55\" class=\" unnumbered\" data-type=\"exercise\"><header><\/header><section>\r\n<div id=\"id12220262\" data-type=\"problem\">\r\n<div class=\"os-problem-container \">\r\n<ol id=\"element-710\" type=\"1\">\r\n \t<li>Find the probability that the sum of the 80 values (or the total of the 80 values) is more than 7,500.<\/li>\r\n \t<li>Find the sum that is 1.5 standard deviations above the mean of the sums.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div id=\"id12220316\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\r\n<div class=\"ui-toggle-wrapper\"><\/div>\r\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\r\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.5<\/span><\/h4>\r\n<div class=\"os-solution-container\">\r\n<p id=\"element-804\">Let <em data-effect=\"italics\">X<\/em> = one value from the original unknown population. The probability question asks you to find a probability for <strong>the sum (or total of) 80 values.<\/strong><\/p>\r\n<p id=\"element-56\">\u03a3<em data-effect=\"italics\">X<\/em> = the sum or total of 80 values. Since <em data-effect=\"italics\">\u03bc<sub>X<\/sub><\/em> = 90, <em data-effect=\"italics\">\u03c3<sub>X<\/sub><\/em> = 15, and <em data-effect=\"italics\">n<\/em> = 80, $\\sum X \\sim N(80\\cdot 90, \\sqrt{80}\\cdot 15)$<span id=\"MathJax-Element-41-Frame\" class=\"MathJax\" style=\"position: relative;\" tabindex=\"0\" role=\"presentation\" data-mathml=\"&lt;\/p&gt; &lt;math xmlns=&quot;http:\/\/www.w3.org\/1998\/Math\/MathML&quot; display=&quot;inline&quot;&gt;&lt;semantics&gt;&lt;mrow&gt;&lt;mrow&gt;&lt;mi&gt;&amp;#x3A3;&lt;\/mi&gt;&lt;mi&gt;X&lt;\/mi&gt;&lt;\/mrow&gt;&lt;\/mrow&gt;&lt;annotation-xml encoding=&quot;MathML-Content&quot;&gt;&lt;mrow&gt;&lt;mi&gt;\u03a3&lt;\/mi&gt;&lt;mi&gt;X&lt;\/mi&gt;&lt;\/mrow&gt;&lt;\/annotation-xml&gt;&lt;\/semantics&gt;&lt;\/math&gt; &lt;p&gt;\"><span id=\"MathJax-Span-612\" class=\"math\" style=\"width: 2.162em; display: inline-block;\"><\/span><\/span><\/p>\r\n\r\n<ul id=\"element-907\">\r\n \t<li>mean of the sums = $n\\cdot \\mu_X = 80\\cdot 90 = 7,200$<\/li>\r\n \t<li>standard deviation of the sums = $\\sqrt{80}\\cdot 15)$<span id=\"MathJax-Element-41-Frame\" class=\"MathJax\" tabindex=\"0\" role=\"presentation\" data-mathml=\"&lt;\/p&gt; &lt;math xmlns=&quot;http:\/\/www.w3.org\/1998\/Math\/MathML&quot; display=&quot;inline&quot;&gt;&lt;semantics&gt;&lt;mrow&gt;&lt;mrow&gt;&lt;mi&gt;&amp;#x3A3;&lt;\/mi&gt;&lt;mi&gt;X&lt;\/mi&gt;&lt;\/mrow&gt;&lt;\/mrow&gt;&lt;annotation-xml encoding=&quot;MathML-Content&quot;&gt;&lt;mrow&gt;&lt;mi&gt;\u03a3&lt;\/mi&gt;&lt;mi&gt;X&lt;\/mi&gt;&lt;\/mrow&gt;&lt;\/annotation-xml&gt;&lt;\/semantics&gt;&lt;\/math&gt; &lt;p&gt;\"><span id=\"MathJax-Span-612\" class=\"math\"><\/span><\/span><\/li>\r\n \t<li>sum of 80 values = <em data-effect=\"italics\">\u03a3x<\/em> = 7,500<span data-type=\"newline\">\r\n<\/span><\/li>\r\n<\/ul>\r\n<ol>\r\n \t<li id=\"fs-idm55727008\">Find <em data-effect=\"italics\">P<\/em>(\u03a3<em data-effect=\"italics\">x<\/em> &gt; 7,500)<span id=\"id12212529\" data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 7200 on the horizontal axis. The point 7500 is also labeled. A vertical line extends from point 7500 to the curve. The area to the right of 7500 below the curve is shaded.\"><\/span><span id=\"id12212529\" data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 7200 on the horizontal axis. The point 7500 is also labeled. A vertical line extends from point 7500 to the curve. The area to the right of 7500 below the curve is shaded.\"><img id=\"4\" src=\"\/introstats\/wp-content\/uploads\/sites\/2\/2023\/07\/eb76239c2eaeaa32db6e00ac06a06d516145c390.jpg\" alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 7200 on the horizontal axis. The point 7500 is also labeled. A vertical line extends from point 7500 to the curve. The area to the right of 7500 below the curve is shaded.\" width=\"380\" data-media-type=\"image\/jpg\" \/>\r\n<\/span>\r\n<div id=\"fs-idm54365952\" class=\"os-figure\">\r\n<div class=\"os-caption-container\"><span class=\"os-title-label\">Figure <\/span><span class=\"os-number\">6.3<\/span><\/div>\r\n<\/div>\r\n<div id=\"fs-idm202010592\" class=\"statistics textbox spreadsheet ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\"><header>\r\n<h3 class=\"os-title\" data-type=\"title\"><span class=\"os-title-label\">Using Google Sheets\r\n<\/span><\/h3>\r\n<\/header><section>\r\n<div class=\"os-note-body\">\r\n<p id=\"element-389\"><code id=\"5\"><bdo dir=\"ltr\"><span class=\"formula-content\"><span class=\" default-formula-text-color\" dir=\"auto\">=<\/span><span class=\"number\" dir=\"auto\">1<\/span><span class=\" default-formula-text-color\" dir=\"auto\">-<\/span><span class=\" default-formula-text-color\" dir=\"auto\">NORM.DIST<\/span><span class=\" default-formula-text-color\" dir=\"auto\">(<\/span><span class=\"number\" dir=\"auto\">7500<\/span><span class=\" default-formula-text-color\" dir=\"auto\">,<\/span><span class=\"number\" dir=\"auto\">80<\/span><span class=\" default-formula-text-color\" dir=\"auto\">*<\/span><span class=\"number\" dir=\"auto\">90<\/span><span class=\" default-formula-text-color\" dir=\"auto\">,<\/span><span class=\" default-formula-text-color\" dir=\"auto\">SQRT<\/span><span class=\" default-formula-text-color\" dir=\"auto\">(<\/span><span class=\"number\" dir=\"auto\">80<\/span><span class=\" default-formula-text-color\" dir=\"auto\">)<\/span><span class=\" default-formula-text-color\" dir=\"auto\">*<\/span><span class=\"number\" dir=\"auto\">15<\/span><span class=\" default-formula-text-color\" dir=\"auto\">,<\/span><span class=\"boolean\" dir=\"auto\">TRUE<\/span><span class=\" default-formula-text-color\" dir=\"auto\">)<\/span><\/span><\/bdo><\/code><\/p>\r\n<img class=\"aligncenter size-full wp-image-499\" src=\"https:\/\/textbooks.jaykesler.net\/introstats\/wp-content\/uploads\/sites\/2\/2021\/01\/Spreadsheet_7_2.png\" alt=\"\" width=\"439\" height=\"307\" \/>\r\n\r\n<\/div>\r\n<\/section><\/div>\r\n<p id=\"fs-idm48238000\">P(\u03a3x &gt; 7,500)= 0.0127<\/p>\r\n<\/li>\r\n \t<li>Find \u03a3<em data-effect=\"italics\">x<\/em> where <em data-effect=\"italics\">z<\/em> = 1.5.$\\sum x = n \\cdot \\mu_x + z \\cdot \\sqrt{n} \\cdot \\sigma_x = 80\\cdot 90 + 1.5 \\cdot \\sqrt{80}\\cdot 15 = 7,401.2$<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<\/div>\r\n<\/section><\/div>\r\n<div id=\"fs-idm51983600\" class=\"statistics try ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Try It <\/span><span class=\"os-number\">6.5<\/span><\/h3>\r\n<\/header><section>\r\n<div id=\"fs-idm105647824\" class=\" unnumbered\" data-type=\"exercise\"><header><\/header><section>\r\n<div id=\"fs-idm71758576\" data-type=\"problem\">\r\n<div class=\"os-problem-container \">\r\n<p id=\"fs-idp18637136\">An unknown distribution has a mean of 45 and a standard deviation of eight. A sample size of 50 is drawn randomly from the population. Find the probability that the sum of the 50 values is more than 2,400.<\/p>\r\n\r\n<\/div>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<div id=\"fs-idm133155984\" class=\"statistics textbox spreadsheet ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\"><header>\r\n<h3 class=\"os-title\" data-type=\"title\"><span class=\"os-title-label\">Using Google Sheets\r\n<\/span><\/h3>\r\n<\/header><section>\r\n<div class=\"os-note-body\">\r\n<p id=\"fs-idm82544544\">To find percentiles for sums using a spreadsheet, use the <a href=\"https:\/\/support.google.com\/docs\/answer\/3094022?hl=en\" target=\"_blank\" rel=\"noopener noreferrer\">NORM.INV<\/a> function. Type the following in a cell:<\/p>\r\n<code>=NORM.INV(k, n * mean, SQRT(n)*standard deviation)<\/code>\r\n<p id=\"fs-idm95058576\">where:<\/p>\r\n\r\n<ul id=\"fs-idm10652880\">\r\n \t<li><em data-effect=\"italics\">k<\/em> is the <em data-effect=\"italics\">k<\/em><sup>th<\/sup> <span id=\"term135\" data-type=\"term\">percentile<\/span><\/li>\r\n \t<li><em data-effect=\"italics\">mean<\/em> is the mean of the original distribution<\/li>\r\n \t<li><em data-effect=\"italics\">standard deviation<\/em> is the standard deviation of the original distribution<\/li>\r\n \t<li><em data-effect=\"italics\">sample size<\/em> = <em data-effect=\"italics\">n<\/em><\/li>\r\n<\/ul>\r\n<\/div>\r\n<\/section><\/div>\r\n<div id=\"fs-idm82232896\" class=\"ui-has-child-title\" data-type=\"example\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.6<\/span><\/h3>\r\n<\/header><section>\r\n<div class=\"body\">\r\n<div id=\"fs-idm90047616\" class=\" unnumbered\" data-type=\"exercise\"><header><\/header><section>\r\n<div id=\"fs-idm70958544\" data-type=\"problem\">\r\n<div class=\"os-problem-container \">\r\n<p id=\"fs-idm5626528\">In a study reported Oct. 29, 2012 on the Flurry Blog, the mean age of tablet users is 34 years. Suppose the standard deviation is 15 years. The sample of size is 50.<\/p>\r\n\r\n<ol id=\"fs-idm90393840\" type=\"a\">\r\n \t<li>What are the mean and standard deviation for the sum of the ages of tablet users? What is the distribution?<\/li>\r\n \t<li>Find the probability that the sum of the ages is between 1,500 and 1,800 years.<\/li>\r\n \t<li>Find the 80<sup>th<\/sup> percentile for the sum of the 50 ages.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp41767792\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\r\n<div class=\"ui-toggle-wrapper\"><\/div>\r\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\r\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.6<\/span><\/h4>\r\n<div class=\"os-solution-container\">\r\n<ol id=\"fs-idp6531984\" type=\"a\">\r\n \t<li><em>$\\mu_{\\sum x}=n\\mu_{x}=1700$ and $\\sigma_{\\sum x}=\\sqrt{n}~\\sigma_{x} = \\sqrt{50}\\cdot 15 = 106.07$\r\n<\/em>The distribution is normal for sums by the central limit theorem.<\/li>\r\n \t<li><em data-effect=\"italics\">P<\/em>(1500 &lt; \u03a3<em data-effect=\"italics\">x<\/em> &lt; 1800) =<code>\r\n=NORM.DIST(1800,50*34, SQRT(50)*15,TRUE)-NORM.DIST(1500,50*34,SQRT(50)*15,TRUE)<\/code>\r\n=0.7974<\/li>\r\n \t<li>Let <em data-effect=\"italics\">k<\/em> = the 80<sup>th<\/sup> percentile.\r\n<em data-effect=\"italics\">k<\/em> =\r\n<code>=NORM.INV(0.80,50*34,SQRT(50)*15)<\/code>\r\n=1,789.3<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<\/div>\r\n<\/section><\/div>\r\n<div id=\"fs-idp87073664\" class=\"statistics try ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Try It <\/span><span class=\"os-number\">6.6<\/span><\/h3>\r\n<\/header><section>\r\n<div id=\"fs-idm67768928\" class=\" unnumbered\" data-type=\"exercise\"><header><\/header><section>\r\n<div id=\"fs-idm16414224\" data-type=\"problem\">\r\n<div class=\"os-problem-container \">\r\n<p id=\"fs-idm98358608\">In a recent study reported Oct.29, 2012 on the Flurry Blog, the mean age of tablet users is 35 years. Suppose the standard deviation is ten years. The sample size is 39.<\/p>\r\n\r\n<ol id=\"fs-idp46009888\" type=\"a\">\r\n \t<li>What are the mean and standard deviation for the sum of the ages of tablet users? What is the distribution?<\/li>\r\n \t<li>Find the probability that the sum of the ages is between 1,400 and 1,500 years.<\/li>\r\n \t<li>Find the 90<sup>th<\/sup> percentile for the sum of the 39 ages.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<div id=\"fs-idm145402592\" class=\"ui-has-child-title\" data-type=\"example\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.7<\/span><\/h3>\r\n<\/header><section>\r\n<div class=\"body\">\r\n<div id=\"fs-idm29377216\" class=\" unnumbered\" data-type=\"exercise\"><header><\/header><section>\r\n<div id=\"fs-idm71536528\" data-type=\"problem\">\r\n<div class=\"os-problem-container \">\r\n<p id=\"fs-idm105248448\">The mean number of minutes for app engagement by a tablet user is 8.2 minutes. Suppose the standard deviation is one minute. Take a sample of size 70.<\/p>\r\n\r\n<ol id=\"fs-idm71334512\" type=\"a\">\r\n \t<li>What are the mean and standard deviation for the sums?<\/li>\r\n \t<li>Find the 95<sup>th<\/sup> percentile for the sum of the sample. Interpret this value in a complete sentence.<\/li>\r\n \t<li>Find the probability that the sum of the sample is at least ten hours.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp2253552\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\r\n<div class=\"ui-toggle-wrapper\"><\/div>\r\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\r\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.7<\/span><\/h4>\r\n<div class=\"os-solution-container\">\r\n<ol id=\"fs-idm78488688\" type=\"a\">\r\n \t<li><em data-effect=\"italics\">\u03bc<sub>\u03a3x<\/sub><\/em> = <em data-effect=\"italics\">n\u03bc<sub>x<\/sub><\/em> = 70(8.2) = 574 minutes and <em data-effect=\"italics\">\u03c3<sub>\u03a3x<\/sub><\/em> $=\\sqrt{n}\\cdot \\sigma_x= \\sqrt{70}\\cdot 1 = 8.37$ minutes<\/li>\r\n \t<li>Let <em data-effect=\"italics\">k<\/em> = the 95<sup>th<\/sup> percentile.\r\n<em data-effect=\"italics\">k<\/em> =\r\n<code>=NORM.INV(0.95, 70*8.2,SQRT(70)*1)<\/code>\r\n= 587.76 minutes\r\n<span data-type=\"newline\" data-count=\"1\">\r\n<\/span>Ninety five percent of the sums of app engagement times are at most 587.76 minutes.<\/li>\r\n \t<li>ten hours = 600 minutes\r\n<span data-type=\"newline\">\r\n<\/span><em data-effect=\"italics\">P<\/em>(\u03a3<em data-effect=\"italics\">x<\/em> \u2265 600) =\r\n<code>=1-NORM.DIST(600,70*8.2, SQRT(70)*1, TRUE)<\/code>\r\n<code id=\"14\"><\/code>= 0.0009<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<\/div>\r\n<\/section><\/div>\r\n<div id=\"fs-idp10112640\" class=\"statistics try ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Try It <\/span><span class=\"os-number\">6.7<\/span><\/h3>\r\n<\/header><section>\r\n<div id=\"fs-idp7815296\" class=\" unnumbered\" data-type=\"exercise\"><section>\r\n<div id=\"fs-idm10364128\" data-type=\"problem\">\r\n<div class=\"os-problem-container \">\r\n<p id=\"fs-idp93890400\">The mean number of minutes for app engagement by a table use is 8.2 minutes. Suppose the standard deviation is one minute. Take a sample size of 70.<\/p>\r\n\r\n<ol id=\"fs-idm85536928\" type=\"a\">\r\n \t<li>What is the probability that the sum of the sample is between seven hours and ten hours? What does this mean in context of the problem?<\/li>\r\n \t<li>Find the 84<sup>th<\/sup> and 16<sup>th<\/sup> percentiles for the sum of the sample. Interpret these values in context.<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<h2>Central Limit Theorem for Proportions<\/h2>\r\n\r\n<hr \/>\r\n\r\nIf you draw random samples of size $n$ from a population that has a proportion $p$, then as $n$ increases, the random variable $X_{\\hat p}$ consisting of proportions tends to be <span id=\"term132\" data-type=\"term\">normally distributed<\/span> and $X_{\\hat p} \\sim N \\left( p, \\sqrt{\\dfrac{p (1-p)}{n}}\\right)$\r\n\r\nFor accuracy, we need to require $n\\cdot p &gt; 10$ and $n \\cdot (1-p) &gt; 10$.\r\n\r\n<strong>The<\/strong> <span id=\"term133\" data-type=\"term\">central limit theorem for proportions<\/span> says that if you repeatedly draw samples of a given size (such as repeatedly rolling ten dice) and calculate the proportion of each sample (for example calculate the proportion of 1's that show up), these sums tend to follow a normal distribution. As sample sizes increase, the distribution of means more closely follows the normal distribution. <strong>The normal distribution has a mean equal to the original population's proportion and a standard deviation equal to the square root of the population proportion multiplied by it's compliment 1-p divided by the sample size.<\/strong>\r\n<p id=\"element-7\">The random variable $X_{\\hat p}$ has the following <em data-effect=\"italics\">z<\/em>-score associated with it:<\/p>\r\n\r\n<ol id=\"element-434\" type=\"a\">\r\n \t<li>$\\hat p$ is one proportion.<\/li>\r\n \t<li>$z = \\dfrac{\\hat p -\\mu_{\\hat p}}{\\sigma_{\\hat p}}$\r\n<ol id=\"element-434\" type=\"i\">\r\n \t<li>$\\mu_{\\hat p} = p$<\/li>\r\n \t<li>$\\sigma_{\\hat p } = \\sqrt{\\dfrac{p(1-p)}{n}}$<\/li>\r\n<\/ol>\r\n<\/li>\r\n<\/ol>\r\nSo we could calculate the $z$-score as\r\n\r\n$z = \\dfrac{\\hat p -p}{ \\sqrt{\\dfrac{p(1-p)}{n}}}$\r\n<div id=\"fs-idm145402592\" class=\"ui-has-child-title\" data-type=\"example\"><header>\r\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.7.5<\/span><\/h3>\r\n<\/header><section>\r\n<div class=\"body\">\r\n<div id=\"fs-idm29377216\" class=\" unnumbered\" data-type=\"exercise\"><header><\/header><section>\r\n<div id=\"fs-idm71536528\" data-type=\"problem\">\r\n<div class=\"os-problem-container \">\r\n<p id=\"fs-idm105248448\">A die is rolled 50 times and the proportion of 3's is calculated.<\/p>\r\n\r\n<ol id=\"fs-idm71334512\" type=\"a\">\r\n \t<li>What are the mean and standard deviation for the proportions?<\/li>\r\n \t<li>Find the probability that at least 17% of the rolls are 3's<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/div>\r\n<div id=\"fs-idp2253552\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\r\n<div class=\"ui-toggle-wrapper\"><\/div>\r\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\r\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.7.5<\/span><\/h4>\r\n<div class=\"os-solution-container\">\r\n<ol id=\"fs-idm78488688\" type=\"a\">\r\n \t<li>\u00a0$\\mu_{\\hat p}=p = 1\/6 \u2248 0.1667$ and $\\sigma_{\\hat p}=\\sqrt{\\dfrac{p(1-p)}{n}}= \\sqrt{\\dfrac{0.1667 (0.8333)}{50}} = 0.0527$<\/li>\r\n \t<li>First, convert 17% (0.17)\u00a0 into a z-score\r\n$z = \\dfrac{\\hat p -\\mu_{\\hat p}}{\\sigma_{\\hat p}}= \\dfrac{0.17 -0.1667}{ 0.0527} = 0.06$\r\n<span data-type=\"newline\">\r\n<\/span>$P( \\hat p \u2265 0.17) =$\r\n<code>=1-NORM.S.DIST(0.06)<\/code>\r\n<code id=\"14\"><\/code>= 0.4761<\/li>\r\n<\/ol>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/section><\/div>\r\n<\/div>\r\n<\/section><\/div>\r\n<\/div>","rendered":"<p><span style=\"display: none;\"><br \/>\n[latexpage]<br \/>\n<\/span><\/p>\n<div id=\"28eabf92-5a02-4a30-9158-a88d4cc3f7a8\" class=\"chapter-content-module\" data-type=\"page\" data-cnxml-to-html-ver=\"2.1.0\">\n<h2>Central Limit Theorem for Sums<\/h2>\n<hr \/>\n<p id=\"delete_me\">Suppose <em data-effect=\"italics\">X<\/em> is a random variable with a distribution that may be <strong>known or unknown<\/strong> (it can be any distribution) and suppose:<\/p>\n<ol id=\"list-1\" type=\"a\">\n<li><em data-effect=\"italics\">\u03bc<sub>X<\/sub><\/em> = the mean of <em data-effect=\"italics\">\u03a7<\/em><\/li>\n<li><em data-effect=\"italics\">\u03c3<sub>\u03a7<\/sub><\/em> = the standard deviation of <em data-effect=\"italics\">X<\/em><\/li>\n<\/ol>\n<p id=\"fs-idm29530720\">If you draw random samples of size $n$, then as $n$ increases, the random variable \u03a3<em data-effect=\"italics\">X<\/em> consisting of sums tends to be <span id=\"term132\" data-type=\"term\">normally distributed<\/span> and $\\sum X \\sim N \\left( n\\cdot \\mu_x, \\sqrt{n}\\cdot \\sigma_x \\right)$<\/p>\n<p><strong>The<\/strong> <span id=\"term133\" data-type=\"term\">central limit theorem for sums<\/span> says that if you repeatedly draw samples of a given size (such as repeatedly rolling ten dice) and calculate the sum of each sample, these sums tend to follow a normal distribution. As sample sizes increase, the distribution of means more closely follows the normal distribution. <strong>The normal distribution has a mean equal to the original mean multiplied by the sample size and a standard deviation equal to the original standard deviation multiplied by the square root of the sample size.<\/strong><\/p>\n<p id=\"element-7\">The random variable \u03a3<em data-effect=\"italics\">X<\/em> has the following <em data-effect=\"italics\">z<\/em>-score associated with it:<\/p>\n<ol id=\"element-434\" type=\"a\">\n<li>\u03a3<em data-effect=\"italics\">x<\/em> is one sum.<\/li>\n<li>$z = \\frac{\\sum x -n\\cdot \\mu_X}{\\sqrt{n}\\cdot \\sigma_x}$\n<ol type=\"i\">\n<li>$n\\cdot \\mu_x = $ the mean of $\\sum X$<\/li>\n<li>$\\sqrt{n}\\cdot \\sigma_X = $ the standard deviation of $\\sum X$<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<div id=\"element-747\" class=\"ui-has-child-title\" data-type=\"example\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.5<\/span><\/h3>\n<\/header>\n<section>\n<div class=\"body\">\n<p id=\"element-455\">An unknown distribution has a mean of 90 and a standard deviation of 15. A sample of size 80 is drawn randomly from the population.<span data-type=\"newline\"><br \/>\n<\/span><\/p>\n<div id=\"element-55\" class=\"unnumbered\" data-type=\"exercise\">\n<header><\/header>\n<section>\n<div id=\"id12220262\" data-type=\"problem\">\n<div class=\"os-problem-container\">\n<ol id=\"element-710\" type=\"1\">\n<li>Find the probability that the sum of the 80 values (or the total of the 80 values) is more than 7,500.<\/li>\n<li>Find the sum that is 1.5 standard deviations above the mean of the sums.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div id=\"id12220316\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\n<div class=\"ui-toggle-wrapper\"><\/div>\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.5<\/span><\/h4>\n<div class=\"os-solution-container\">\n<p id=\"element-804\">Let <em data-effect=\"italics\">X<\/em> = one value from the original unknown population. The probability question asks you to find a probability for <strong>the sum (or total of) 80 values.<\/strong><\/p>\n<p id=\"element-56\">\u03a3<em data-effect=\"italics\">X<\/em> = the sum or total of 80 values. Since <em data-effect=\"italics\">\u03bc<sub>X<\/sub><\/em> = 90, <em data-effect=\"italics\">\u03c3<sub>X<\/sub><\/em> = 15, and <em data-effect=\"italics\">n<\/em> = 80, $\\sum X \\sim N(80\\cdot 90, \\sqrt{80}\\cdot 15)$<span id=\"MathJax-Element-41-Frame\" class=\"MathJax\" style=\"position: relative;\" tabindex=\"0\" role=\"presentation\" data-mathml=\"&lt;\/p&gt; &lt;math xmlns=&quot;http:\/\/www.w3.org\/1998\/Math\/MathML&quot; display=&quot;inline&quot;&gt;&lt;semantics&gt;&lt;mrow&gt;&lt;mrow&gt;&lt;mi&gt;&amp;#x3A3;&lt;\/mi&gt;&lt;mi&gt;X&lt;\/mi&gt;&lt;\/mrow&gt;&lt;\/mrow&gt;&lt;annotation-xml encoding=&quot;MathML-Content&quot;&gt;&lt;mrow&gt;&lt;mi&gt;\u03a3&lt;\/mi&gt;&lt;mi&gt;X&lt;\/mi&gt;&lt;\/mrow&gt;&lt;\/annotation-xml&gt;&lt;\/semantics&gt;&lt;\/math&gt; &lt;p&gt;\"><span id=\"MathJax-Span-612\" class=\"math\" style=\"width: 2.162em; display: inline-block;\"><\/span><\/span><\/p>\n<ul id=\"element-907\">\n<li>mean of the sums = $n\\cdot \\mu_X = 80\\cdot 90 = 7,200$<\/li>\n<li>standard deviation of the sums = $\\sqrt{80}\\cdot 15)$<span class=\"MathJax\" tabindex=\"0\" role=\"presentation\" data-mathml=\"&lt;\/p&gt; &lt;math xmlns=&quot;http:\/\/www.w3.org\/1998\/Math\/MathML&quot; display=&quot;inline&quot;&gt;&lt;semantics&gt;&lt;mrow&gt;&lt;mrow&gt;&lt;mi&gt;&amp;#x3A3;&lt;\/mi&gt;&lt;mi&gt;X&lt;\/mi&gt;&lt;\/mrow&gt;&lt;\/mrow&gt;&lt;annotation-xml encoding=&quot;MathML-Content&quot;&gt;&lt;mrow&gt;&lt;mi&gt;\u03a3&lt;\/mi&gt;&lt;mi&gt;X&lt;\/mi&gt;&lt;\/mrow&gt;&lt;\/annotation-xml&gt;&lt;\/semantics&gt;&lt;\/math&gt; &lt;p&gt;\"><span class=\"math\"><\/span><\/span><\/li>\n<li>sum of 80 values = <em data-effect=\"italics\">\u03a3x<\/em> = 7,500<span data-type=\"newline\"><br \/>\n<\/span><\/li>\n<\/ul>\n<ol>\n<li id=\"fs-idm55727008\">Find <em data-effect=\"italics\">P<\/em>(\u03a3<em data-effect=\"italics\">x<\/em> &gt; 7,500)<span id=\"id12212529\" data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 7200 on the horizontal axis. The point 7500 is also labeled. A vertical line extends from point 7500 to the curve. The area to the right of 7500 below the curve is shaded.\"><\/span><span data-type=\"media\" data-alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 7200 on the horizontal axis. The point 7500 is also labeled. A vertical line extends from point 7500 to the curve. The area to the right of 7500 below the curve is shaded.\"><img decoding=\"async\" id=\"4\" src=\"\/introstats\/wp-content\/uploads\/sites\/2\/2023\/07\/eb76239c2eaeaa32db6e00ac06a06d516145c390.jpg\" alt=\"This is a normal distribution curve. The peak of the curve coincides with the point 7200 on the horizontal axis. The point 7500 is also labeled. A vertical line extends from point 7500 to the curve. The area to the right of 7500 below the curve is shaded.\" width=\"380\" data-media-type=\"image\/jpg\" \/><br \/>\n<\/span><\/p>\n<div id=\"fs-idm54365952\" class=\"os-figure\">\n<div class=\"os-caption-container\"><span class=\"os-title-label\">Figure <\/span><span class=\"os-number\">6.3<\/span><\/div>\n<\/div>\n<div id=\"fs-idm202010592\" class=\"statistics textbox spreadsheet ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<header>\n<h3 class=\"os-title\" data-type=\"title\"><span class=\"os-title-label\">Using Google Sheets<br \/>\n<\/span><\/h3>\n<\/header>\n<section>\n<div class=\"os-note-body\">\n<p id=\"element-389\"><code id=\"5\"><bdo dir=\"ltr\"><span class=\"formula-content\"><span class=\"default-formula-text-color\" dir=\"auto\">=<\/span><span class=\"number\" dir=\"auto\">1<\/span><span class=\"default-formula-text-color\" dir=\"auto\">-<\/span><span class=\"default-formula-text-color\" dir=\"auto\">NORM.DIST<\/span><span class=\"default-formula-text-color\" dir=\"auto\">(<\/span><span class=\"number\" dir=\"auto\">7500<\/span><span class=\"default-formula-text-color\" dir=\"auto\">,<\/span><span class=\"number\" dir=\"auto\">80<\/span><span class=\"default-formula-text-color\" dir=\"auto\">*<\/span><span class=\"number\" dir=\"auto\">90<\/span><span class=\"default-formula-text-color\" dir=\"auto\">,<\/span><span class=\"default-formula-text-color\" dir=\"auto\">SQRT<\/span><span class=\"default-formula-text-color\" dir=\"auto\">(<\/span><span class=\"number\" dir=\"auto\">80<\/span><span class=\"default-formula-text-color\" dir=\"auto\">)<\/span><span class=\"default-formula-text-color\" dir=\"auto\">*<\/span><span class=\"number\" dir=\"auto\">15<\/span><span class=\"default-formula-text-color\" dir=\"auto\">,<\/span><span class=\"boolean\" dir=\"auto\">TRUE<\/span><span class=\"default-formula-text-color\" dir=\"auto\">)<\/span><\/span><\/bdo><\/code><\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-499\" src=\"https:\/\/textbooks.jaykesler.net\/introstats\/wp-content\/uploads\/sites\/2\/2021\/01\/Spreadsheet_7_2.png\" alt=\"\" width=\"439\" height=\"307\" \/><\/p>\n<\/div>\n<\/section>\n<\/div>\n<p id=\"fs-idm48238000\">P(\u03a3x &gt; 7,500)= 0.0127<\/p>\n<\/li>\n<li>Find \u03a3<em data-effect=\"italics\">x<\/em> where <em data-effect=\"italics\">z<\/em> = 1.5.$\\sum x = n \\cdot \\mu_x + z \\cdot \\sqrt{n} \\cdot \\sigma_x = 80\\cdot 90 + 1.5 \\cdot \\sqrt{80}\\cdot 15 = 7,401.2$<\/li>\n<\/ol>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n<div id=\"fs-idm51983600\" class=\"statistics try ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Try It <\/span><span class=\"os-number\">6.5<\/span><\/h3>\n<\/header>\n<section>\n<div id=\"fs-idm105647824\" class=\"unnumbered\" data-type=\"exercise\">\n<header><\/header>\n<section>\n<div id=\"fs-idm71758576\" data-type=\"problem\">\n<div class=\"os-problem-container\">\n<p id=\"fs-idp18637136\">An unknown distribution has a mean of 45 and a standard deviation of eight. A sample size of 50 is drawn randomly from the population. Find the probability that the sum of the 50 values is more than 2,400.<\/p>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<div id=\"fs-idm133155984\" class=\"statistics textbox spreadsheet ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<header>\n<h3 class=\"os-title\" data-type=\"title\"><span class=\"os-title-label\">Using Google Sheets<br \/>\n<\/span><\/h3>\n<\/header>\n<section>\n<div class=\"os-note-body\">\n<p id=\"fs-idm82544544\">To find percentiles for sums using a spreadsheet, use the <a href=\"https:\/\/support.google.com\/docs\/answer\/3094022?hl=en\" target=\"_blank\" rel=\"noopener noreferrer\">NORM.INV<\/a> function. Type the following in a cell:<\/p>\n<p><code>=NORM.INV(k, n * mean, SQRT(n)*standard deviation)<\/code><\/p>\n<p id=\"fs-idm95058576\">where:<\/p>\n<ul id=\"fs-idm10652880\">\n<li><em data-effect=\"italics\">k<\/em> is the <em data-effect=\"italics\">k<\/em><sup>th<\/sup> <span id=\"term135\" data-type=\"term\">percentile<\/span><\/li>\n<li><em data-effect=\"italics\">mean<\/em> is the mean of the original distribution<\/li>\n<li><em data-effect=\"italics\">standard deviation<\/em> is the standard deviation of the original distribution<\/li>\n<li><em data-effect=\"italics\">sample size<\/em> = <em data-effect=\"italics\">n<\/em><\/li>\n<\/ul>\n<\/div>\n<\/section>\n<\/div>\n<div id=\"fs-idm82232896\" class=\"ui-has-child-title\" data-type=\"example\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.6<\/span><\/h3>\n<\/header>\n<section>\n<div class=\"body\">\n<div id=\"fs-idm90047616\" class=\"unnumbered\" data-type=\"exercise\">\n<header><\/header>\n<section>\n<div id=\"fs-idm70958544\" data-type=\"problem\">\n<div class=\"os-problem-container\">\n<p id=\"fs-idm5626528\">In a study reported Oct. 29, 2012 on the Flurry Blog, the mean age of tablet users is 34 years. Suppose the standard deviation is 15 years. The sample of size is 50.<\/p>\n<ol id=\"fs-idm90393840\" type=\"a\">\n<li>What are the mean and standard deviation for the sum of the ages of tablet users? What is the distribution?<\/li>\n<li>Find the probability that the sum of the ages is between 1,500 and 1,800 years.<\/li>\n<li>Find the 80<sup>th<\/sup> percentile for the sum of the 50 ages.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div id=\"fs-idp41767792\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\n<div class=\"ui-toggle-wrapper\"><\/div>\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.6<\/span><\/h4>\n<div class=\"os-solution-container\">\n<ol id=\"fs-idp6531984\" type=\"a\">\n<li><em>$\\mu_{\\sum x}=n\\mu_{x}=1700$ and $\\sigma_{\\sum x}=\\sqrt{n}~\\sigma_{x} = \\sqrt{50}\\cdot 15 = 106.07$<br \/>\n<\/em>The distribution is normal for sums by the central limit theorem.<\/li>\n<li><em data-effect=\"italics\">P<\/em>(1500 &lt; \u03a3<em data-effect=\"italics\">x<\/em> &lt; 1800) =<code><br \/>\n=NORM.DIST(1800,50*34, SQRT(50)*15,TRUE)-NORM.DIST(1500,50*34,SQRT(50)*15,TRUE)<\/code><br \/>\n=0.7974<\/li>\n<li>Let <em data-effect=\"italics\">k<\/em> = the 80<sup>th<\/sup> percentile.<br \/>\n<em data-effect=\"italics\">k<\/em> =<br \/>\n<code>=NORM.INV(0.80,50*34,SQRT(50)*15)<\/code><br \/>\n=1,789.3<\/li>\n<\/ol>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n<div id=\"fs-idp87073664\" class=\"statistics try ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Try It <\/span><span class=\"os-number\">6.6<\/span><\/h3>\n<\/header>\n<section>\n<div id=\"fs-idm67768928\" class=\"unnumbered\" data-type=\"exercise\">\n<header><\/header>\n<section>\n<div id=\"fs-idm16414224\" data-type=\"problem\">\n<div class=\"os-problem-container\">\n<p id=\"fs-idm98358608\">In a recent study reported Oct.29, 2012 on the Flurry Blog, the mean age of tablet users is 35 years. Suppose the standard deviation is ten years. The sample size is 39.<\/p>\n<ol id=\"fs-idp46009888\" type=\"a\">\n<li>What are the mean and standard deviation for the sum of the ages of tablet users? What is the distribution?<\/li>\n<li>Find the probability that the sum of the ages is between 1,400 and 1,500 years.<\/li>\n<li>Find the 90<sup>th<\/sup> percentile for the sum of the 39 ages.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<div id=\"fs-idm145402592\" class=\"ui-has-child-title\" data-type=\"example\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.7<\/span><\/h3>\n<\/header>\n<section>\n<div class=\"body\">\n<div id=\"fs-idm29377216\" class=\"unnumbered\" data-type=\"exercise\">\n<header><\/header>\n<section>\n<div id=\"fs-idm71536528\" data-type=\"problem\">\n<div class=\"os-problem-container\">\n<p id=\"fs-idm105248448\">The mean number of minutes for app engagement by a tablet user is 8.2 minutes. Suppose the standard deviation is one minute. Take a sample of size 70.<\/p>\n<ol id=\"fs-idm71334512\" type=\"a\">\n<li>What are the mean and standard deviation for the sums?<\/li>\n<li>Find the 95<sup>th<\/sup> percentile for the sum of the sample. Interpret this value in a complete sentence.<\/li>\n<li>Find the probability that the sum of the sample is at least ten hours.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div id=\"fs-idp2253552\" data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\n<div class=\"ui-toggle-wrapper\"><\/div>\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.7<\/span><\/h4>\n<div class=\"os-solution-container\">\n<ol id=\"fs-idm78488688\" type=\"a\">\n<li><em data-effect=\"italics\">\u03bc<sub>\u03a3x<\/sub><\/em> = <em data-effect=\"italics\">n\u03bc<sub>x<\/sub><\/em> = 70(8.2) = 574 minutes and <em data-effect=\"italics\">\u03c3<sub>\u03a3x<\/sub><\/em> $=\\sqrt{n}\\cdot \\sigma_x= \\sqrt{70}\\cdot 1 = 8.37$ minutes<\/li>\n<li>Let <em data-effect=\"italics\">k<\/em> = the 95<sup>th<\/sup> percentile.<br \/>\n<em data-effect=\"italics\">k<\/em> =<br \/>\n<code>=NORM.INV(0.95, 70*8.2,SQRT(70)*1)<\/code><br \/>\n= 587.76 minutes<br \/>\n<span data-type=\"newline\" data-count=\"1\"><br \/>\n<\/span>Ninety five percent of the sums of app engagement times are at most 587.76 minutes.<\/li>\n<li>ten hours = 600 minutes<br \/>\n<span data-type=\"newline\"><br \/>\n<\/span><em data-effect=\"italics\">P<\/em>(\u03a3<em data-effect=\"italics\">x<\/em> \u2265 600) =<br \/>\n<code>=1-NORM.DIST(600,70*8.2, SQRT(70)*1, TRUE)<\/code><br \/>\n<code id=\"14\"><\/code>= 0.0009<\/li>\n<\/ol>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n<div id=\"fs-idp10112640\" class=\"statistics try ui-has-child-title\" data-type=\"note\" data-has-label=\"true\" data-label=\"\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Try It <\/span><span class=\"os-number\">6.7<\/span><\/h3>\n<\/header>\n<section>\n<div id=\"fs-idp7815296\" class=\"unnumbered\" data-type=\"exercise\">\n<section>\n<div id=\"fs-idm10364128\" data-type=\"problem\">\n<div class=\"os-problem-container\">\n<p id=\"fs-idp93890400\">The mean number of minutes for app engagement by a table use is 8.2 minutes. Suppose the standard deviation is one minute. Take a sample size of 70.<\/p>\n<ol id=\"fs-idm85536928\" type=\"a\">\n<li>What is the probability that the sum of the sample is between seven hours and ten hours? What does this mean in context of the problem?<\/li>\n<li>Find the 84<sup>th<\/sup> and 16<sup>th<\/sup> percentiles for the sum of the sample. Interpret these values in context.<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<h2>Central Limit Theorem for Proportions<\/h2>\n<hr \/>\n<p>If you draw random samples of size $n$ from a population that has a proportion $p$, then as $n$ increases, the random variable $X_{\\hat p}$ consisting of proportions tends to be <span data-type=\"term\">normally distributed<\/span> and $X_{\\hat p} \\sim N \\left( p, \\sqrt{\\dfrac{p (1-p)}{n}}\\right)$<\/p>\n<p>For accuracy, we need to require $n\\cdot p &gt; 10$ and $n \\cdot (1-p) &gt; 10$.<\/p>\n<p><strong>The<\/strong> <span data-type=\"term\">central limit theorem for proportions<\/span> says that if you repeatedly draw samples of a given size (such as repeatedly rolling ten dice) and calculate the proportion of each sample (for example calculate the proportion of 1&#8217;s that show up), these sums tend to follow a normal distribution. As sample sizes increase, the distribution of means more closely follows the normal distribution. <strong>The normal distribution has a mean equal to the original population&#8217;s proportion and a standard deviation equal to the square root of the population proportion multiplied by it&#8217;s compliment 1-p divided by the sample size.<\/strong><\/p>\n<p>The random variable $X_{\\hat p}$ has the following <em data-effect=\"italics\">z<\/em>-score associated with it:<\/p>\n<ol type=\"a\">\n<li>$\\hat p$ is one proportion.<\/li>\n<li>$z = \\dfrac{\\hat p -\\mu_{\\hat p}}{\\sigma_{\\hat p}}$\n<ol type=\"i\">\n<li>$\\mu_{\\hat p} = p$<\/li>\n<li>$\\sigma_{\\hat p } = \\sqrt{\\dfrac{p(1-p)}{n}}$<\/li>\n<\/ol>\n<\/li>\n<\/ol>\n<p>So we could calculate the $z$-score as<\/p>\n<p>$z = \\dfrac{\\hat p -p}{ \\sqrt{\\dfrac{p(1-p)}{n}}}$<\/p>\n<div class=\"ui-has-child-title\" data-type=\"example\">\n<header>\n<h3 class=\"os-title\"><span class=\"os-title-label\">Example <\/span><span class=\"os-number\">6.7.5<\/span><\/h3>\n<\/header>\n<section>\n<div class=\"body\">\n<div class=\"unnumbered\" data-type=\"exercise\">\n<header><\/header>\n<section>\n<div data-type=\"problem\">\n<div class=\"os-problem-container\">\n<p>A die is rolled 50 times and the proportion of 3&#8217;s is calculated.<\/p>\n<ol type=\"a\">\n<li>What are the mean and standard deviation for the proportions?<\/li>\n<li>Find the probability that at least 17% of the rolls are 3&#8217;s<\/li>\n<\/ol>\n<\/div>\n<\/div>\n<div data-type=\"solution\" aria-label=\"show solution\" aria-expanded=\"false\">\n<div class=\"ui-toggle-wrapper\"><\/div>\n<section class=\"ui-body\" style=\"display: block; overflow: hidden;\" role=\"alert\">\n<h4 data-type=\"solution-title\"><span class=\"os-title-label\">Solution <\/span><span class=\"os-number\">6.7.5<\/span><\/h4>\n<div class=\"os-solution-container\">\n<ol type=\"a\">\n<li>\u00a0$\\mu_{\\hat p}=p = 1\/6 \u2248 0.1667$ and $\\sigma_{\\hat p}=\\sqrt{\\dfrac{p(1-p)}{n}}= \\sqrt{\\dfrac{0.1667 (0.8333)}{50}} = 0.0527$<\/li>\n<li>First, convert 17% (0.17)\u00a0 into a z-score<br \/>\n$z = \\dfrac{\\hat p -\\mu_{\\hat p}}{\\sigma_{\\hat p}}= \\dfrac{0.17 -0.1667}{ 0.0527} = 0.06$<br \/>\n<span data-type=\"newline\"><br \/>\n<\/span>$P( \\hat p \u2265 0.17) =$<br \/>\n<code>=1-NORM.S.DIST(0.06)<\/code><br \/>\n<code><\/code>= 0.4761<\/li>\n<\/ol>\n<\/div>\n<\/section>\n<\/div>\n<\/section>\n<\/div>\n<\/div>\n<\/section>\n<\/div>\n<\/div>\n","protected":false},"author":1,"menu_order":2,"template":"","meta":{"pb_show_title":"on","pb_short_title":"","pb_subtitle":"","pb_authors":[],"pb_section_license":""},"chapter-type":[],"contributor":[],"license":[],"class_list":["post-63","chapter","type-chapter","status-publish","hentry"],"part":57,"_links":{"self":[{"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/pressbooks\/v2\/chapters\/63","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/pressbooks\/v2\/chapters"}],"about":[{"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/wp\/v2\/types\/chapter"}],"author":[{"embeddable":true,"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/wp\/v2\/users\/1"}],"version-history":[{"count":20,"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/pressbooks\/v2\/chapters\/63\/revisions"}],"predecessor-version":[{"id":773,"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/pressbooks\/v2\/chapters\/63\/revisions\/773"}],"part":[{"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/pressbooks\/v2\/parts\/57"}],"metadata":[{"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/pressbooks\/v2\/chapters\/63\/metadata\/"}],"wp:attachment":[{"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/wp\/v2\/media?parent=63"}],"wp:term":[{"taxonomy":"chapter-type","embeddable":true,"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/pressbooks\/v2\/chapter-type?post=63"},{"taxonomy":"contributor","embeddable":true,"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/wp\/v2\/contributor?post=63"},{"taxonomy":"license","embeddable":true,"href":"https:\/\/textbooks.jaykesler.net\/introstats\/wp-json\/wp\/v2\/license?post=63"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}