BIOL/STAT 335 Sample Questions for Final Exam. 1. Eating out vegetarian. Independent random samples of 434 U.S. women and 747 U.S. men showed that 195 women and 276 men said that they sometimes order a restaurant dish “without mean, fish, or fowl” (vegetarian). Let p1 and p2 be the probabilities for women and men, respectively. Here p1-hat = 195/434, p1-tilde = 196/436, p2-hat = 276/747, p2-tilde = 277/749 Find a 95% confidence interval for (p1 – p2) 95% C.I. p1 – p2 = p1-tilde – p2-tilde +/- z_{.025}SE = (196/436 – 277/749) +/- (1.96)sqrt[(196/436)(240/436)/436 + (277/749)(472/749)/749] = .0797 +/- (1.96)(.0296) = .0797 +/- .0581. Or .022 < p1 – p2 < .138 Use this confidence interval to test at the 5% level of significance whether men and women eat out vegetarian the same percentage of times. State the null hypothesis H0: p1 – p2 = 0 State the alternative HA: p1 – p2 not= 0 Reject H0 because the C.I. does not contain p1 – p2 = 0. Conclusion of test Reject the Null State your conclusion in context of the problem There is sufficient evidence (P-value < .05) to conclude that men and women do not eat vegetarian in the same proportion. 2. Getting taller? Independent samples of 10 men aged 25-34 years and 15 men aged 45-54 years yields the following height data. Welch’s formula yields 21.7 Age n Mean SD 25-34 10 70.19 2.951 df = 21 45-54 15 68.58 3.543 Find the 95% confidence interval for the difference in height. mu1 – mu2 = (y1-bar – y2-bar) +/- t_{21,.025}SE = 1.61 +/- (2.080)sqrt[(2.951)^2/10 + (3.543)^2/15] = 1.61 +/- 2.718 Or -1.108 < mu1 – mu2 < 4.328 Test at the 5% level of significance, assuming the populations have equal standard deviations, to decide whether men aged 25-34 years are, on the average, taller than men aged 45-54 years. State the null hypothesis mu1 = mu2 State the alternative mu1 > mu2 Do we need to assume normality of the population to use this test? YES freedom df = 9 + 14 = 23 Pooled SD sp = sqrt[(9(2.951)^2 + 14(3.543)^2)/(9+14)] = 3.324 Degrees of SE = sp sqrt[1/10 + 1/15] = 3.324sqrt[.1667] = 1.357 Test statistic ts = {(y1-bar – y2-bar) – (mu1 – mu2)}/SE = 1.61/1.357 = 1.186 P-value (be as specific as possible) P-value = Pr(t > ts) = Pr(t > 1.186) = .1243 (using technology), or 0.10 < P-value < 0.20 (using Table 4) Conclusion of test Retain the Null Detailed conclusion (be specific in context of the problem): There is insufficient evidence (0.10 < P-value < 0.20) to conclude that men are getting taller. 3. Glaucoma and corneal thickness. A researcher measured corneal thickness (in microns) of eight patients who had glaucoma in one eye but not in the other. Patient 1 484 Normal Glaucoma 488 Difference -4 2 478 478 0 3 492 480 +12 4 444 426 +18 5 436 440 -4 6 398 410 -12 7 464 458 +6 8 476 460 +16 Mean 459 455 4 SD 31.31 27.69 10.74 Welch’s formula yields 13.79. At the 10% level of significance, use the t test to decide whether corneal thickness is greater in normal eyes than in eyes with glaucoma. State the null hypothesis mu1 = mu2 The two samples of data are paired State the alternative mu1 > mu2 Degrees of freedom df = n – 1 = 7 Do we need to assume normality of the population to use this test? YES The table from the book to use for this problem in on page Table 4 on page 618 Test statistic ts =(yd-bar – mud)/SEd = 4/(10.74/sqrt[8]) = 1.053 P-value (be as specific as possible) P-value = Pr(t > ts) = Pr(t > 1.053) = .16357 (using technology), or 0.10 < P-value < 0.20 (using Table 4) Conclusion of test Retain the Null Detailed conclusion (be specific in context of the problem): There is insufficient evidence (0.10 < P-value < 0.20) to conclude that corneal thickness is less in eye with glaucoma. 4. Glaucoma and corneal thickness again. Repeat problem above but use the sign test instead. A researcher measured corneal thickness (in microns) of eight patients who had glaucoma in one eye but not in the other. Patient Normal Glaucoma Difference 1 484 488 -4 2 478 478 0 3 492 480 +12 4 444 426 +18 5 436 440 -4 6 398 410 -12 7 464 458 +6 8 476 460 +16 Mean 459 455 4 SD 31.31 27.69 10.74 Welch’s formula yields 13.79. At the 10% level of significance, use the sign test to decide whether corneal thickness is greater in normal eyes than in eyes with glaucoma. State the null hypothesis mu1 = mu2 State the alternative mu1 > mu2 Do we need to assume normality of the population to use this test? NO Test statistic Bs = N+ = 4 (with nd = 7) The table from the book to use for this problem in on page Table 7 on page 625. P-value from the table (be as specific as possible) Off chart: P-value > .10 Calculate the exact P-value P-value = 7C4(.5)^4(1-.5)^3 + 7C5(.5)^5(1-.5)^2 + 7C6(.5)^6(1-.5)^1 + 7C7(.5)^7(1-.5)^0 = (35 + 21 + 7 + 1)/2^7 = 64/128 = .50 Conclusion of test Retain the Null Detailed conclusion (be specific in context of the problem): There is insufficient evidence (0.10 < P-value < 0.20) to conclude that corneal thickness is less in eye with glaucoma. 5. Glaucoma and corneal thickness again. Repeat problem above but use the Wilcoxon Signed-Rank test instead. A researcher measured corneal thickness (in microns) of eight patients who had glaucoma in one eye but not in the other. Patient 1 484 Normal Glaucoma 488 Difference Signed Rank -4 -1.5 2 478 478 0 XX 3 492 480 +12 +4.5 4 444 426 +18 +7 5 436 440 -4 -1.5 6 398 410 -12 -4.5 7 464 458 +6 +3 8 476 460 +16 +6 Mean 459 455 4 SD 31.31 27.69 10.74 Welch’s formula yields 13.79. At the 10% level of significance, use the Wilcoxon Signed-Rank test to decide whether corneal thickness is greater in normal eyes than in eyes with glaucoma. State the null hypothesis mu1 = mu2 State the alternative mu1 > mu2 Do we need to assume normality of the population to use this test? NO Test statistic W– = 7.5, W+ = 20.5, Ws = W+ = 20.5 (with nd = 7) The table from the book to use for this problem in on page Table 8 on page 626. P-value from the table (be as specific as possible) Off chart: P-value > .10 Conclusion of test Retain the Null Detailed conclusion (be specific in context of the problem): There is insufficient evidence (0.10 < P-value) to conclude that corneal thickness is less in eye with glaucoma. 6. Smoking and low birthweight babies. In a study done in 1971 on smoking among pregnant women and low birthweight, the following data was collected. TOTAL Low birthweight Normal birthweight Smokers 185 (143.81) 1891 (1932.19) 2076 Non-smokers 134 (175.19) 2395 (2353.81) 2529 TOTAL 319 4286 4605 Expected counts shown above in parentheses Perform a one-sided chi-square test at the 1% level of significance to determine whether low birthweights are associated with smoking. State the null hypothesis Smoking and birthweight are independent State the alternative Smoking and birthweight are related Do we need to assume normality of the population to use this test? NO (variable is categorical) Degrees of freedom df = (r-1)(c-1) = 1 Test statistic Chi-Sq = (185 – 143.81)^2/143.81 + …… + (2395 – 2353.81)^2/2535.81 = 11.798 + .8781 + 9.685 + 0.7208 = 23.081 P-value (be as specific as possible) (Off chart) P-value < 0.0001/2 = .00005 Conclusion of test Reject the Null Detailed conclusion (be specific in context of the problem): There is strong evidence (P-value < .00005) to conclude that smjoking and birthweight are related. 7. A study was conducted to determine whether dietary lipid content is important in the population growth of Chesapeake Bay copepods. Independent random samples of copepods were placed in containers with lipid-rich diatoms, bacteria, and leafy macroalgae. There were 12 containers total with four replicates per diet. Five egg bearing females were placed in each container. After 14 days, the number of copepods in each container were as follows. Sample Treatment Mean SD A Diatoms 426 467 438 497 457.00 31.74 B Bacteria 303 301 293 328 306.25 15.13 C Macroalgae 277 324 302 272 293.75 24.06 Calculate the pooled variance. sp^2 = {3(31.74)^2 + 3(15.13)^2 + 3(24.06)^2}/{3 + 3 + 3} = 605.11 Calculate the grand mean {4(457.00) + 4(306.25) + 4(293.75)}/12 = 352.3333 Complete the ANOVA table and carry out the F test at = 5%. Source df SS MS F Between treatments 2 66043 33021.5 Within treatments 9 5446 605.11 54.57 TOTAL 11 71489 Null hypothesis muA = muB =muC Alternative hypothesis Not all equal (at least one mu is different) = 54.58 Table from the book to use for this problem is on page Table 10 on page 629 Circle the correct decision: growth. Degrees of freedom 2 and 9 Test statistic Fs P-value (be as specific as possible) Off chart: P-value < .0001 Reject Null Detailed conclusion (be specific in context of the problem): Not all treatments result in the same mean 8. Use the Bonferroni method to construct a 95% confidence interval for the difference between the treatment means for treatments B and C and use this interval to state whether or not you feel they are indeed equal. Show all work. Here df = 9, sp^2 = 605.11, SEp = sqrt[sp^2/nB + sp^2/nC] = sqrt[605.11/4 + 605.11/4] = 17.39 Bonferroni Interval muB – muC = (yB-bar – yC-bar) +/- t_{9,.025/(3C2)}SE = 12.5 +/- (2.933)(17.39) = 12.5 +/- 51.01 Or -38.51 < muB – muC < 63.51. Note that this C.I. consistent with the null hypothesis that muB = muC at alpha = 5%, since the 95% Bonferroni confidence interval contains muB – muC = 0. 9. Age and body fat. Eighteen randomly selected adults were measured for percentage of body fat, using dual-photon absorptiometry. The following table shows the results of the measurements and the ages of the adults. Subject 1 2 3 4 5 6 x = Age 23 23 27 27 39 41 y = %Fat 9.5 27.9 7.8 17.8 31.4 25.9 Subject 7 8 9 10 11 12 x = Age 45 49 50 53 53 54 y = %Fat 27.4 25.2 31.1 34.7 42.0 29.1 Subject 13 14 15 16 17 18 x = Age 56 57 58 58 60 61 y = %Fat 32.5 30.3 33.0 33.8 41.1 34.5 For this sample we have x-bar = 46.33, y-bar = 28.61, sx = 13.22, sy = 9.14, r = .7918 and Σ (x – x-bar)(y – y-bar) = 1626.524,. SS(resid) = 529.736 Σ (x – x-bar)2 = 2971.063, Σ (y – y-bar)2 = 1420.173 Find an equation of the major axis Slope : m = sy/sx = 9.14/13/22 = .6914, Point: (x-bar,y-bar) = (46.33,28.61) Eqn: y – y-bar = b1(x – x-bar) or y – 28.61 = .6914(x – 46.33) or y = -3.42 + .6914x Find an equation of the regression line Slope : b1 = r(sy/sx) = (.7918)(9.14/13/22) = .5480, Point: (x-bar,y-bar) = (46.33,28.61) Eqn: y – y-bar = b1(x – x-bar) or y – 28.61 = .5480(x – 46.33) or y = 3.221 + .5480x Estimate the %Fat of a person whose age is 58 years. y-hat = 3.221 +(.5480)(58) = 35.00% Estimate the SD of the %Fat of a person aged 58 years s_{e} = sy(sqrt[1-r^2]sqrt[(n-1)/(2-1)] = sqrt[SS(resid)/(n-2)] = 5.754 Give a 95% confidence interval for the slope of the regression line beta1 = b1 +/- t_{16,.025}SE = .5480 +/- (2.120)(.1056) = .5480 +/- .2239 or .3241 < beta1 < .7719 Use linear regression to test the hypothesis that age has no effect on percentage body fat against the alternative that percentage body fat increases with age. Let alpha = 1%. H0: beta1 = 0, HA: beta1 > 0, alpha = .01, Test statistic ts = (b1 – beta1)/SE = (.5480 – 0)/.1056 = 5.19, P-value Off chart: P-value < .0005; Decision: Reject Null Detailed conclusion (be specific in context of the problem): There is strong evidence (P-value < .0005) to conclude that body fat increases with age. 10. The data in the table below are taken from a study on Parkinson’s disease, a disease that, among other things, affects a person’s ability to speak. Eight of the people in this study had received one of the most common operations to treat the disease, and the other fourteen received no operation. Subsequently, each patient was given a test to assess their speaking ability, and these scores are reported in the table (lower scores indicate more problems with speaking). Use alpha = 0.05 Use the Wilcoxon-Mann-Whitney test to decide whether there is a difference between the treated patients and the untreated patients. Note that Welch’s equation (7.1) yields df = 18.8. State the null hypothesis The population distributions are the same State the alternative The population distributions are different Do we need to assume normality of the population to use this test? NO Test statistic Here K1 = 90.5 and K2 = 21.5, so Us = 90.5 with n = 14 and n’ = 8 (check: K1 + K2 = 90.5 + 21.5 = 112, n*n’ = (14)(8) = 112) The table from the book to use for this problem in on page Table 6 on page 622 P-value from the table (be as specific as possible) .01 < P-value < .02 Conclusion of test (circle one) Reject the Null or Retain the Null Detailed conclusion (be specific in context of the problem): There is sufficient evidence (0.01 < P-value < 0.02) to conclude that the operation does improve a patient’s ability to speak. Table Number w/o operation that are smaller Pts. who had operation (ORDERED) 7 1.7 9.5 2.0 12,12 12,12 12.5 13.5 n’ = 8 K1 = 90.5 2.5,2.5 2.6,2.6 2.7 3.0 Pts. who didn’t have operation (ORDERED) 1.2 1.3,1.3,1.3 1.5,1.5 1.6 Number w/operation that are smaller 0 0,0,0 0,0 0 1.8,1.8 2.0 2.2 2.3 1,1 1.5 2 2 2.7 3.0 6.5 7.5 n=14 K2 = 21.5 How would you assess normality for the above data? Use the Anderson-Darling test or a normal probability plot MORE SAMPLE QUESTIONS 11. Six patients with renal disease underwent plasmapheresis. Urinary protein excretion (grams of protein per gram of creatinine) was measured for each patient before and after plasmapheresis. The data are given in the table below. Assume that the relevant population distribution(s) is (are) normally distributed. Note that the Welch’s Formula (Equation 7.1) yields 8.8 for these data. Patient 1 2 3 4 5 6 Mean SD Before 20.3 6.3 7.6 6.1 5.9 4.0 8.37 5.96 After 11.8 2.1 3.0 2.6 4.0 4.2 4.62 3.61 Difference 8.5 4.2 4.6 3.5 1.9 -0.2 3.75 2.92 a. Use these data to obtain a 98% confidence interval for the difference of the average protein excretion before and after plasmapheresis. Show all work. muA – muB = 3,75 +/- t_{5,.01}(2.92/sqrt[6]) = 3.76 +/- (3.365) (2.92/sqrt[6]) = 3.75 +/- 4.01 b. Use these data and = 2% to investigate whether or not plasmapheresis affects urinary protein excretion in patients with renal disease. Since muA – muB = 0 is included in the interval, we retain H0. There is insufficient evidence (P-value > .02) to conclude that there is a difference. 12. Use the sign test to test the same hypothesis. Calculated Test Statistic Bs = 5 Either [a] Use table from the book to use for this problem is on page 625 (Table 7) nondirectional P-value > .20 P-value from the table (be as specific as possible). This test is or [b] Calculate the exact P-value = 2(Pr{Y >= 5}) = 2(6C5(.5)^5(1-.5)^1 + 6C6(.5)^6}) = 14(.5)^6 = .2185 the Null Reject the Null Circle the correct decision: Fail to reject 13. Two drugs, zidovudine and didanosine, were tested for the effectiveness in preventing progression of HIV disease in children. In a double-blind clinical trial, 276 children with HIV were given zidovudine, 281 were given didanosine, and 274 were given zidovudine plus didanosine. The following table shows the survival data for the three groups. Died Survived Total Zidovudine Didanosine 17 (11.29) 259 (264.71) 276 7 (11.50) 274 (269.50) 281 Zidovudine & Didanosine 10 (11.21) 264 (262.79) 274 Total 34 797 831 Given that a patient received zidovudine, use these data to estimate the probability that (s)he died. 17/276 Given that a patient survived, use these data to estimate the probability that (s)he received (only) didanosine. 274/797 Use = 10% and these data to conduct a test of the null hypothesis that survival and treatment are independent. Null Hypothesis Survival and Treatment are independent Alternative Hypothesis Survival depends on Treatment Calculated Test Statistic (show all work) Chi-Sq = 2.885 + 1.759 + 0.131 + 0.123 + 0.075 + 0.006 = 4.978 Table from the book to use for this problem is on page 627 (Table 9) Degrees of Freedom 2 P-value (be as specific as possible) .05 < P-value < .10 (=alpha), reject H0 Circle the correct decision: Fail to reject the Null Reject the Null conclude that the survival rate is related to the treatment. Detailed conclusion: There is sufficient evidence (.05 < P-value < .10) to 14. In a study of the mutual effects of the air pollutants ozone and sulfur dioxide, Blue Lake snap beans were grown in open-top field chambers. Some chambers were fumigated repeatedly with sulfur dioxide. The air in some chambers was carbon-filtered to remove ambient ozone. In all, there were 4 treatments – labeled A (ozone absent, SO2 absent), B (ozone absent, SO2 present), C (ozone present, SO2 absent) and D (ozone present, SO2 present) – and there were three chambers per treatment combination (allocated at random). After one month of treatment, total yield (mg) of bean pods was recorded for each chamber, with results shown in the following table. The resulting data yielded SS(Between) = 54.68 and SS(Within) = 27.51. Chamber 1 2 3 Mean SD A 15.20 18.50 13.90 15.87 2.37 Treatment C 11.50 13.00 15.70 13.40 2.13 B 14.90 15.50 12.10 14.17 1.81 D 9.50 10.60 9.90 10.00 0.56 Complete the ANOVA table and carry out the F test at = 5%. Source df SS MS F 5.30 Between treatments 3 54.68 18.23 Within treatments 8 27.51 3.44 TOTAL 11 82.19 s_p = sqrt[3.44] = 1.8547 Null hypothesis muA = muB = muC = muD Alternative hypothesis At least one mean is not equal to others. Degrees of freedom 3 and 8 Test statistic Fs = 5.30 Table from the book to use for this problem is on page 630 Circle the correct decision: Reject Null P-value (be as specific as possible) .02 < P-value < .05 (= alpha), Reject H0. Fail to Reject Null Use the Bonferroni method to construct a 95% confidence interval for the difference between the treatment means for treatments C and D and use this interval to state whether or not you feel they are indeed equal. Show all work. Bonferroni Interval muC – muD = 3.40 +/- t_{8,.025/(4C2)}s_p{sqrt[1/3 + 1/3]} = 3.40 +/- 3.479(1.8547)sqrt[1/3+1/3] = 3.40 +/- 5.27. Since the interval contains muC = muD = 0, we retain equality. 15. Ten treatments were compared for their effect on the liver of mice. There were 13 animals in each treatment group. The ANOVA gave MS(within) = 0.5842. The mean liver weights are given in the table below. Group 1 2 3 4 5 6 7 8 9 10 Mean 2.59 2.28 2.34 2.40 2.59 2.84 2.29 2.45 2.76 2.37 n 13 13 13 13 13 13 13 13 13 13 Use the Bonferroni method with using = 5% to determine whether any pairs of means are significantly different. The sorted group means are as follows Treatment 4 2 7 Mean 2.07 2.28 2.29 3 2.34 10 2.37 5 2.40 8 2.45 1 2.59 9 2.76 6 2.84 The most extreme difference among groups occurs between groups 4 and 6, with means 2.84 and 2.07, respectively. We have df = (10)(13-1) =120. We use df = 100, spooled = sqrt[0.5842] = 0.7643, and the Bonferroni multiplier 2.871. The Bonferroni confidence interval for the difference of means between these two groups is (2.84 – 2.07) +/- (2.871)(spooled )sqrt[1/13 + 1/13] = 0.77 +/- (2.871)(0.7643) sqrt[1/13 + 1/13] = 0.77 +/- 0.8607. This confidence interval contains 0. Since even the largest difference among the groups is not significant, there is no significant difference among the group means. 16. In a study of the dietary treatment of anemia in cattle, researchers randomly divided 144 cows into four treatment groups. Group A was a control group, and groups B, C and D received different regimens of dietary supplementation with selenium. After a year of treatment, blood samples were drawn and assayed for selenium. The accompanying table shows the mean selenium concentrations (g/dl). The resulting data yielded a grand mean (y-bar-bar) of 4.17 and spooled = sqrt[MS(within)] = sqrt[ 2.071] = 1.439 Group A B C D Mean 0.8 5.4 6.2 5.0 n 36 36 36 36 It turns out that the test of A = B = C = D is rejected using = 5%. Use the Bonferroni method to compare all pairs of means at the 5% level. Summarize your findings using the “underline method” C B D A ______ ______ There are k = 6 pairs to compare and df = (4)(36 – 1) = 140. From Table 11, we find that the Bonferroni multiplier is 2.676. For each pair, the SE = (spooled )sqrt[1/36 + 1/36] = 0.3392. We obtain +/- (2.676)(0.3392) = +/-0.9077. The largest difference between means is that between C and A. We start with that pair. C vs A C vs D B vs A C vs B B vs D D vs A muC – muA = muC – muD = muB – muA = muC – muB = muB – muD = muD – muA = 5.4 +/- 0.9077. 1.2 +/- 0.9077. 4.6 +/- 0.9077. 0.8 +/- 0.9077. 0.4 +/- 0.9077. 4.2 +/- 0.9077. We reject equality of muC and muA. We reject equality of muC and muD. We reject equality of muB and muA. We do NOT reject equality of muC and muB. We do NOT reject equality of muB and muD. We reject equality of muD and muA. This is a summary of the underline method expressed in plain English. All diets are superior to A. While diet C produces the highest mean selenium concentration among cows in our sample, there is not enough evidence to claim C is the best. This is because we cannot conclude that C is superior to B as the C vs B interval contains zero. Similarly, we cannot conclude that B is superior to D as the B vs D interval contains zero. However, C is superior to D. 17. In a study of crop losses due to air pollution, plots of Blue Lake snap beans were grown in open-top field chambers, which were fumigated with various concentrations of sulfur dioxide (x, measured in ppm). After a month of fumigation, the plants were harvested and the total yield of bean pods (y, measured in kg) was recorded for each chamber. The results are summarized by n = 12, x-bar = .120, y-bar = 1.117, sx = .117, sy = .312 and the partial Minitab output: Regression Analysis: Yield versus SulfurDioxide Predictor Constant SulfurDio Coef 1.38810 -2.2619 S = 0.1719 SE Coef 0.07264 0.4421 R-Sq = 72.4% R-Sq(adj) = 69.6% [a] Calculate the equation of the linear regression of yield on SO2 concentration. Y = 1.38810 – 2.2619X [b] Give a measure of the variability of the yield measurements (with units) sy = .312 kg per chamber [c] Give a measure of the variability of the yield measurements about the regression line (with units) s_{e} = 0.1719 [d] Give a 95% confidence interval for 1 = -2.2619 +/- t_{10,.025}SE_{b1} = -2.2619 +/- 2.228(0.4421) = -2.2619 +/- .9850 [e] Give a biological a biological interpretation of b1: For each increase of 1 ppm SO2, there is a decrease in yeld of 2.2619 kg per chamber. [f] Perform a statistical test to decide whether the population correlation coefficient (), equivalently the slope of the regression line (1), differs from zero. Null hypothesis rho = 0 (beta1 = 0) Alternative rho not equal 0 (beta1 not equal 0) Calculated Test Statistic (show all work) (b1 – 0)/SE_{b1} = 2.2619/0.4421 = -5.12 Degrees of Freedom n-2 = 10 Table from the book to use for this problem is on page 618 P-value (be as specific as possible) P-value < 2(.0005) = .001 Circle the correct decision: Fail to reject the Null Reject the Null Assumptions we must make for this test . Each data pair (x,y) is independent of all other observations. For each X, the distribution of Y is normal. If for each X, we denote the population SD of Y by sigma_{Y|X}, then sigma_{Y|X} does not depend on X (that is, sigma_{Y|X} is constant for all X). If for each X we denote the mean of Y by mu_{Y|X}, then mu_{Y|X} is a linear function of X (that is, mu_{Y|X} = beat0 + (beta1)X). 18. Consider the following scores of six students on two tests in a statistics course. Student 1 2 3 4 5 6 7 Score on Score on first test second test 40 50 70 80 80 100 70 ----AV = 70 SD = 20 80 65 85 70 85 80 95 ----AV = 80 SD = 10 zx zy (zx)(zy) -1.5 -1.0 0.0 +0.5 +0.5 +1.5 0.0 0.0 -1.5 +0.5 -1.0 +0.5 0.0 +1.5 0.0 +1.5 0.0 -0.5 +0.25 0.0 0.0 sum = 0 sum = 0 sum = 1.25 b1 = rsy/sx = (1.25/6)(10/20) = .1042, b0 = y-bar – b1(x-bar) = 80 – (.1042)(70) = 72.71 Find the correlation coefficient r = sum(zx)(zy)/(n-1) = +1.25/6 = .208333 Find an equation of the regression line Y = .1042X + 72.71 19. To estimate forearm lengths of men, the following results were obtained for about 1000 men: Average height = 69 inches, SD for height = 2.5 inches, Average forearm length = 18 inches, SD = 1 inch; Correlation coefficient = .80. Estimate the length of the forearms of a man whose height is [a] unknown 18 + 1 Here we use y-bar and s_y. [b] 69 inches 18 + .6. Here we use y-bar(the regression line goes thru the point (x-bar,.y-bar))and s_{e} = sqrt[1-r^2](s_y) = sqrt[1 - .8^2](1) = 0.6 [c] 73 inches 19.28 + .6. Here we use the regression line and s_{e}. 1. x = 73, 2. z_x = (73-69)/2.5 = 1.6, 3. z_y = r z_x = (.8)(1.6) = 1.28, 4. y = 18 + (1.28)(1) = 18.28. [d] 66 inches 17.04 + .6
© Copyright 2024