X-Git-Url: https://code.communitydata.science/stats_class_2019.git/blobdiff_plain/dbde6a6880af749afd92848e06128fe161159d0b..07da118471a31405d2044b68b4eb30334dea9d8b:/problem_sets/week_03/ps3-worked_solution.Rmd diff --git a/problem_sets/week_03/ps3-worked_solution.Rmd b/problem_sets/week_03/ps3-worked_solution.Rmd index 76e2c63..35756c3 100644 --- a/problem_sets/week_03/ps3-worked_solution.Rmd +++ b/problem_sets/week_03/ps3-worked_solution.Rmd @@ -29,7 +29,7 @@ Now, I'll go ahead and load the CSV file into R. As with last week, I'll do this ### list.files("data/week_03") # just take a look around ### w3.data <- read.csv("data/week_03/group_01.csv") -w3.data <- read.csv(url("https://communitydata.cc/~ads/teaching/2019/stats/data/week_03/group_02.csv")) +w3.dtata <- read.csv(url("https://communitydata.cc/~ads/teaching/2019/stats/data/week_03/group_02.csv")) ``` ### PC3. Get to know your data! @@ -143,7 +143,7 @@ Inspecting the first few values returned by `head()` gave you a clue. Rounded to I can create a table comparing the sorted rounded values to check this. ```{r} -table(sort(round(w2.data, 6)) == sort(round(w3.data$x, 6))) +table(round(w2.data,6) == round(w3.data$x,6)) ``` Can you explain what each piece of that last line of code is doing? @@ -200,7 +200,7 @@ head(w3.data) lapply(w3.data, summary) ### Run this line again to assign the new dataframe to p -p <- ggplot(w3.data, aes(x=x, y=y)) +p <- ggplot(data=w3.data, mapping=aes(x=x, y=y)) p + geom_point(aes(color=j, size=l, shape=k)) ``` @@ -302,6 +302,6 @@ choose(10,2)*0.07^2*0.93^8 (d) Is random assignment to tents likely to ensure $\leq1~arachnophobe$ per tent? - Random assignment and the independence assumption means that the answer to part c is the inverse of the outcome we're looking to avoid: $P(\gt1~arachnophobes) = 1-P(\leq1~arachnophobe)$. So, $P(\gt1~arachnophobes) = 1-0.84 = 0.16 = 16\%$. Those are the probabilities, but the interpretation really depends on how confident the camp counselor feels about a $16\%$ chance of having multiple arachnophobic campers in one of the tents. + Random assignment and the independence assumption means that the answer to part c is the complement of the outcome we're looking to avoid: $P(\gt1~arachnophobes) = 1-P(\leq1~arachnophobe)$. So, $P(\gt1~arachnophobes) = 1-0.84 = 0.16 = 16\%$. Those are the probabilities, but the interpretation really depends on how confident the camp counselor feels about a $16\%$ chance of having multiple arachnophobic campers in one of the tents.