Site icon R-bloggers

AP Statistics R Programming Assignment 2

[This article was first published on R – Saturn Science, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Our second assignment involves students downloading an Excel file from the internet and importing it into the RStudio environment. They will then write code to give summary statistics and the five-number summary.

The Sharp Sight website has a nice explanation of using ggplot for creating scatter plots. Go here to read the entire article.

This post will be part two for the students to check their code against mine and to make any final adjustments before submitting their assignment.

# Assignment 2
# Follow the usual procedures for code submission.

library(readxl) # Download this package if you dont have it already

# Go to import dataset, then "From Excel" and find you file "backpack" on you computer.

backpack <- read_excel("//ais-main/users/kevin-smith/Desktop/backpack.xls")
View(backpack)

# you need to make it a data.frame first with this command.
as.data.frame(backpack)

# don't forget to load ggplot, dplyr and/or tidyverse.

summary(backpack) # gives summary statistics

fivenum(backpack$boyweight) # because this is a data.frame, I need to use the "$" to indicate which column to analyse.
fivenum(backpack$packweight)

library(ggplot2)

scatter145=ggplot(data=backpack, aes(boyweight,packweight)) +   
  geom_point() 

scatter145 # this just give a basic scatterplot

# now we add some color
scatter145b=ggplot(data=backpack, aes(boyweight,packweight,colour=body.wt)) +   
  geom_point()
scatter145b

# Next we add a small color box.
scatter145c = scatter145b+ geom_point(size=2) +  
  xlab("Body Weight (lb)") +  
  ylab("Pack weight (lb)") +   
  ggtitle("Backpack Weight")

scatter145c


# Here we will add the confidence intervals and regression line with the "lm" command
# lm means linear model
scatter145d=scatter145c+ geom_point(size=3) + 
  xlab("Body Weight (lb)") +  
  ylab("Pack weight (lb)")+ 
  ggtitle("Backpack Weight")+ 
  geom_smooth(method = "lm")


scatter145d

Go here to read the article on creating another scatter plot.

Sharp Sight

 

To leave a comment for the author, please follow the link and comment on their blog: R – Saturn Science.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.