Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Answers to the exercises are available here.
Exercise 1
Download and read into R all data sets from Canadian Occupational Projection System (COPS) – 2015 to 2024 projections.
Exercise 2
Load library tidyr. Use gather
to rearrange any occupation related data set that present time series data into a tidy data format.
Exercise 3
Use gather
to rearrange ALL other occupation related data sets that present time series data into a tidy data format, and pile out them in a unique data frame.
Exercise 4
Remove lines that present NA values, columns in French, and the “X” in front every year. Take a look at your tidy data set.
- Learn indepth how to work with dplyr
- Get a full introduction to the data.table package
- And much more
Exercise 5
Let’s do the same with industries data sets. Start by taking one of the industries data sets that present data in a time series an use gather
.
Exercise 6
Do the same procedure of exercise 5 to all other industries data sets. Pile out them in a new data frame.
Exercise 7
Remove NAs, and French columns. In addition, set year and value as numeric, and take a look at your new tidy data set about industries.
Exercise 8
Find out the industries that have que lowest number of jobseekers, and create a new data set by sub setting the previous one.
Exercise 9
Plot the recently create data set using a line for each industry.
Exercise 10
Create a similar plot for the top 5 occupations in terms of low amount of jobseekers.
Related exercise sets:
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.