For anyone looking for job opportunities, it is nice to have an idea how the job market will perform in the future for your chosen career or industry. Many countries have open data sets that offer this kind of data. In these exercises we will use R to analyze the future perspective of Canadian labour market.
Answers to the exercises are available here.
Download and read into R all data sets from Canadian Occupational Projection System (COPS) – 2015 to 2024 projections.
Load library tidyr. Use
gather to rearrange any occupation related data set that present time series data into a tidy data format.
gather to rearrange ALL other occupation related data sets that present time series data into a tidy data format, and pile out them in a unique data frame.
Remove lines that present NA values, columns in French, and the “X” in front every year. Take a look at your tidy data set.
- Learn indepth how to work with dplyr
- Get a full introduction to the data.table package
- And much more
Let’s do the same with industries data sets. Start by taking one of the industries data sets that present data in a time series an use
Do the same procedure of exercise 5 to all other industries data sets. Pile out them in a new data frame.
Remove NAs, and French columns. In addition, set year and value as numeric, and take a look at your new tidy data set about industries.
Find out the industries that have que lowest number of jobseekers, and create a new data set by sub setting the previous one.
Plot the recently create data set using a line for each industry.
Create a similar plot for the top 5 occupations in terms of low amount of jobseekers.