Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

The post Arrange Data by Month in R with example appeared first on Data Science Tutorials

Arrange Data by Month in R, To easily arrange data by month, use the floor_date() function from the lubridate package in R.

The following is the fundamental syntax for this function.

library(tidyverse)
df %>%
group_by(month = lubridate::floor_date(date_column, 'month')) %>%
summarize(sum = sum(value_column))

The example below demonstrates how to utilize this function in practice.

How to make a rounded corner bar plot in R? – Data Science Tutorials

## Example: Arrange Data by Month in R

Assume we have the following R data frame, which indicates total sales of a particular item on various dates:

make a data frame

df <- data.frame(date=as.Date(c('1/5/2022', '1/10/2022', '2/12/2022', '2/18/2022',
'3/15/2022', '3/20/2022', '3/30/2022'), '%m/%d/%Y'),
sales=c(22, 11, 32, 14, 15, 22, 33))

Let’s view the data frame

Quantiles by Group calculation in R with examples – Data Science Tutorials

df
date   sales
1 2022-01-05    22
2 2022-01-10    11
3 2022-02-12    32
4 2022-02-18    14
5 2022-03-15    15
6 2022-03-20    22
7 2022-03-30    33

To determine the total sales for each month, we can use the following code.

library(tidyverse)

Let’s group data by month and sum sales

df %>%
group_by(month = lubridate::floor_date(date, 'month')) %>%
summarize(sum_of_sales = sum(sales))
month      sum_of_sales
<date>            <dbl>
1 2022-01-01           33
2 2022-02-01           46
3 2022-03-01           70

We can observe the following from the output.

Free Best Online Course For Statistics – Data Science Tutorials

In January, a total of 33 sales were made.

In February, a total of 46 sales were made.

In March, a total of 70 sales were made.

We can also use another metric to aggregate the data.

We may, for example, calculate the maximum sales on a single day, categorised by month.

library(tidyverse)

Now we can group data by month and find max sales

df %>%
group_by(month = lubridate::floor_date(date, 'month')) %>%
summarize(max_of_sales = max(sales))
month      max_of_sales
<date>            <dbl>
1 2022-01-01           22
2 2022-02-01           32
3 2022-03-01           33

We can observe the following from the output.

Hypothesis Testing Examples-Quick Overview – Data Science Tutorials

In January, the highest number of sales in a single day was 22.

In February, the highest number of sales in a single day was 32.

In March, the highest number of sales made in a single day was 33.

Within the summary() function, you can use any measure you choose.

The post Arrange Data by Month in R with example appeared first on Data Science Tutorials