How to Join Multiple Data Frames in R

[This article was first published on Data Science Tutorials, and kindly contributed to R-bloggers].

The post How to Join Multiple Data Frames in R appeared first on Data Science Tutorials

When working in R, you will often need to join several data frames. Fortunately, the left_join() function from the dplyr package makes this simple to accomplish.

library(dplyr)

For instance, let’s create the following three data frames:

df1 <- data.frame(Q1 = c('a', 'b', 'c', 'd', 'e', 'f'),
                  Q2 = c(152, 514, 114, 218, 322, 323))
df2 <- data.frame(Q1 = c('a', 'a', 'a', 'b', 'b', 'b'),
                  Q3 = c(523, 324, 233, 134, 237, 141))
df3 <- data.frame(Q1 = c('P1', 'e', 'P2', 'g', 'P5', 'i'),
                  Q4 = c(323, 224, 333, 324, 237, 441))

We can easily conduct two left joins, one after the other, to combine all three data frames.

Join the three data frames on the shared Q1 column:

df1 %>%
  left_join(df2, by='Q1') %>%
  left_join(df3, by='Q1')
   Q1  Q2  Q3  Q4
1   a 152 523  NA
2   a 152 324  NA
3   a 152 233  NA
4   b 514 134  NA
5   b 514 237  NA
6   b 514 141  NA
7   c 114  NA  NA
8   d 218  NA  NA
9   e 322  NA 224
10  f 323  NA  NA
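
The result has ten rows rather than six because a left join repeats each left-hand row once for every matching row on the right, and keeps unmatched rows with NA. As a rough check, the sketch below predicts the row count of the first join from the key frequencies in df2. It repeats the df1 and df2 definitions so that it runs on its own; key_counts, n_matches, n_out and expected_rows are illustrative names, not part of the original post.

```r
library(dplyr)

# Same data as above, repeated so this block is self-contained
df1 <- data.frame(Q1 = c('a', 'b', 'c', 'd', 'e', 'f'),
                  Q2 = c(152, 514, 114, 218, 322, 323))
df2 <- data.frame(Q1 = c('a', 'a', 'a', 'b', 'b', 'b'),
                  Q3 = c(523, 324, 233, 134, 237, 141))

# How many rows of df2 carry each key value
key_counts <- count(df2, Q1, name = "n_matches")

# Each df1 row yields one output row per match, or one row if unmatched
expected_rows <- df1 %>%
  left_join(key_counts, by = "Q1") %>%
  mutate(n_out = ifelse(is.na(n_matches), 1L, n_matches)) %>%
  summarise(total = sum(n_out)) %>%
  pull(total)

joined <- left_join(df1, df2, by = "Q1")

# The predicted and actual row counts should agree
nrow(joined) == expected_rows
```

The second join with df3 adds no further rows, since only the key 'e' matches and it appears once in df3.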

The result of this join can also be saved as a new data frame.

Join the three data frames and store the result in a new data frame called alldata:

alldata <- df1 %>%
  left_join(df2, by='Q1') %>%
  left_join(df3, by='Q1')

Display a summary of the resulting data frame:

glimpse(alldata)
Rows: 10
Columns: 4
$ Q1 <chr> "a", "a", "a", "b", "b", "b", "c", "d", "e", "f"
$ Q2 <dbl> 152, 152, 152, 514, 514, 514, 114, 218, 322, 323
$ Q3 <dbl> 523, 324, 233, 134, 237, 141, NA, NA, NA, NA
$ Q4 <dbl> NA, NA, NA, NA, NA, NA, NA, NA, 224, NA
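
With more than three data frames, writing one left_join() call per frame becomes repetitive. One possible alternative, which is not from the original post, is to collect the data frames in a list and fold over it with base R's Reduce(). The sketch below assumes every data frame shares the Q1 key column; frames and alldata_reduce are hypothetical names.

```r
library(dplyr)

# Same data as above, repeated so this block is self-contained
df1 <- data.frame(Q1 = c('a', 'b', 'c', 'd', 'e', 'f'),
                  Q2 = c(152, 514, 114, 218, 322, 323))
df2 <- data.frame(Q1 = c('a', 'a', 'a', 'b', 'b', 'b'),
                  Q3 = c(523, 324, 233, 134, 237, 141))
df3 <- data.frame(Q1 = c('P1', 'e', 'P2', 'g', 'P5', 'i'),
                  Q4 = c(323, 224, 333, 324, 237, 441))

# Put the data frames in join order; the first one is the left-most table
frames <- list(df1, df2, df3)

# Reduce() applies the join pairwise from left to right:
# left_join(left_join(df1, df2), df3)
alldata_reduce <- Reduce(function(x, y) left_join(x, y, by = "Q1"), frames)

# Compare against the explicit chain of two joins
alldata <- df1 %>%
  left_join(df2, by = "Q1") %>%
  left_join(df3, by = "Q1")
isTRUE(all.equal(alldata_reduce, alldata))
```

Because the fold runs left to right, the order of the list matters: the first element determines which rows are kept.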
