Data analysis with ProjectTemplate in RStudio with Git

[This article was first published on data prone - R, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Abstract

ProjectTemplate is a system for organizing and automating data analysis in R. Here is a recipe for creating a new ProjectTemplate project in RStudio and initializing version control with Git.

Order of Operations

I find that the following sequence is most efficient and error-proof:

  1. Create ProjectTemplate project
  2. Create RStudio project from directory created in Step 1
  3. Create a new Git repo at GitHub (or some other git hosting service)
  4. At command line, initialize git repo, add files, commit, and push to remote host
  5. Restart RStudio

Create ProjectTemplate project

At the console in RStudio, type

library('ProjectTemplate')
create.project('letters')

This will create the folder letters in your working directory with the ProjectTemplate file structure. The create.project() command also changes your working directory to this new ProjectTemplate project directory.

Create RStudio project

Select New Project... from the File menu in RStudio. In the New Project dialogue, select Existing Directory:

Screenshot of RStudio Create Project dialogue

Browse to the newly created directory (letters in the example) and click Create Project. A new sesson of RStudio will open and the new project will be loaded. (Note that the Git pane within RStudio will not be available until after the final step.)

Create a new repo at Github

Create a new repo at Github. In the current example, it would be named “letters.” Do not add any files (such as a README, .gitignore, or license) to the repo when creating it.

Initialize git repo and push to remote host

At the command line inside the project directory, type the following commands to initialize the git repo, add the newly created template files, make your first commit, and push the commit to the remote host. (Note that you will need to fill in your username and replace the repo name.)

git init
git add .
git commit -m "initialize project"
git remote add origin [email protected]:username/letters.git
git push -u origin master

Restart RStudio

The final step is to restart RStudio. The Git pane will appear automatically, and a .gitignore file will be added to your project. (I generally add the cache directory to .gitignore.)

To leave a comment for the author, please follow the link and comment on their blog: data prone - R.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)