Reading notes on Pro Git by Scott Chacon

[This article was first published on Maëlle's R blog on Maëlle Salmon's personal website, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

As mentioned about a million times on this blog, last year I read Git in practice by Mike McQuaid and it changed my life – not only giving me bragging rights about the reading itself. 😅 I decided to give Pro Git by Scott Chacon a go too. It is listed in the resources section of the excellent “Happy Git with R” by Jenny Bryan, Jim Hester and others. For unclear reasons I bought the first edition instead of the second one.

Git as an improved filesystem

In the Chapter 1 (Getting Started), I underlined:

“[a] mini filesystem with some incredibly powerful tools built on top of it”.

Awesome diagrams

One of my favorite parts of the book were the diagrams such as the one illustrating “Git stores data as snapshots of the project over time”.

A reminder of why we use Git

“after you commit a snapshot into Git, it is very difficult to lose, especially if you regularly push your database to another repository.”

The last chapter in the book, “Git internals”, include a “Data recovery” section about git reflog and git fsck.

One can negate patterns in .gitignore

I did not know about this pattern format. In hindsight it is not particularly surprising.

git log options

The format option lets one tell Git how to, well, format the log. I mostly interact with the Git log through a GUI or the gert R package, but that’s good to know.

The book also describes how to filter commits in the log (by date, author, committer). I also never do that with Git itself, but who knows when it might become useful.

A better understanding of branches

I remember reading “branches are cheap” years ago and accepting this as fact without questioning the reason behind the statement. Now thanks to reading the “Git Branching” chapter, but also Julia Evans’ blog post “git branches: intuition & reality”, I know they are cheap because they are just a pointer to a commit.

Likewise, the phrase “fast forward” makes more sense after reading “Git moves the pointer forward”.

The “Git Branching” chapter is also a place where diagrams really shine.

Is rebase better than merge

“(…) rebasing makes for a cleaner history. If you examine the log of a rebased branch, it looks like a linear history.”

“Rebasing replays changes from one line of work onto another in the order they were introduced, whereas merging takes the endpoints and merges them together.”

Reading this reminds me of the (newish?) option for merging pull requests on GitHub, “Rebase and merge your commits” – as opposed to merge or squash&merge.

Don’t rebase commits and force push to a shared branch

The opportunity to plug another blog post by Julia Evans, “git rebase: what can go wrong?”.

The chance of getting the same SHA-1 twice in your repository

“A higher probability exists that every member of your programming team will be attacked and killed by wolves in unrelated incidents on the same night.”

Ancestry references

The name of references with “^” or “~” (or both!) is “ancestry references”. Both HEAD~3 and HEAD^^^ are “the first parent of the first parent of the first parent”.

Double dot and triple dot

I do not intend to try and learn this but…

The double-dot syntax “asks Git to resolve a range of commits that are reachable from one commit but aren’t reachable from another”.

The triple-dot syntax “specifies all the commits that are reachable by either of the two references but not by both of them”.

New-to-me aspects of git rebase -i

git rebase -i lists commits in the reverse order compared to git log. I am not sure why I did not make a note of this before.

I had not really realized one could edit single commits in git rebase -i. When writing “edit”, rebasing will stop at the commit one wants to edit. That strategy is actually featured in the GitHub blog post “Write Better Commits, Build Better Projects”.

Plumbing vs porcelaine

I had seen these terms before but never taken the time to look them up. Plumbing commands are the low-level commands, porcelain commands are the more user-friendly commands. At this stage, I do not think I need to be super familiar with plumbing commands, although I did click around in a .git folder out of curiosity.

Removing objects

There is a section in the “Git internals” chapter called “Removing objects”. I might come back to it if I ever need to do that… Or I’d use git obliterate from the git-extras utilities!

Conclusion

Pro Git was a good read, although I do wish I had bought the second edition. I probably missed good stuff because of this! My next (and last?) Git book purchase will be Julia Evans’ new Git zine when it’s finished. I can’t wait!

To leave a comment for the author, please follow the link and comment on their blog: Maëlle's R blog on Maëlle Salmon's personal website.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)