Introduction to ggraph: Nodes

[This article was first published on Data Imaginist - R posts, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

This is the second post in my series of ggraph introductions. The
first post introduced
the concept of layouts, which is simply a specification on how nodes should be
placed on a plane. This post will dive into how the nodes are drawn, once a
layout has been calculated.


Nodes in a network are the entities that are connected. Sometimes these are also
referred to as vertices, but ggraph has opted for this nomenclature and use it
consistently. While the nodes in a graph are the abstract concepts of entities,
and the layout is their physical placement, the node geoms is the visual
manifestation of the entities. Conceptually one can simply think of it in terms
of a scatter plot — the layout provides the x, and y coordinates and these can
be used to draw nodes in different ways in the plotting window. Actually, due
to the design of ggraph the standard scatterplot-like geoms from ggplot2
can be used directly for plotting nodes:

gr <- graph_from_data_frame(highschool)

ggraph(gr, layout = 'kk') + 
    geom_point(aes(x=x, y=y))


The reason this works is that, as discussed in the previous post, layouts return
a data.frame of node positions and metadata and this is used as the default
plot data:

head(createLayout(gr, layout = 'kk'))
#>            x          y name circular ggraph.index
#> 1  0.2782438  2.4944195    1    FALSE            1
#> 2  0.1365268  3.1063039    2    FALSE            2
#> 3  0.9329938  3.2168940    3    FALSE            3
#> 4 -2.5457734 -1.5139415    4    FALSE            4
#> 5 -2.8447634 -0.2267242    5    FALSE            5
#> 6 -2.9897376  1.7369304    6    FALSE            6


While usage of the default ggplot2 is absolutely allowed, ggraph comes with
its own set of node geoms. Many of these are direct translations of ggplot2
own geoms like geom_point() so one could wonder why bother to use them.

The first reason is to provide clear code. It is not apparent anywhere that the
standard geoms are addressing the nodes and using geom_node_*() makes it clear
that this layer will draw nodes.

The second reason is that it will save typing.
Since ggraph are in control of the shape of the input data through the layout
calculations, it knows that x and y position is encoded in an x and y
column. This means that geom_node_* can default the x and y aesthetics so
there’s no need to type them:

ggraph(gr, layout = 'kk') + 


sometimes there is a need for addressing the x and y aesthetics, which is still
possible, for instance if a partition layout should be inverted:

gr <- graph_from_data_frame(flare$edges, vertices = flare$vertices)

ggraph(gr, layout = 'partition') + 
    geom_node_tile(aes(y = -y, fill = depth))


of course this could also be accomplished by reversing the y-axis using
scale_y_reverse() so this is just to illustrate that the defaults are easily
overwritten if needed.

The third reason is for the added functionality. All ggraph geoms gets a
filter aesthetic that allows you to quickly filter the input data. The use of
this can be illustrated when plotting a tree:

ggraph(gr, layout = 'dendrogram', circular = TRUE) + 
    geom_edge_diagonal() + 
    geom_node_point(aes(filter = leaf)) + 


In the above plot only the terminal nodes are drawn by filtering on the logical
leaf column provided by the dendrogram layout.

The different node geoms

The usual suspects are of course provided in the form of geom_node_point()
(showcased above), geom_node_text(), and geom_node_label(). These works as
expected, taking in the usual aesthetics (plus filter). Only x and y are
defaulted so everything else must be provided e.g. label which does not defaults
to the name column like is done in igraph. One feature sets
geom_node_text() and geom_node_label() from their ggplot2 counterparts:
both have a repel argument that, when set to TRUE will use the repel
functionality provided by the
ggrepel package to avoid
overlapping text.

Apart from these three geoms there’s a set of geoms mainly useful for spatial
node layouts such as treemaps, partition, and circle packing. geom_node_tile()
is the ggraph counterpart to ggplot2s geom_tile() while
geom_node_circle() and geom_node_arc_bar() maps to ggforces
geom_circle() and geom_arc_bar(). Collective for these is that the spatial
dimensions of the geoms (e.g. radius, width, and height) are precalculated by
their intended layouts and defaulted be the geoms:

ggraph(gr, layout = 'treemap', weight = 'size') + 
    geom_node_tile(aes(fill = depth))


all spatial node geoms will be center-based, meaning that the x and y value of
the layout will refer to the center of the layout and not e.g. the bottom-left
corner. This makes it easier to add labels to spatial layouts as well as using
spatial layouts in a non-spatial way:

l <- ggraph(gr, layout = 'partition', circular = TRUE)
l + geom_node_arc_bar(aes(fill = depth)) + 


l + geom_edge_diagonal(aes(width = ..index.., alpha = ..index..), lineend = 'round') + 
    scale_edge_width(range = c(0.2, 1.5)) + 
    geom_node_point(aes(colour = depth)) + 


More node geoms are sure to appear in ggraph with time but they will generally
be quite easily comprehensible due to their strong assemblance to the standard
ggplot2 geoms. After all it is just points on a plane…

More to come

This concludes our tour of the different ways to draw nodes in ggraph. Next up
is edges and it is fair to say that this is where it really gets exciting. Stay

To leave a comment for the author, please follow the link and comment on their blog: Data Imaginist - R posts. offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)