Visualisation of Likert scale results
[This article was first published on Rcrastinate, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
I wrote a function to visualise results of Likert scale items. Please find the function below the post. Here is an example plot:Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
The function is called ‘plot.likert’ and takes the following arguments:
– vec: The vector with the raw results
– possible.values: A vector with all the possible values. This is sometimes important if not all possible responses were actually ticked by your participants. Defaults to all values found in vec.
– left: Annotation for the left side of the plot
– right: Annotation for the right side of the plot
– plot.median: Plot the median as a little line at the top? Defaults to FALSE.
– plot.sd: Plot the standard deviation around the mean? Defaults to TRUE.
– include.absolutes: Include the absolute values as small bold black numbers above the plot? Defaults to TRUE.
– include.percentages: Include the percentage values as blue numbers above the plot? Defaults to TRUE.
– own.margins: Override the default margins for the plot. Defaults to c(2,2,3,2).
– othermean: Plot another mean into the visualisation as grey line above the bars. I used this to compare the results to older results for the same questions. Defaults to NULL (no other mean is plotted in this case).
– …: Other parameters might be passed that are used for the first call to plot().
The example call for the plot shown above is:
plot.likert(sample(-2:2, 75, replace = T,
prob = c(0, .2, .2, .3, .3)),
left = “strongly disagree”,
right = “strongly agree”,
own.margins = c(2,2,5,2),
main = “I like this visualisation of Likert scale
results.”,
possible.values = -2:2,
othermean = 1.09)
NB: I know of all the stuff regarding the calculation of means on Likert scale items. However, it is still done a lot and you can also include the median after all…
Here’s the function:
plot.likert <- function (vec, possible.values = sort(unique(vec)), left = "linker Pol", right = "rechter Pol",
plot.median = F, plot.sd = T, include.absolutes = T, include.percentages = T, own.margins = c(2, 2, 3, 2),
othermean = NULL, …) {
tab <- table(vec)
if (length(tab) != length(possible.values)) {
values.not.in.tab <- possible.values[!(possible.values %in% names(tab))]
for (val in values.not.in.tab) {
tab[as.character(val)] <- 0
}
tab <- tab[sort(names(tab))]
}
prop.tab <- prop.table(tab) * 100
v.sd <- sd(vec, na.rm = T)
v.m <- mean(vec, na.rm = T)
v.med <- median(vec, na.rm = T)
old.mar <- par("mar")
par(mar = own.margins)
# Setting-up plot region
plot(x = c(min(as.numeric(names(tab)), na.rm = T) – 1.1, max(as.numeric(names(tab)), na.rm = T) + 1.1),
y = c(0, 100), type = “n”, xaxt = “n”, yaxt = “n”,
xlab = “”, ylab = “”, bty = “n”, …)
# Bars
rect(xleft = as.numeric(names(prop.tab)) – .4,
ybottom = 0,
xright = as.numeric(names(prop.tab)) + .4,
ytop = prop.tab,
border = “#00000000”, col = “#ADD8E6E6”)
# Lower black line
lines(x = c(min(as.numeric(names(tab)), na.rm = T) – .6, max(as.numeric(names(tab)), na.rm = T) + .6),
y = c(0, 0), col = “black”, lwd = 2)
# Upper black line
lines(x = c(min(as.numeric(names(tab)), na.rm = T) – .6, max(as.numeric(names(tab)), na.rm = T) + .6),
y = c(100, 100), col = “black”, lwd = 2)
# Blue lines
for (n.i in names(tab)) {
lines(x = c(n.i, n.i), y = c(0, 100), col = “blue”)
}
# Grey rectangles at sides
rect(xleft = min(as.numeric(names(tab)), na.rm = T) – 1.1,
ybottom = 0,
xright = min(as.numeric(names(tab)), na.rm = T) – .6,
ytop = 100,
border = “#00000000”, col = “grey”)
rect(xleft = max(as.numeric(names(tab)), na.rm = T) + .6,
ybottom = 0,
xright = max(as.numeric(names(tab)), na.rm = T) + 1.1,
ytop = 100,
border = “#00000000”, col = “grey”)
mtext(names(prop.tab), side = 1, at = names(prop.tab))
# Percentages and numbers at the top
if (include.percentages) mtext(paste(round(prop.tab, 0), “%”), side = 3, at = names(prop.tab), line = -.3, col = “blue”)
if (include.absolutes) mtext(tab, side = 3, at = names(tab), line = .5, cex = .8, font = 2)
# Mean line
lines(x = c(v.m, v.m), y = c(95, 85), lwd = 6, col = “blue”)
# Median line
if (plot.median) lines(x = c(v.med, v.med), y = c(95, 85), lwd = 4, col = “#00FF00AA”)
# Other mean line
if (!is.null(othermean)) lines(x = c(othermean, othermean), y = c(85, 75), lwd = 6, col = “#00000099”)
# SD lines
if (plot.sd) { arrows(x0 = c(v.m, v.m), x1 = c(v.m – v.sd, v.m + v.sd), y0 = c(90, 90), y1 = c(90, 90),
angle = 90, length = 0, lwd = 1) }
# Left label
mtext(left, side = 2, line = -.5)
# Right label
mtext(right, side = 4, line = -.5)
par(mar = old.mar)
}
To leave a comment for the author, please follow the link and comment on their blog: Rcrastinate.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.