Nice ggplot with sad data: something happens with women in science

March 14, 2018

(This article was first published on long time ago..., and kindly contributed to R-bloggers)

   Last March 8, millions of women in more than 170 countries around the world joined street protests calling for “a society free of sexist oppression, exploitation and violence”. Spanish strike was one of the most numerous, where around 5.3 million women joined the strike. Today, I will use some data (download) regarding the scientific career in Spain (2016) to make a plot using ggplot2 and discuss about it. Here you have the graph and the code:

dat <- read.csv("wis.csv")
dat$pos <-
factor(dat$pos, levels = c("pre", "post", "ryc", "ct", "ic" , "pi"))
ggplot(dat, aes(x = pos, y = per)) +
geom_line(size = 3, aes(group = sex, color = sex)) +
geom_point() +
geom_text(aes(label = round(per * 100, 1)), vjust = -1, size = 4) +
title = "Women in Science \n Spanish National Research Council",
x = "research stage",
y = "sex ratio",
color = " \n",
subtitle = "(2016)"
) +
scale_color_manual(labels = c("women", "men"),
values = c("purple", "orange")) +
breaks = c("pre", "post", "ryc", "ct", "ic" , "pi"),
labels = c(
"Ramón y Cajal",
"Profesor de\nInvestigación"
) +
scale_y_continuous(labels = percent) +
text = element_text(size = 15),
legend.text = element_text(size = 15),
axis.text.x = element_text(face = "bold", size = 11, angle = 30),
axis.title = element_text(size = 14, face = "bold"),
plot.title = element_text(hjust = 0.5, face = "bold", size = 20),
plot.subtitle = element_text(hjust = 0.5, size = 19)
In X axis, we can see the stages at the Spanish National Research Council (known in Spanish as CSIC) positions, from young PhD candidates (left – “Predoctoral”) to the highest level, Research Professor (right – “Profesor de Investigación”). It’s easy to understand: in Spain, there are more women starting a scientific career than men, but only few of them get to the highest positions in CSIC.

We usually call this the “scissors graph”, since the two curves soon cross each other changing the trend shown at the beginning. For the last stages this imbalance between males and females can be due to the age (mean age of Research Professor in CSIC is around 58 years old, and things have changed a lot in Spain since 30 years ago). However, the graph shows that for the earlier stages this imbalance is still being a problem: just from PhD candidate stage to first postdoctoral position, women lose around 15% positions in favor of men. And this is not only happening in Spain. Only active politics and special measures orientated to these first stages could change the trend for the highest positions in the future!

If you want to know a little bit more about women in science in Spain, here you have a poster made by some friends from the Ecology Department in Universidad de Alcalá.

To leave a comment for the author, please follow the link and comment on their blog: long time ago.... offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...

If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Comments are closed.

Search R-bloggers


Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)