**DataScience+**, and kindly contributed to R-bloggers)

This is the first post of a series that will look at how to create graphics in R using the plot function from the base package. There are of course other packages to make cool graphs in R (like `ggplot2`

or `lattice`

), but so far plot always gave me satisfaction.

In this post we will see how to add information in basic scatterplots, how to draw a legend and finally how to add regression lines.

## Data simulation

#simulate some data dat<-data.frame(X=runif(100,-2,2),T1=gl(n=4,k=25,labels=c("Small","Medium","Large","Big")),Site=rep(c("Site1","Site2"),time=50)) mm<-model.matrix(~Site+X*T1,dat) betas<-runif(9,-2,2) dat$Y<-rnorm(100,mm%*%betas,1) summary(dat)

## Adding colors

First plot adding colors for the different treatments, one way to do this is to pass a vector of colors to the col argument in the plot function.

#select the colors that will be used library(RColorBrewer) #all palette available from RColorBrewer display.brewer.all() #we will select the first 4 colors in the Set1 palette cols<-brewer.pal(n=4,name="Set1") #cols contain the names of four different colors #create a color vector corresponding to levels in the T1 variable in dat cols_t1<-cols[dat$T1] #plot plot(Y~X,dat,col=cols_t1,pch=16)

## Change plotting symbols

We can also create a vector of plotting symbols to represent data from the two different sites, the different plotting symbols available can be seen here.

pch_site<-c(16,18)[factor(dat$Site)] #the argument that control the plotting symbols is pch plot(Y~X,dat,col=cols_t1,pch=pch_site)

## Add a legend to the graph

Now we should add a legend to the graph:

plot(Y~X,dat,col=cols_t1,pch=pch_site) legend("topright",legend=paste(rep(c("Small","Medium","Large","Big"),times=2),rep(c("Site 1","Site 2"),each=4),sep=", "),col=rep(cols,times=2),pch=rep(c(16,18),each=4),bty="n",ncol=2,cex=0.7,pt.cex=0.7)

The first argument to `legend`

is basically its position in the graph, then comes the text of the legend. Optionally one may also specify the colors, plotting symbols etc … of the legend symbol. Have a look at `?legend`

for more options.

We can also add a legend outside of the graph by setting `xpd=TRUE`

and by specifying the x and y coordinates of the legend.

plot(Y~X,dat,col=cols_t1,pch=pch_site) legend(x=-1,y=13,legend=paste(rep(c("Small","Medium","Large","Big"),times=2),rep(c("Site 1","Site 2"),each=4),sep=", "),col=rep(cols,times=2),pch=rep(c(16,18),each=4),bty="n",ncol=2,cex=0.7,pt.cex=0.7,xpd=TRUE)

## Add regression lines

The last thing we might want to add are regression lines

#generate a new data frame with ordered X values new_X<-expand.grid(X=seq(-2,2,length=10),T1=c("Small","Medium","Large","Big"),Site=c("Site1","Site2")) #the model m<-lm(Y~Site+X*T1,dat) #get the predicted Y values pred<-predict(m,new_X) #plot xs<-seq(-2,2,length=10) plot(Y~X,dat,col=cols_t1,pch=pch_site) lines(xs,pred[1:10],col=cols[1],lty=1,lwd=3) lines(xs,pred[11:20],col=cols[2],lty=1,lwd=3) lines(xs,pred[21:30],col=cols[3],lty=1,lwd=3) lines(xs,pred[31:40],col=cols[4],lty=1,lwd=3) lines(xs,pred[41:50],col=cols[1],lty=2,lwd=3) lines(xs,pred[51:60],col=cols[2],lty=2,lwd=3) lines(xs,pred[61:70],col=cols[3],lty=2,lwd=3) lines(xs,pred[71:80],col=cols[4],lty=2,lwd=3) legend(x=-1,y=13,legend=paste(rep(c("Small","Medium","Large","Big"),times=2),rep(c("Site 1","Site 2"),each=4),sep=", "),col=rep(cols,times=2),pch=rep(c(16,18),each=4),lwd=1,lty=rep(c(1,2),each=4),bty="n",ncol=2,cex=0.7,pt.cex=0.7,xpd=TRUE)

There is a whole bunch of function to draw elements within the plotting area, a few examples are: `points`

, `lines`

, `rect`

, `text`

. They are handy in many situations and are very similar of use.

That’s it for this basic post, next times we’ll see how to control axis labels and tick marks.

**leave a comment**for the author, please follow the link and comment on their blog:

**DataScience+**.

R-bloggers.com offers

**daily e-mail updates**about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...