R Markdown to other document formats

December 26, 2012

(This article was first published on Will Lowe » R, and kindly contributed to R-bloggers)

Perhaps you have a file written in Markdown with embedded R of the kind that RStudio makes so nice and easy but you’d like a range of output formats to keep your collaborators happy.  Say latex, pdf, html and MS Word.  Here’s what you might do

I shall imagine your file is called doc.Rmd

  1. Install pandoc http://code.google.com/p/pandoc/downloads/list on your machine. (If you’re on Windows you can also use Tal Galili’s handy install.pandoc function described here).
  2. Install the knitr package. From within R
  3. Save the following code into a file. I shall imagine you called it rmd.R
    rmd.convert <- function(fname, output=c('latex', 'word', 'html', "pdf")){
      thedir <- file_path_as_absolute(dirname(fname))
      thefile <- (basename(fname)) 
      create_latex <- function(f){
        knit(f, 'tmp-outputfile.md'); 
        newname <- paste0(file_path_sans_ext(f), ".tex")
        mess <- paste('pandoc -f markdown -t latex -s -o', shQuote(newname), 
                      "tmp-outputfile.md; rm tmp-outputfile.md")
        cat("The Latex file is", file.path(thedir, newname), 
            "\nIf transporting do not forget to include the folder", file.path(thedir, "figure"), "\n")
      create_word <- function(f){
        knit(f, 'tmp-outputfile.md');
        newname <- paste0(file_path_sans_ext(f),".docx")
        mess <- paste('pandoc -f markdown -t docx -o', shQuote(newname), "tmp-outputfile.md; rm tmp-outputfile.md")
        cat("The Word (docx) file is", file.path(thedir, newname), "\n")
      create_html <- function(f){
        cat("The main HTML file is", file.path(thedir, paste0(file_path_sans_ext(f), ".html")), 
            "\nIf transporting do not forget to include the folder", file.path(thedir, "figure"), "\n")
      create_pdf <- function(f){
        knit(f, 'tmp-outputfile.md');
        newname <- paste0(file_path_sans_ext(f),".pdf")
        mess <- paste('pandoc -f markdown -o', shQuote(newname), "tmp-outputfile.md; rm tmp-outputfile.md")
        cat("The PDF file is", file.path(thedir, newname), "\n")
      origdir <- getwd()  
        setwd(thedir) ## put us next to the original Rmarkdown file
        out <- match.arg(output)
        )}, finally=setwd(origdir))
  4. Within R, source the file to get access to the rmd.convert function it defines
  5. Apply rmd.convert to your source document
    rmd.convert('doc.Rmd', 'html') ## for a web page
    rmd.convert('doc.Rmd', 'latex') ## for a latex document
    rmd.convert('doc.Rmd', 'pdf') ## for a pdf document
    rmd.convert('doc.Rmd', 'word') ## for a word document

If the function runs correctly then the last line of output will tell you what your new output file is called. The resulting document and any generated folders will land next to (that is, in the same folder as) the original file.

The latex output option requires Latex to be installed on your machine, obviously. Less obviously, so does the pdf option. If you want other formats supported by pandoc, then just add a create_ function in the style of the existing ones and add it to the function’s output parameter list and to the switch statement at the bottom.

To leave a comment for the author, please follow the link and comment on their blog: Will Lowe » R.

R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...

If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Comments are closed.


Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)