(This article was first published on

**Learning Data Science**, and kindly contributed to R-bloggers)Quandl is a new database management tool which seeks to become the place to find datasets. That is, each unique indicator is considered an independent data set. This helps them to seem to have a ginormous quantity of data sets. Source : Blog Econometric Simulation.

To load or find the datasets, we have to authentify using the API like with Twitter First, we need to set an account to receive the *Quandl.auth*

**setwd("D:/PERSO/R_Working/Tutoriels/Probability")**

library(Quandl)

Quandl.auth("XXXXXXXXXX") ### Replace with yours

For example, we load the database about pollution and gdp and try to find the link.

**plot(date, pollution, col = "red", type = "o", lwd = 2, ylim = c(70, 150), ylab = "", **

main = "Evolution du PIB vs Pollution entre 94 et 2006 au Japon")

lines(date, pib, lty = 2, col = "purple", type = "o")

legend("topright", legend = c("PIB", "Pollution"), col = c("purple", "red"),

pch = 15, bty = "n", pt.cex = 2, cex = 0.8, text.col = "black", horiz = TRUE,

inset = c(0.1, 0.1))

We fit a model to know if the growth of GDP is responsible of pollution’s growth in japan between 1994 and 2006.

Ecologists are not totaly wrong ! ! !

The entire code to run this post

Share it ! ! !

To

**leave a comment**for the author, please follow the link and comment on their blog:**Learning Data Science**.R-bloggers.com offers

**daily e-mail updates**about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...