DeployR Enterprise Security Model

July 26, 2016
By
DeployR Enterprise Security Model

by Ramkumar Chandrasekeran, Software Engineer - Microsoft R Tiger Team DeployR Enterprise is designed to deliver analytics solutions at scale to whomever needs it: inside or outside the enterprise. It also guarantees secure delivery of your analytics via DeployR web services. These secure web services integrate seamlessly with existing enterprise security solutions: Single Sign-On, LDAP, Active Directory, PAM, and...

Read more »

The reproducibility crisis in science and prospects for R

July 26, 2016
By

Guest post by Gregorio Santori (<[email protected]>) The results that emerged from a recent Nature‘s survey confirm as, for many researchers, we are living in a weak reproducibility age (Baker M. Is there a reproducibility crisis? Nature 2016;533:453-454). Although the definition of reproducibility can vary widely between disciplines, in this survey was adopted the version for … Continue...

Read more »

Calculating Pi using Buffon’s Needle

July 26, 2016
By
Calculating Pi using Buffon’s Needle

I put together this example to illustrate some general R programming principles for my Data Science class at iXperience. The idea is to use Buffon’s Needle to generate a stochastic estimate for pi. Here are the results (click on the image for an interactive version). The orange line is the reference value and the blue The post

Read more »

rotl paper published

rotl paper published

We are excited to announce a paper describing rotl, our package for the Open Tree of Life data, has been published. The full citation is: Michonneau, F., Brown, J. W. and Winter, D. J. (2016), rotl: an R package to interact with the Open Tree of Life data. Methods Ecol Evol. doi: https://doi.org/10.1111/2041-210X.12593 The paper, which is freely available, describes the package and...

Read more »

RcppHoney Introduction

July 25, 2016
By
RcppHoney Introduction

Rationale In C++ we often have containers that are not compatible with R or Rcpp with data already in them (std::vector, std::set, etc.). One would like to be able to operate on these containers without having to copy them into Rcpp structures like I...

Read more »

Read a lot of datasets at once with R

July 25, 2016
By

I often have to read a lot of datasets at once using R. So I’ve wrote the following function to solve this issue: read_list <- function(list_of_datasets, read_func){ read_and_assign <- function(dataset, read_func){ dataset_name <- as.name(dataset) ...

Read more »

Microsoft R Open 3.3.0 now available

July 25, 2016
By
Microsoft R Open 3.3.0 now available

Microsoft R Open 3.3.0, the enhanced distribution of open source R, is now available for download for Windows, Mac and Linux systems. This release includes a significant updates to the R language engine from the R Core Group bundled with the multithreaded performance and CRAN package time-machine capabilities of Microsoft R Open. The core performance of the R language...

Read more »

Showing code changes when teaching

July 25, 2016
By

A key – but challenging – part of learning to program is moving from writing technically-correct code “that works” to writing high-quality code that is sensibly decomposed into functions, generically-applicable and generally “good”. Indeed, you could say that this is exactly what Software Carpentry is about – taking you from someone bodging together a few

Read more »

Economy and dynamic modelling: Haavelmo’s approach

July 25, 2016
By

Econometrics aims at estimating observables in the economy and their inter-dependencies and testing the estimates against the economic reality. A quantitative approach to express these inter-dependencies appear as simultaneous equations, an i.e. system of linear equations, this is  a mathematical structure of economic relationships that were made possible with the pioneering work of Nobel prize winning economist...

Read more »

Image recognition in R using convolutional neural networks with the MXNet package

July 25, 2016
By
Image recognition in R using convolutional neural networks with the MXNet package

Among R deep learning packages, MXNet is my favourite one. Why you may ask? Well I can’t really say why this time. It feels relatively simple, maybe because at first sight its workflow looks similar to the one used by Keras, maybe because it was my f...

Read more »

The collaborative innovation landscape in data science

July 24, 2016
By

Computing platforms should be like Lego. That is, they should provide the fundamental building blocks and enable the users' imagination to innovate. The latest issue of Stata Journal exemplifies how Stata and, by the same account, R provide the platform for the users to innovate beyond the innate capacity of the core group responsible...

Read more »

Using 2D Contour Plots within {ggplot2} to Visualize Relationships between Three Variables

July 24, 2016
By
Using 2D Contour Plots within {ggplot2} to Visualize Relationships between Three Variables

Guest post by John Bellettiere, Vincent Berardi, Santiago Estrada The Goal To visually explore relations between two related variables and an outcome using contour plots. We use the contour function in Base R to produce contour plots that are well-suited for initial investigations into three dimensional data. We then develop visualizations using ggplot2 to gain … Continue...

Read more »

RcppArmadillo 0.7.200.2.0

July 24, 2016
By
RcppArmadillo 0.7.200.2.0

The second Armadillo release of the 7.* series came out a few weeks ago: version 7.200.2. And RcppArmadillo version 0.7.200.2.0 is now on CRAN and uploaded to Debian. This followed the usual thorough reverse-dependecy checking of by now over 240 pac...

Read more »

Mid-year R Packages Update Summary

July 24, 2016
By

I been updating some existing packages and github-releasing new ones (before a CRAN push). Most are “cyber”-related, but there are some general purpose ones. Here’s a quick overview: docxtractr (CRAN, now, v0.2.0) was initially designed to make it easy to get data tables out of MS Word (docx) documents. The update removes use of a... Continue reading...

Read more »

Placement: An R package to Access the Google Maps API

July 24, 2016
By

A few months ago I set out to write an R package for accessing the Maps API with my employer’s (paid) Google for Work/Premium account. At the time, I was unable to find an R package that could generate the encrypted signature, send the URL to Google...

Read more »

Start here to learn R!

July 22, 2016
By
Start here to learn R!

Ready, set, go! On R-exercises, you will find hundreds of exercises that will help you to learn R. We’ve bundled them into exercise sets, where each set covers a specific concept or function. An exercise set typically contains about 10 exercises, progressing from easy to somewhat more difficult. In order to give you a full

Read more »

StatET IDE for R

July 22, 2016
By
StatET IDE for R

I personally do not use Integrated Development environments (IDEs) for R, or for that matter for any programming language. From my point of view, they take up too much precious real estate on the screen, and most important, they generally do not allow me to use my own text editor and my own abbreviations and … Continue...

Read more »

Don’t miss Hadley Wickham’s Master R Workshop September 12 and 13 in NYC

July 22, 2016
By
Don’t miss Hadley Wickham’s Master R Workshop September 12 and 13 in NYC

New York City is a wonderful place to be most of the time but especially in September! If you live or work in the city or just want a good business reason to visit, consider joining RStudio Chief Data Scientist Hadley Wickham in the heart of Manhattan on September 12th and 13th, just by Times Square

Read more »

More data scientists prefer R: survey

July 22, 2016
By
More data scientists prefer R: survey

by Joseph Rickert Last week in a webinar, Burtch Works, an Illinois based executive recruiting firm that specializes in finding analytic talent, released the results of their third annual survey of "quantitative business professionals". Other than having attended this webinar, I have no knowledge of Burtch Works, but I am willing to take their survey at face value as...

Read more »

Mortality by Year and Age

July 22, 2016
By
Mortality by Year and Age

Taking another look at the data from the lifespan package. Plot below shows the evolution of mortality in the US as a function of year and age. Also, following up on a suggestion from @robjohnnoble, population data have been included in the package. The post Mortality by Year and Age appeared first on Exegetic Analytics.

Read more »

Eclipse – an alternative to RStudio – part 1

July 22, 2016
By
Eclipse – an alternative to RStudio – part 1

Many R users enjoy working in RStudio, but it’s not your only option for reproducible research in R. Stephen Wahlbrink has written a plugin for Eclipse, Statet, that offers similar functionality to RStudio within the popular Eclipse framework. You can run an R console, compose a document in LaTeX with R code chunks, and link Related Post

Read more »

RcppCCTZ 0.0.5

July 21, 2016
By

Version 0.0.5 of RcppCCTZ arrived on CRAN a couple of days ago. It reflects an upstream fixed made a few weeks ago. CRAN tests revealed that g++-6 was tripping over one missing #define; this was added upstream and I subsequently synchronized with upstream. At the same time the set of examples was extended (see below). Somehow...

Read more »

A budget of classifier evaluation measures

July 21, 2016
By
A budget of classifier evaluation measures

Beginning analysts and data scientists often ask: “how does one remember and master the seemingly endless number of classifier metrics?” My concrete advice is: Read Nina Zumel’s excellent series on scoring classifiers. Keep notes. Settle on one or two metrics as you move project to project. We prefer “AUC” early in a project (when you … Continue...

Read more »

Return of the NEISS Data

July 21, 2016
By
Return of the NEISS Data

Almost six months ago (!) I wrote a blog post about the NEISS data set, a sample of accidents reported to emergency rooms in the U.S. that are related to consumer products. Ever since I did that exploration, I have been wanting to ask a bit of a different question from that sample of accidents. How do the...

Read more »

BioC 2016 Conference Overview and Few Ways of Downloading TCGA Data

July 21, 2016
By
BioC 2016 Conference Overview and Few Ways of Downloading TCGA Data

Few weeks ago I have a great pleasure of attending BioC 2016: Where Software and Biology Connect Conference at Stanford, where I have learned a lot! It wouldn’t be possible without the scholarship that I received from Bioconductor (organizers), whic...

Read more »

New: Traditional Chinese Translation of Intro to R

July 21, 2016
By

New Free Course: Introduction to R in Traditional Chinese The DataCamp team is thrilled to announce that our Introduction to R course has been generously translated by our friend and DataCamp user Tony Yao-Jen Kuo to Traditional Chinese! Tony holds a...

Read more »

Introducing the Microsoft Data Science Summit, Sep 26-27

July 21, 2016
By

Microsoft has a brand-new conference, exclusively for data scientists, big data engineers, and machine learning practitioners. The Microsoft Data Science Summit, to be held in Atlanta GA, September 26-27, will feature talks and lab sessions from Microsoft engineers and thought leaders on using data science techniques and Microsoft technology, applied to real-world problems. Included in the agenda are several...

Read more »

Bulk Downloading Adobe Analytics Data

July 21, 2016
By

This blog post also serves as release notes for RSiteCatalyst v1.4.9, as only one feature was added (batch report request and download). But it’s a feature big enough for its own post! Recently, I was asked how I would approach replicating the market basket analysis blog post I wrote for 33 Sticks, but using a

Read more »

Does sentiment analysis work? A tidy analysis of Yelp reviews

July 21, 2016
By
Does sentiment analysis work? A tidy analysis of Yelp reviews

This year Julia Silge and I released the tidytext package for text mining using tidy tools such as dplyr, tidyr, ggplot2 and broom. One of the canonical examples of tidy text mining this package makes possible is sentiment analysis. Sentiment analysis is often used by companies to quantify general social media opinion (for...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Dommino data lab

Quantide: statistical consulting and training



http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.