2067 search results for "Regression"

Integrating R with production systems using an HTTP API

August 19, 2014
By
Integrating R with production systems using an HTTP API

by Nick Elprin, Co-Founder of Domino Data Lab We built a platform that lets analysts deploy R code to an HTTP server with one click, and we describe it in detail below. If you have ever wanted to invoke your R model with a simple HTTP call, without dealing with any infrastructure setup or asking for help from developers...

Read more »

Propensity Modeling, Causal Inference, and Discovering Drivers of Growth

August 14, 2014
By
Propensity Modeling, Causal Inference, and Discovering Drivers of Growth

Imagine you just started a job at a new company. You watched World War Z recently, so you're in a skeptical mood, and given that your last two startups failed from what you believe to be a lack of data, you're giving everything an extra critical eye. You start by thinking about the impact of the sales team. How...

Read more »

Table comparing the statistical capabilities of software packages

August 13, 2014
By
Table comparing the statistical capabilities of software packages

A statistical consultant known only as "Stanford PhD" has put together a table comparing the statistical capabilities of the software packages R, Matlab, SAS, Stata and SPSS. For each of 57 methods (including techniques like "ridge regression", "survival analysis", "optimization") the author ranks the capabilities of each software package as "Yes" (fully supported), "Limited" or "Experimental". Here are the...

Read more »

Vtreat: designing a package for variable treatment

August 7, 2014
By
Vtreat: designing a package for variable treatment

When you apply machine learning algorithms on a regular basis, on a wide variety of data sets, you find that certain data issues come up again and again: Missing values (NA or blanks) Problematic numerical values (Inf, NaN, sentinel values like 999999999 or -1) Valid categorical levels that don’t appear in the training data (especially Related posts:

Read more »

Social Media Mining and Bioinformatics (with R)

August 5, 2014
By
Social Media Mining and Bioinformatics (with R)

In June and July, I receive copies of two books, Social Media Mining with R, by Nathan Danneman and Richard Heimann Bioinformatics with R Cookbook, by Paurush Praveen Sinha For the first one, two recent interesting books deal with the same topic. Reza Zafarani, Mohammad Ali Abbasi and Huan Liu published last year Social Media Mining: An Introduction. Actually, the book can...

Read more »

Guns are Cool – Differences between states

August 3, 2014
By
Guns are Cool – Differences between states

Last week my blog showed that there are differences between states in the shootingtracker database. This week it is attempted to understand why states are different. A number of variables were extracted from a few sources, among which gun laws, % ...

Read more »

ideal point graphics, via d3

July 30, 2014
By
ideal point graphics, via d3

I’ve updated some of the graphical displays of the ideal point estimates I serve up here. I’ve rendered some of these in d3, with some rollover lah-de-dah: (1) 113th House ideal points in a long “caterpillar” format; (2) scatterplot of ideal point against Obama 2012 vote in district. Screenshot of the scatterplot appears below. My

Read more »

Measuring Fat Loss without the scale

July 30, 2014
By
Measuring Fat Loss without the scale

@tdhopper posted his self-measurements of weight loss a few months back. I recently decided also that I wanted to lose fat-weight—the infamous “I could stand to be a few kilos lighter”—and I think I came up with a more productive way of thinking about my progress: I’m not going to look at the scale at all....

Read more »

Fast-track publishing using the new R markdown – a tutorial and a quick look behind the scenes

July 29, 2014
By
Fast-track publishing using the new R markdown – a tutorial and a quick look behind the scenes

The new R Markdown (rmarkdown-package) introduced in Rstudio 0.98.978 provides some neat features by combining the awesome knitr-package and the pandoc-system. The system allows for some neat simplifications of the fast-track-publishing (ftp) idea using so called formats. I've created a new package, the Grmd-package, with an extension to the html_document format, called the docx_document. The formatter allows an almost...

Read more »

Scraping information of CRAN packages

July 28, 2014
By

(This article is adapted to the latest version of rvest package.) In my previous post, I demonstrated how we can scrape online data using existing packages. In this post, I will take it a bit further: I will scrape more information of CRAN packages since each of them also has a web page like this. More specifically,...

Read more »

Sponsors

Mango solutions



plotly webpage

dominolab webpage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

datasociety

http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training









Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)