904 search results for "SQL"

KDD Cup 2015: The story of how I built hundreds of predictive models….And got so close, yet so far away from 1st place!

June 25, 2015
By
KDD Cup 2015: The story of how I built hundreds of predictive models….And got so close, yet so far away from 1st place!

(This article was first published on Data Until I Die!, and kindly contributed to R-bloggers) The challenge from the KDD Cup this year was to use their data relating to student enrollment in online MOOCs to predict who would drop out vs who would stay. The short story is that using H2O and a lot of my free time,...

Read more »

Running RStudio on Digital Ocean, AWS etc Using Tutum and Docker Containers

June 24, 2015
By
Running RStudio on Digital Ocean, AWS etc Using Tutum and Docker Containers

Via RBloggers I noticed a tutorial today on Setting Rstudio server using Amazon Web Services (AWS). In the post Getting Started With Personal App Containers in the Cloud I described how I linked my tutum account to a Digital Ocean hosting account and then launched a Digital Ocean server. (How to link tutum to Amazon

Read more »

Using Hadoop with R: It Depends.

June 19, 2015
By

by Bill Jacobs, Director Technical Sales, Microsoft Advanced Analytics In the course of working with our Hadoop users, we are often asked, what's the best way to integrate R with Hadoop? The answer, in nearly all cases is, It depends. Alternatives ranging from open source R on workstations, to parallelized commercial products like Revolution R Enterprise and many steps...

Read more »

SparkR: Distributed data frames with Spark and R

June 12, 2015
By

R is now integrated with Apache Spark, the open-source cluster computing framework. The Databricks blog announced this week that yesterday's release of Spark 1.4 would include SparkR, "an R package that allows data scientists to analyze large datasets and interactively run jobs on them from the R shell". The SparkR 1.4 announcement led with the news: Spark 1.4 introduces...

Read more »

15 Easy Solutions To Your Data Frame Problems In R

June 11, 2015
By
15 Easy Solutions To Your Data Frame Problems In R

R’s data frames regularly create somewhat of a furor on public forums like Stack Overflow and Reddit. Starting R users often experience problems with the data frame in R and it doesn’t always seem to be straightforward. But does it really need to be so? Well, not necessarily. With today’s post, DataCamp wants to show The post

Read more »

In case you missed it: May 2015 roundup

June 10, 2015
By

In case you missed them, here are some articles from May of particular interest to R users. RStudio 0.99 released with improved autocomplete and data viewer features. A tutorial on the new Naive Bayes classifier in the RevoScaleR package. R is the most popular Predictive Analytics / Data Mining / Data Science software in the latest KDnuggets poll. A...

Read more »

Mortgages Are About Math: Open-Source Loan-Level Analysis of Fannie and Freddie

June 9, 2015
By
Mortgages Are About Math: Open-Source Loan-Level Analysis of Fannie and Freddie

ortgages were acknowledged to be the most mathematically complex securities in the marketplace. The complexity arose entirely out of the option the homeowner has to prepay his loan; it was poetic that the single financial complexity contributed to the marketplace by the common man was the Gordian knot giving the best brains on Wall Street a run...

Read more »

The challenge of combining 176 x #otherpeoplesdata to create the Biomass And Allometry Database

June 3, 2015
By
The challenge of combining 176 x #otherpeoplesdata to create the Biomass And Allometry Database

Despite the hype around "big data", a more immediate problem facing many scientific analyses is that large-scale databases must be assembled from a collection of small independent and heterogeneous fragments -- the outputs of many and isolated scientific studies conducted around the globe. Collecting and compiling these fragments is challenging at both political and technical levels. The political challenge is...

Read more »

How To Analyze Data: Seven Modern Remakes Of The Most Famous Graphs Ever Made

June 2, 2015
By
How To Analyze Data: Seven Modern Remakes Of The Most Famous Graphs Ever Made

Graphs can be beautiful, powerful tools. Graphs help us explore and explain the world. For hundreds of years, humans have used graphs to tell stories with data. To pay homage to the history of data visualization and to the power of graphs, we’ve recreated the most iconic graphs ever made. Some are remakes of the original shown in a...

Read more »

Simple Data Science To Maximize Return On Lottery Investment

June 1, 2015
By
Simple Data Science To Maximize Return On Lottery Investment

Every finite game has an equilibrium point (John Nash, Non-Cooperative Games, 1950) I read recently this amazing book, where I discovered that we (humans) are not capable of generating random sequences of numbers by ourselves when we play lottery. John Haigh demonstrates this fact analyzing a sample of 282 raffles of 6/49 UK Lotto. Once … Continue reading...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training



http://www.eoda.de







ODSC

ODSC

CRC R books series





Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)