881 search results for "sql"

Using Hadoop with R: It Depends.

June 19, 2015
By

by Bill Jacobs, Director Technical Sales, Microsoft Advanced Analytics In the course of working with our Hadoop users, we are often asked, what's the best way to integrate R with Hadoop? The answer, in nearly all cases is, It depends. Alternatives ranging from open source R on workstations, to parallelized commercial products like Revolution R Enterprise and many steps...

Read more »

SparkR: Distributed data frames with Spark and R

June 12, 2015
By

R is now integrated with Apache Spark, the open-source cluster computing framework. The Databricks blog announced this week that yesterday's release of Spark 1.4 would include SparkR, "an R package that allows data scientists to analyze large datasets and interactively run jobs on them from the R shell". The SparkR 1.4 announcement led with the news: Spark 1.4 introduces...

Read more »

15 Easy Solutions To Your Data Frame Problems In R

June 11, 2015
By
15 Easy Solutions To Your Data Frame Problems In R

R’s data frames regularly create somewhat of a furor on public forums like Stack Overflow and Reddit. Starting R users often experience problems with the data frame in R and it doesn’t always seem to be straightforward. But does it really need to be so? Well, not necessarily. With today’s post, DataCamp wants to show The post

Read more »

In case you missed it: May 2015 roundup

June 10, 2015
By

In case you missed them, here are some articles from May of particular interest to R users. RStudio 0.99 released with improved autocomplete and data viewer features. A tutorial on the new Naive Bayes classifier in the RevoScaleR package. R is the most popular Predictive Analytics / Data Mining / Data Science software in the latest KDnuggets poll. A...

Read more »

Mortgages Are About Math: Open-Source Loan-Level Analysis of Fannie and Freddie

June 9, 2015
By
Mortgages Are About Math: Open-Source Loan-Level Analysis of Fannie and Freddie

ortgages were acknowledged to be the most mathematically complex securities in the marketplace. The complexity arose entirely out of the option the homeowner has to prepay his loan; it was poetic that the single financial complexity contributed to the marketplace by the common man was the Gordian knot giving the best brains on Wall Street a run...

Read more »

The challenge of combining 176 x #otherpeoplesdata to create the Biomass And Allometry Database

June 3, 2015
By
The challenge of combining 176 x #otherpeoplesdata to create the Biomass And Allometry Database

Despite the hype around "big data", a more immediate problem facing many scientific analyses is that large-scale databases must be assembled from a collection of small independent and heterogeneous fragments -- the outputs of many and isolated scientific studies conducted around the globe. Collecting and compiling these fragments is challenging at both political and technical levels. The political challenge is...

Read more »

How To Analyze Data: Seven Modern Remakes Of The Most Famous Graphs Ever Made

June 2, 2015
By
How To Analyze Data: Seven Modern Remakes Of The Most Famous Graphs Ever Made

Graphs can be beautiful, powerful tools. Graphs help us explore and explain the world. For hundreds of years, humans have used graphs to tell stories with data. To pay homage to the history of data visualization and to the power of graphs, we’ve recreated the most iconic graphs ever made. Some are remakes of the original shown in a...

Read more »

Simple Data Science To Maximize Return On Lottery Investment

June 1, 2015
By
Simple Data Science To Maximize Return On Lottery Investment

Every finite game has an equilibrium point (John Nash, Non-Cooperative Games, 1950) I read recently this amazing book, where I discovered that we (humans) are not capable of generating random sequences of numbers by ourselves when we play lottery. John Haigh demonstrates this fact analyzing a sample of 282 raffles of 6/49 UK Lotto. Once … Continue reading...

Read more »

R #1 by Wide Margin in Latest KDnuggets Poll

May 27, 2015
By
R #1 by Wide Margin in Latest KDnuggets Poll

The results of the latest KDnuggets Poll on software for Analytics, Big Data and Data Mining are out, and R has moved into the #1 position by a wide margin. I’ve updated the Surveys of Use section of The Popularity of Data … Continue reading →

Read more »

First Day Highlights from the Extremely Large Databases Conference

May 21, 2015
By
First Day Highlights from the Extremely Large Databases Conference

by Joseph Rickert The 8th XLDB (Extremely Large Databases) Conference open at Stanford on Tuesday with an outstanding program. This conference has been providing leadership in the "Big Data" world since its first workshop which was held in 2007. For example, the summary report for that year notes: "Both communities (industry and science) are moving towards parallel ... architectures...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training



http://www.eoda.de









ODSC

CRC R books series











Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)