412 search results for "hadoop"

R in a 64 bit world

June 8, 2015
By
R in a 64 bit world

32 bit data structures (pointers, integer representations, single precision floating point) have been past their “best before date” for quite some time. R itself moved to a 64 bit memory model some time ago, but still has only 32 bit integers. This is going to get more and more awkward going forward. What is R … Continue reading...

Read more »

Any R code as a cloud service: R demonstration at BUILD

June 5, 2015
By
Any R code as a cloud service: R demonstration at BUILD

At last month's BUILD conference for Microsoft developers in San Francisco, R was front-and-center on the keynote stage. In the keynote, Microsoft CVP Joseph Sirosh introduced the "language of data": open source R. Sirosh encouraged the audience to learn R, saying "if there is a single language that you choose to learn today .. let it be R". The...

Read more »

Update on Snowdoop, a MapReduce Alternative

May 29, 2015
By
Update on Snowdoop, a MapReduce Alternative

In blog posts a few months ago, I proposed an alternative to MapReduce, e.g. to Hadoop, which I called “Snowdoop.” I pointed out that systems like Hadoop and Spark are very difficult to install and configure, are either too primitive (Hadoop)  or too abstract (Spark) to program, and above all, are SLOW. Spark is of … Continue reading...

Read more »

SparkR preview by Vincent Warmerdam

May 28, 2015
By
SparkR preview by Vincent Warmerdam

SparkR preview in Rstudio Apache Spark is the hip new technology on the block. It allows you to write scripts in a functional style and the technology behind it will allow you to run iterative tasks very quickly on a cluster of machines. It’s benchmarked to be quicker than hadoop for most machine learning use

Read more »

RevoScaleR’s Naive Bayes Classifier rxNaiveBayes()

May 28, 2015
By
RevoScaleR’s Naive Bayes Classifier rxNaiveBayes()

by Joseph Rickert, Because of its simplicity and good performance over a wide spectrum of classification problems the Naïve Bayes classifier ought to be on everyone's short list of machine learning algorithms. Now, with version 7.4 we have a high performance Naïve Bayes classifier in Revolution R Enterprise too. Like all Parallel External Memory Algorithms (PEMAs) in the RevoScaleR...

Read more »

R tops 2015 KDnuggets Software Poll

May 27, 2015
By
R tops 2015 KDnuggets Software Poll

R is the leading choice for Predictive Analytics / Data Mining / Data Science software according to the results of the 2015 KDnuggets Software Poll, now in its 16th year. Each of the 28,000 participants selected one or more tools they had used in the last year from a list of 93 options, and R was selected by 46.9%...

Read more »

R #1 by Wide Margin in Latest KDnuggets Poll

May 27, 2015
By
R #1 by Wide Margin in Latest KDnuggets Poll

The results of the latest KDnuggets Poll on software for Analytics, Big Data and Data Mining are out, and R has moved into the #1 position by a wide margin. I’ve updated the Surveys of Use section of The Popularity of Data … Continue reading →

Read more »

First Day Highlights from the Extremely Large Databases Conference

May 21, 2015
By
First Day Highlights from the Extremely Large Databases Conference

by Joseph Rickert The 8th XLDB (Extremely Large Databases) Conference open at Stanford on Tuesday with an outstanding program. This conference has been providing leadership in the "Big Data" world since its first workshop which was held in 2007. For example, the summary report for that year notes: "Both communities (industry and science) are moving towards parallel ... architectures...

Read more »

Course on using Oracle R Enterprise

Course on using Oracle R Enterprise

BNOSAC will be giving from June 08 up to June 12 a 5-day crash course on the use of R using Oracle R Enterprise. The course is given together with our Oracle Partner in Leuven, Belgium. If you are interested in attending, contact us for further details. For R users who aren't aware of this yet....

Read more »

Open soure software has changed the way we do business

May 20, 2015
By

Earlier this month TechCrunch published an article of mine, "The Business Economics And Opportunity Of Open-Source Data Science". With this article I wanted to share how open-source software has disrupted the economics of doing business, now that data is a fundamental component of every businesses' operations. Open source projects like Hadoop and R, coupled with commodity hardware, have fundamentally...

Read more »