500 search results for "hadoop"

Free eBook on Big Data and Data Science

March 24, 2014

The fine folks behind the Big Data Journal have just published a new e-book, "Big Data: Harnessing the Power of Big Data Through Education and Data-Driven Decision Making". (Note: Adobe Flash is required to view the e-book.) In the e-book, you'll find the following technical papers on the topics of Big Data, Data Science, and R: Data Science and...

Call for Presentations – EARL Conference, London

March 3, 2014

The EARL (Effective Applications of the R Language) Conference takes place in London on the 16th and 17th September 2014. It will provide an opportunity for those using and developing the open source statistical programming language R to share and discuss their innovative practices of the R Language. The Conference will create opportunities for: • The post

Google Summer of Code opportunities in data science and machine learning with Ganglia

February 28, 2014

As mentioned in my blog on Monday, the Ganglia Project is proud to be part of Google Summer of Code in 2014. The Ganglia team are offering various types of projects, and different parts of Ganglia would welcome students with different skills, for example:

Component | Skills
gmond agent | C
gmond modules | C or Python
JMXetric | Java
gmetad and rrdtool for storing time series data | C,

Foundations of Statistical Algorithms [book review]

February 27, 2014

There is computational statistics and there is statistical computing. And then there is statistical algorithmics. Not the same thing, by far. This 2014 book by Weihs, Mersmann and Ligges, from TU Dortmund, the latter also being a member of the R Core team, stands at one end of this wide spectrum of techniques required by

R and (Software) Relatives

February 18, 2014

Post also available with code executed inline at rpubs.com. O'Reilly recently published the results of a survey of Strata Conference attendees on tool usage and salary. The entire survey is available for download. In the survey results, R was heralded as second only to SQL as a tool used by conference attendees. A chart from the...

There is no Such Thing as Biomedical "Big Data"

February 11, 2014

At the moment, the world is obsessed with “Big Data” yet it sometimes seems that people who use this phrase don’t have a good grasp of its meaning.  Like most good buzz-words, “Big Data” sparks the idea of something grand and complicated, while sounding ordinary enough that listeners feel like they have an intuitive understanding of the concept.  However...

Revolution Analytics announces $999 site licenses for universities and public service organizations

February 4, 2014

by Joseph Rickert

Revolution Analytics is announcing three new programs today that we hope will be modest but positive contributions to data science education and public service analytics. The first new program, the Academic Institution Program (AIP), enables colleges, universities and other educational institutions to obtain a site license for Revolution Analytics' commercial distribution of the R Language, Revolution...

Data Analysis Steps

February 3, 2014

After going through the overview of tools and technologies needed to become a data scientist in my previous blog post, in this post we shall look at how to tackle a data analysis problem. Any data analysis project starts with identifying a business problem where historical data exists. A business problem can be anything which can include prediction...

Book review: "Doing Data Science" by Rachel Schutt and Cathy O’Neil

January 23, 2014

by Joseph Rickert

Every once in a while a single book comes to crystallize a new discipline. If books still have this power in the era of electronic media, "Doing Data Science: Straight Talk from the Frontline" by Rachel Schutt and Cathy O’Neil (O'Reilly, 2013) might just be the book that defines data science. "Doing Data Science", which is...

AMPLab Announces Developer Preview of SparkR

January 20, 2014

The team at AMPLab has announced a developer preview of SparkR, an R package enabling R users to run jobs on an Apache Spark cluster. Spark is an open source project that supports distributed in-memory computing for advanced analytics, such as fast queries, machine learning, streaming analytics and graph engines. Spark works with every data format supported in Hadoop,...
