Upcoming data preparation and modeling article series

September 23, 2017

I am pleased to announce that vtreat version 0.6.0 is now available to R users on CRAN. vtreat is an excellent way to prepare data for machine learning, statistical inference, and predictive analytic projects. If you are an R user we strongly suggest you incorporate vtreat into your projects. vtreat handles, in a statistically sound …

Hacking statistics or: How I Learned to Stop Worrying About Calculus and Love Stats Exercises (Part-9)

September 23, 2017

Statistics are often taught in school by and for people who like Mathematics. As a consequence, in those classes emphasis is put on learning equations, solving calculus problems and...

How Random Forests improve simple Regression Trees?

September 22, 2017

By Gabriel Vasconcelos. Regression Trees: In this post I am going to discuss some features of Regression Trees and Random Forests. Regression Trees are known to be very unstable,...

Welcome to R/exams

September 22, 2017

Welcome everybody, we are proud to introduce the brand new web page and blog http://www.R-exams.org/. This provides a central access point for the open-source software “exams” implemented in the R system for...

Big Data Analytics with H20 in R Exercises -Part 1

September 22, 2017

We have dabbled with RevoScaleR before. In this exercise we will work with H2O, another high-performance R library which can handle big data very effectively. It...

Tutorial: Launch a Spark and R cluster with HDInsight

September 22, 2017

If you'd like to get started using R with Spark, you'll need to set up a Spark cluster and install R and all the other necessary software on the...

Multi-Dimensional Reduction and Visualisation with t-SNE

September 22, 2017

t-SNE is a very powerful technique that can be used for visualising (looking for patterns in) multi-dimensional data. Great things have been said about this technique. In this blog...

My advice on dplyr::mutate()

September 22, 2017

There are substantial differences between ad-hoc analyses (be they: machine learning research, data science contests, or other demonstrations) and production worthy systems. Roughly: ad-hoc analyses have to be correct...

Mining USPTO full text patent data – Analysis of machine learning and AI related patents granted in 2017 so far – Part 1

September 21, 2017

The United States Patent and Trademark Office (USPTO) provides immense amounts of data (the data I used are in the form of XML files). After coming across these datasets,...

Will Stanton hit 61 home runs this season?

September 21, 2017

So...

Pirating Pirate Data for Pirate Day

September 21, 2017

This past Tuesday was Talk Like A Pirate Day, the unofficial holiday of R (aRRR!) users worldwide. In recognition of the day, Bob Rudis used R to create this...

Exploratory Data Analysis of Tropical Storms in R

September 21, 2017

The disastrous impact of recent hurricanes, Harvey and Irma, generated a large influx of data within the online community. I was...

Gold-Mining – Week 3 (2017)

September 21, 2017

Week 3 Gold Mining and Fantasy Football Projection Roundup now available. Go get that free agent gold! The post Gold-Mining – Week 3 (2017) appeared first on Fantasy Football Analytics.

Don’t teach students the hard way first

September 21, 2017

Imagine you were going to a party in an unfamiliar area, and asked the host for directions to their house. It takes you thirty minutes to get there, on...

ggformula: another option for teaching graphics in R to beginners

September 21, 2017

A previous entry (http://sas-and-r.blogspot.com/2017/07/options-for-teaching-r-to-beginners.html) describes an approach to teaching graphics in R that also “get students doing powerful things quickly”, as David Robinson suggested. In this guest blog entry, Randall Pruim...

Comparing Trump and Clinton’s Facebook pages during the US presidential election, 2016

September 21, 2017

R has a lot of packages for users to analyse posts on social media. As an experiment in this field, I decided to start with the biggest one: Facebook....

Visualizing the Spanish Contribution to The Metropolitan Museum of Art

September 21, 2017

“Well I walk upon the river like it’s easier than land” (Love is All, The Tallest Man on Earth). The Metropolitan Museum of Art provides here a dataset with...

Pandigital Products: Euler Problem 32

September 20, 2017

Euler Problem 32 returns to pandigital numbers, which are numbers that contain one of each digit. Like so many of the Euler Problems, these numbers serve no practical purpose...

Report from Mexico City

September 20, 2017

Editor’s Note: It has been heartbreaking watching the images from México City. Teresa Ortiz, co-organizer of R-Ladies CDMX, reports on efforts of data scientists to help. Our thoughts are...

Monte Carlo Simulations & the "SimDesign" Package in R

September 20, 2017

Past posts on this blog have included several relating to Monte Carlo simulation - e.g., see here, here, and here. Recently I came across a great article by Matthew Sigal...

Answer probability questions with simulation (part-2)

September 20, 2017

This is the second exercise set on answering probability questions with simulation. Finishing the first exercise set is not a prerequisite. The difficulty level is about the same –...

EARL London 2017 – That’s a wrap!

September 20, 2017

...

Preview: ALTREP promises to bring major performance improvements to R

September 20, 2017

Changes are coming to the internals of the R engine which promise to improve performance and reduce memory use, with dramatic impacts in some circumstances. The changes were first...

pinp 0.0.2: Onwards

September 20, 2017

A first update 0.0.2 of the pinp package arrived on CRAN just a few days after the initial release. We added a new vignette for the package (see below),...

MLJAR R API

September 20, 2017

Hi! We have added an R API for mljar, so you can run sklearn, xgboost, lightGBM, Keras, RGF from one R line :) Please check it out at https://github.com/mljar/mljar-api-R

Major update of D3partitionR: Interactive viz’ of nested data with R and D3.js

September 20, 2017

D3partitionR is an R package to interactively visualize nested and hierarchical data using D3.js and HTML widgets. These last few weeks I’ve been working on a major D3partitionR update...

Regression Analysis — What You Should’ve Been Taught But Weren’t, and Were Taught But Shouldn’t Have Been

September 20, 2017

The above title was the title of my talk this evening at our Bay Area R Users Group. I had been asked to talk about my new book, and...

12 Visualizations to Show a Single Number

September 20, 2017

Infographics, dashboards, and reports often need to highlight or visualize a single number. But how do you highlight a single number so that it has an impact and looks...

Improve the Quality of Data Visualizations Using Redundancy

September 20, 2017

Using multiple visual elements to represent one variable in a chart can increase accuracy and improve readability. This is called adding redundancy or redundant encoding and, if done right, it will...
