Data.Table by Example – Part 1

September 26, 2017

For many years, I actively avoided the data.table package and preferred to utilize the tools available in either base R or dplyr for data aggregation and exploration. However, over the past year, I have come to realize that this was a mistake. Data tables are incredible and provide R users with a syntactically concise and...

Read more »

How to Create an Interactive Infographic

September 25, 2017

Interactive infographics can be used to communicate a lot of information in an engaging way. With the right tools, they are also relatively straightforward to create. In this...

Read more »

News Roundup from Microsoft Ignite

September 25, 2017

It's been a big day for the team here at Microsoft, with a flurry of announcements from the Ignite conference in Orlando. We'll provide more in-depth details in the...

Read more »

Custom Level Coding in vtreat

September 25, 2017

One of the services that the R package vtreat provides is level coding (what we sometimes call impact coding): converting the levels of a categorical variable to a meaningful...

Read more »

Time Series Analysis in R Part 2: Time Series Transformations

September 25, 2017

In Part 1 of this series, we got started by looking at the ts object in R and how it represents time series data. In Part 2, I’ll discuss...

Read more »

Speeding Up Digital Arachnids

September 25, 2017

spiderbar, spiderbar Reads robots rules from afar. Crawls the web, any size; Fetches with respect, never lies. Look Out! Here comes the spiderbar. Is it fast? Listen bud, It's...

Read more »

Survival Analysis with R

September 24, 2017

With roots dating back to at least 1662 when John Graunt, a London merchant, published an extensive set of inferences based on mortality records, survival analysis is one of...

Read more »

Super excited for R promises

September 24, 2017

We at Appsilon are excited that RStudio will introduce promises in R quite soon, which is going to be a huge step forward in programming in R (we have already...

Read more »

eXtremely Boost your machine learning Exercises (Part-1)

September 24, 2017

eXtreme Gradient Boosting is a machine learning model which became really popular a few years ago after winning several Kaggle competitions. It is a very powerful algorithm that uses an ensemble...

Read more »

RcppGSL 0.3.3

September 24, 2017

A maintenance update RcppGSL 0.3.3 is now on CRAN. It switched the vignette to our new pinp package and its two-column pdf default. The RcppGSL package provides an...

Read more »

RcppCNPy 0.2.7

September 23, 2017

A new version of the RcppCNPy package arrived on CRAN yesterday. RcppCNPy provides R with read and write access to NumPy files thanks to the cnpy library by Carl...

Read more »

RcppClassic 0.9.8

September 23, 2017

A bug-fix release RcppClassic 0.9.8 for the very recent 0.9.7 release which fixes a build issue on macOS introduced in 0.9.7. No other changes. Courtesy of CRANberries, there are...

Read more »

Upcoming data preparation and modeling article series

September 23, 2017

I am pleased to announce that vtreat version 0.6.0 is now available to R users on CRAN. vtreat is an excellent way to prepare data for machine learning, statistical...

Read more »

Hacking statistics or: How I Learned to Stop Worrying About Calculus and Love Stats Exercises (Part-9)

September 23, 2017

Statistics are often taught in school by and for people who like Mathematics. As a consequence, in those classes emphasis is put on learning equations, solving calculus problems and...

Read more »

How Random Forests improve simple Regression Trees?

September 22, 2017

By Gabriel Vasconcelos. Regression Trees: In this post I am going to discuss some features of Regression Trees and Random Forests. Regression Trees are known to be very unstable,...

Read more »

Welcome to R/exams

September 22, 2017

Welcome everybody, we are proud to introduce the brand new web page and blog http://www.R-exams.org/. This provides a central access point for the open-source software “exams” implemented in the R system for...

Read more »

Big Data Analytics with H20 in R Exercises -Part 1

September 22, 2017

We have dabbled with RevoScaleR before. In this exercise we will work with H2O, another high-performance R library which can handle big data very effectively. It...

Read more »

Tutorial: Launch a Spark and R cluster with HDInsight

September 22, 2017

If you'd like to get started using R with Spark, you'll need to set up a Spark cluster and install R and all the other necessary software on the...

Read more »

Multi-Dimensional Reduction and Visualisation with t-SNE

September 22, 2017

t-SNE is a very powerful technique that can be used for visualising (looking for patterns in) multi-dimensional data. Great things have been said about this technique. In this blog...

Read more »

My advice on dplyr::mutate()

September 22, 2017

There are substantial differences between ad-hoc analyses (be they: machine learning research, data science contests, or other demonstrations) and production worthy systems. Roughly: ad-hoc analyses have to be correct...

Read more »

Mining USPTO full text patent data – Analysis of machine learning and AI related patents granted in 2017 so far – Part 1

September 21, 2017

The United States Patent and Trademark Office (USPTO) provides immense amounts of data (the data I used are in the form of XML files). After coming across these datasets,...

Read more »

Will Stanton hit 61 home runs this season?

September 21, 2017

So...

Read more »

Pirating Pirate Data for Pirate Day

September 21, 2017

This past Tuesday was Talk Like A Pirate Day, the unofficial holiday of R (aRRR!) users worldwide. In recognition of the day, Bob Rudis used R to create this...

Read more »

Exploratory Data Analysis of Tropical Storms in R

September 21, 2017

The disastrous impact of recent hurricanes, Harvey and Irma, generated a large influx of data within the online community. I was...

Read more »

Gold-Mining – Week 3 (2017)

September 21, 2017

Week 3 Gold Mining and Fantasy Football Projection Roundup now available. Go get that free agent gold! The post Gold-Mining – Week 3 (2017) appeared first on Fantasy Football Analytics.

Read more »

Don’t teach students the hard way first

September 21, 2017

Imagine you were going to a party in an unfamiliar area, and asked the host for directions to their house. It takes you thirty minutes to get there, on...

Read more »

ggformula: another option for teaching graphics in R to beginners

September 21, 2017

A previous entry (http://sas-and-r.blogspot.com/2017/07/options-for-teaching-r-to-beginners.html) describes an approach to teaching graphics in R that also “get students doing powerful things quickly”, as David Robinson suggested. In this guest blog entry, Randall Pruim...

Read more »

Comparing Trump and Clinton’s Facebook pages during the US presidential election, 2016

September 21, 2017

R has a lot of packages for users to analyse posts on social media. As an experiment in this field, I decided to start with the biggest one: Facebook....

Read more »

Visualizing the Spanish Contribution to The Metropolitan Museum of Art

September 21, 2017

“Well I walk upon the river like it’s easier than land” (Love is All, The Tallest Man on Earth). The Metropolitan Museum of Art provides here a dataset with...

Read more »
