More book, more cricket! 2nd edition of my books now on Amazon

March 25, 2017

The 2nd editions of both my books, a) Cricket analytics with cricketr and b) Beaten by sheer pace – Cricket analytics with yorkr, are now available on Amazon, in both Paperback and Kindle versions. Pick up your copies today! A) Cricket analytics with cricketr: Second Edition B) Beaten by sheer pace: Cricket analytics with yorkr (2nd edition) … Continue...

Read more »

R<-Slovakia meetup started to build community in Bratislava

March 25, 2017

On 22 March, the first special R-related meetup, called R<-Slovakia, took place. As the name of the meetup group implies, it is based in Slovakia, in its capital, Bratislava. I am very happy to have been the first ever speaker at this event. R<-Slovakia ha...

Read more »

RApiDatetime 0.0.2

March 25, 2017

Two days after the initial 0.0.1 release, a new version of RApiDatetime has just arrived on CRAN. RApiDatetime provides six entry points for C-level functions of the R API for Date and Datetime calculations. The functions asPOSIXlt and asPOSIXct conv...

Read more »

Linear regression in “The Man who counted”

March 25, 2017

Recently, I got a book by the Brazilian writer Júlio César de Mello e Souza (published under the pen name Malba Tahan), titled The Man who counted. The book is a collection of mathematical stories very similar to Scheherazade’s 1001 Nights, where mathematical storytelling is at the center of the book. In story 5, “In so many words”, Malba describes … Continue...

Read more »

Perform pairwise Wilcoxon test, classify groups by significance and plot results

March 25, 2017

This post is the result of work performed in collaboration with my colleague Eleonora Paparelli (who actually did most of the work!). We wanted to compare several distributions using the Wilcoxon test and summarize the results (i.e. indicate the comparisons showing significant differences). Base R includes pairwise.wilcox.test to perform the Wilcoxon rank sum test between all pairs of

Read more »
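
As a rough illustration of the base R function the teaser mentions, the sketch below runs pairwise.wilcox.test on simulated data and flags which comparisons fall under a threshold. The data, the group names A, B and C, and the 0.05 cutoff are all assumptions made for this example; it is not the code from the post.

```r
# Simulated data (an assumption for illustration, not the post's data):
# groups A and B are close together, group C is shifted well away.
set.seed(42)
values <- c(rnorm(20, mean = 0), rnorm(20, mean = 0.2), rnorm(20, mean = 3))
groups <- factor(rep(c("A", "B", "C"), each = 20))

# Wilcoxon rank sum test between all pairs of groups; p-values are
# adjusted for multiple comparisons using Holm's method.
res <- pairwise.wilcox.test(values, groups, p.adjust.method = "holm")

# res$p.value is a lower-triangular matrix (rows B, C; columns A, B).
print(res$p.value)

# Flag the comparisons below the chosen threshold (0.05 is an assumption).
significant <- res$p.value < 0.05
print(significant)
```

The unused upper-triangle cell (row B, column B) is NA, so any downstream summary of the flags has to allow for missing values.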

Debugging Pipelines in R with Bizarro Pipe and Eager Assignment

March 25, 2017

This is a note on debugging magrittr pipelines in R using Bizarro Pipe and eager assignment. Pipes in R: The magrittr R package supplies an operator called “pipe”, which is written as “%>%”. The pipe operator is well known partly because of its extensive use in dplyr and by dplyr users. The pipe operator is … Continue...

Read more »
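
The teaser is cut off before it explains the trick, so the following is only a minimal sketch of what is commonly called the Bizarro pipe: base R's right-assignment arrow targeting a variable named ".", followed by a semicolon. The input vector and the variable name result are assumptions for this example, and no magrittr call is involved.

```r
# "Bizarro pipe": not a real operator, just base R right assignment
# ("->") into the variable named "." followed by ";" to end the statement.
c(1, 4, 9, 16) ->.;
sqrt(.) ->.;
sum(.) -> result

# Each line is an ordinary statement, so the pipeline can be run one
# step at a time and the intermediate value "." inspected in between.
print(.)
print(result)
```

Because every stage is a separate statement, an error or a surprising value can be traced to a specific line, which is the debugging advantage the post's title refers to.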

New Zealand election forecasts

March 25, 2017

Over the weekend I released a new webpage, connected to this blog, with forecasts for the New Zealand 2017 General Election. The aim is to go beyond poll aggregation to something that takes the uncertainty of the future into account, as well as relati...

Read more »

Suggests and Vignettes

March 24, 2017

Dirk Eddelbuettel quite rightly reminded us the other day that Suggests is not Depends. I am sorry to say that I am one of those who are using Suggests… “casually”. Mea culpa. I must say that this is restricted to vignettes: there are no tests nor examples using suggested packages. But I am not checking… Continue reading...

Read more »

On a First Name Basis with Statistics Sweden

March 24, 2017

Abstract: Judging from recent R-Bloggers posts, it appears that many data scientists are concerned with scraping data from various media sources (Wikipedia, Twitter, etc.). However, one should be aware that well-structured, high-quality datasets are available through state and national bureaus of statistics. Increasingly these are offered to the public through direct database access, e.g., using a REST...

Read more »

Comparing subreddits, with Latent Semantic Analysis in R

March 24, 2017

FiveThirtyEight published a fascinating article this week about the subreddits that provided support to Donald Trump during his campaign, and continue to do so today. Reddit, for those not in the know, is a popular online social community organized into thousands of discussion topics, called subreddits (the names all begin with "r/"). Most of the subreddits are a useful...

Read more »

Building Shiny App Exercises (part-8)

March 24, 2017

Transform your App into a Dashboard: Now that we have covered the basic stuff you need to know in order to build your App, it is time to enhance its appearance and its functionality. The interface is very important for the user, as it must be not only friendly but also easy to use. At this...

Read more »

Superpixels in imager

March 24, 2017

Superpixels are used in image segmentation as a pre-processing step. Instead of segmenting pixels directly, we first group similar pixels into “super-pixels”, which can then be processed further (and more cheaply). The current version of imager doesn’t implement them, but it turns out that SLIC superpixels are particularly easy to implement. SLIC

Read more »

Writing a conference abstract the data science way

March 24, 2017

by Adnan Fiaz. Conferences are an ideal platform to share your work with the wider community. However, as we all know, conferences require potential speakers to submit abstracts about their talk. And writing abstracts is not necessarily the most rewarding work … Continue reading →

Read more »

Some Random Weekend Reading

March 24, 2017

by Joseph Rickert. Few of us have enough time to read, and most of us already have depressingly deep stacks of material that we would like to get through. However, sometimes a random encounter with something interesting is all that it takes to regenerate enthusiasm. Just in case you are not going to get to

Read more »

R Weekly Bulletin Vol – I

March 24, 2017

We are starting a series of R weekly bulletins, which will contain some interesting ways and methods to write code in R and solve nagging problems. We will also cover R functions and shortcut keys for beginners. We understand that there can be more than one way of writing code in R, and the solutions listed... The post R...

Read more »

Web data acquisition: parsing json objects with tidyjson (Part 3)

March 24, 2017

Part 2, which covered the collection of example flight data in JSON format, described the libraries and the structure of the POST request necessary to collect data in a JSON object. Although the process generated a proper response and transferred it locally, the data collected were neither in a suitable structure for data analysis nor immediately … Continue...

Read more »

R – Change columns names in a spatial dataframe

March 24, 2017

Ordnance Survey have a great OpenRoads dataset, but unfortunately it contains a column called ‘primary’, which is a keyword in SQL. This makes it challenging/impossible to import the OpenRoads dataset into a SQL… Continue reading →

Read more »

Neural Networks for Learning Lyrics

March 24, 2017

I created a Twitter account inspired by a couple of Twitter accounts that applied a particular type of machine learning technique to learn how two (at the time) presidential hopefuls spoke. I thought, why not see what a model like this could do with lyrics from my favorite rock n roll artist? Long short-term memory (LSTM) is a...

Read more »

Lesser known purrr tricks

purrr is a package that extends R’s functional programming capabilities. It brings a lot of new stuff to the table and in this post I show you some of the most useful (at least to me) functions included in purrr. Getting rid of loops with map() library(purrr) numbers <- list(11, 12, 13, 14) map_dbl(numbers, sqrt) ## 3.316625 3.464102...

Read more »

US Counties – Race/Ethnicity (using choroplethr R package)

As a statistician, I’ve always had a soft spot in my heart for the US Census. I love the rich data sets that are made publicly available and I’ve often experimented with visualizing the results. A couple of months ago, Ari Lamstein (formerly a data scientist at Trulia) released the choroplethr package on CRAN (a repository

Read more »

Emojis Analysis in R

March 23, 2017

A while ago I developed and shared an emoji decoder because I was facing problems when retrieving data from Twitter and Instagram. In a nutshell, the issue is that R encodes emojis in a way that makes it a hassle to identify them. This is where t...

Read more »

RApiDatetime 0.0.1

March 23, 2017

Very happy to announce a new package of mine is now up on the CRAN repository network: RApiDatetime. It provides six entry points for C-level functions of the R API for Date and Datetime calculations: asPOSIXlt and asPOSIXct convert between long and compact datetime representation, formatPOSIXlt and Rstrptime convert to and from character strings, and POSIXlt2D and D2POSIXlt...

Read more »

Some Random Weekend Reading

March 23, 2017

Few of us have enough time to read, and most of us already have depressingly deep stacks of material that we would like to get through. However, sometimes a random encounter with something interesting is all that it takes to regenerate enthusiasm. Just in case you are not going to get to a book store with a good technical...

Read more »

Mapping Racial Segregation in Chicago

This spring break I was reading a couple of papers about segregation in the US (Cutler and Glaeser 1997 and Cutler et al 1999). A paragraph in Cutler et al (1999) caught my attention: “Chicago was the most segregated city in 1890, the fourth most segregated city in 1940, the most segregated MSA in 1970, and the fourth most...

Read more »

QR Decomposition with the Gram-Schmidt Algorithm

March 23, 2017

QR decomposition is another technique for decomposing a matrix into a form that is easier to work with in further applications. The QR decomposition technique decomposes a square or rectangular matrix, which we will denote as A, into two components, Q and R, where Q is an orthogonal matrix, and R is... The post QR Decomposition with the Gram-Schmidt Algorithm...

Read more »
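
To make the decomposition named in the title concrete, here is a minimal base R sketch of classical Gram-Schmidt. It is not the code from the post: the function name gram_schmidt_qr and the example matrix are assumptions, and the sketch assumes the input has full column rank, so no zero-norm check is included.

```r
# Classical Gram-Schmidt QR (illustrative sketch; assumes the columns
# of A are linearly independent).
gram_schmidt_qr <- function(A) {
  n <- ncol(A)
  Q <- matrix(0, nrow = nrow(A), ncol = n)
  R <- matrix(0, nrow = n, ncol = n)
  for (j in seq_len(n)) {
    v <- A[, j]
    # Subtract the projections onto the columns of Q built so far.
    for (i in seq_len(j - 1)) {
      R[i, j] <- sum(Q[, i] * A[, j])
      v <- v - R[i, j] * Q[, i]
    }
    # Normalise what is left to obtain the next orthonormal column.
    R[j, j] <- sqrt(sum(v^2))
    Q[, j] <- v / R[j, j]
  }
  list(Q = Q, R = R)
}

# Hypothetical example matrix with linearly independent columns.
A <- matrix(c(1, 1, 0,
              1, 0, 1,
              0, 1, 1), nrow = 3)
decomp <- gram_schmidt_qr(A)

# Q has orthonormal columns, R is upper triangular, and Q %*% R rebuilds A.
print(all.equal(decomp$Q %*% decomp$R, A))
print(all.equal(crossprod(decomp$Q), diag(3)))
```

Classical Gram-Schmidt can lose orthogonality on ill-conditioned matrices, so base R's qr() is usually the safer choice in practice.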

Announcing R Tools 1.0 for Visual Studio 2015

March 23, 2017

by Shahrokh Mortazavi, Partner PM, Visual Studio Cloud Platform Tools at Microsoft. I’m delighted to announce the general availability of R Tools 1.0 for Visual Studio 2015 (RTVS). This release will be shortly followed by R Tools 1.0 for Visual Studio 2017 in early May. RTVS is a free and open source plug-in that turns Visual Studio into a...

Read more »
