## Data science for Doctors: Inferential Statistics Exercises (part-3)

March 27, 2017
Data science enhances people’s decision making. Doctors and researchers make critical decisions every day, so it is essential for them to have some basic knowledge of data science. This series aims to help people working in and around the medical field to enhance their data science skills. We will work with a health related...

## An Introduction to Stock Market Data Analysis with R (Part 1)

March 27, 2017
This is my first article in a two-part series introducing stock data analysis using R.

## Analyzing Accupedo step count data in R: Part 2 – Adding weather data

March 27, 2017
In my last set of posts, I wrote about analyzing data from the Accupedo step counter app I have on my phone. In this post, I'll talk about some additional analysis I've done by merging the step counter data with weather data from another source. The website www.wunderground.com has freely available weather data for most parts of the...

## #0: Introducing R^4

March 27, 2017
So I had been toying with the idea of getting back to the blog and more regularly writing / posting little tips and tricks. I even started taking some notes, but because perfect is always the enemy of the good, it never quite materialized. But the rel...

## Introducing brotools

I’m happy to announce my first R package, called brotools. This is a package that contains functions that are specific to my needs but that you might also find useful. I blogged about some of these functions, so if you follow my blog you might already be familiar with some of...

## Fitting Bayesian Linear Mixed Models for continuous and binary data using Stan: A quick tutorial

March 27, 2017
I want to give a quick tutorial on fitting Linear Mixed Models (hierarchical models) with a full variance-covariance matrix for random effects (what Barr et al 2013 call a maximal model) using Stan. For a longer version of this tutorial, see: Sorensen, Hohenstein, Vasishth, 2016. Prerequisites: You need to have R and preferably RStudio installed;...

## Updated Shiny app

March 26, 2017
A short post to alert the world that my modest Shiny application, showing Major League Baseball run scoring trends since 1901, has been updated to include the 2016 season. The application can be found here: https://monkmanmh.shinyapps.io/MLBrunscoring_shiny/. In addition to the underlying data, the update removed some of the processing that was happening inside the application, and put it into...

## Le Monde puzzle [#1001]

March 26, 2017
After a long lag (due to my missing the free copies distributed at Paris-Dauphine!), here is a Sudoku-like Le Monde mathematical puzzle: A grid of size (n,n) holds integer values such that any entry larger than 1 is the sum of one term in the same column and one term in the same row. What

## RcppTOML 0.1.2

March 26, 2017
A new release of RcppTOML is now on CRAN. This release fixes a few parsing issues for less frequently-used inputs: vectors of boolean or date(time) types, as well as table array input. RcppTOML brings TOML to R. TOML is a file format that is most suitable for configurations, as it is meant to be edited...

## Economics chapter added to “Empirical software engineering using R”

March 26, 2017
The Economics chapter of my Empirical software engineering book has been added to the draft pdf (download here). This is a slim chapter; it might grow a bit, but I suspect not by a huge amount. Reasons include lots of interesting data being confidential and me not having spent a lot of time on this...

## Using R to study the Yemen Conflict with night light images

March 26, 2017
The Yemeni civil war has received very little attention despite the growing humanitarian disaster. There is a lack of reliable figures on the extent of the human suffering in Yemen. The little data that is available suggests that it is immense. According to the UN, from March 2015 to August 2016, over 10,000 people have...

## New book and package pmfdR

March 26, 2017
Processing and modelling financial data with R - My Portuguese book about finance and R was published a couple of months ago and, given its positive feedback, I decided to work on the English version immediately. You can find det...

## More book, more cricket! 2nd edition of my books now on Amazon

March 25, 2017
The 2nd edition of both my books, a) Cricket analytics with cricketr and b) Beaten by sheer pace – Cricket analytics with yorkr, is now available on Amazon, in both Paperback and Kindle versions. Pick up your copies today! A) Cricket analytics with cricketr: Second Edition B) Beaten by sheer pace: Cricket analytics with yorkr (2nd edition)...

## R<-Slovakia meetup started to build community in Bratislava

March 25, 2017
On 22 March, the first R-related meetup, called R<-Slovakia, took place. As the name of the meetup group implies, it is based in Slovakia, in its capital, Bratislava. I am very happy to have been the first-ever speaker at this event. R<-Slovakia ha...

## RApiDatetime 0.0.2

March 25, 2017
Two days after the initial 0.0.1 release, a new version of RApiDatetime has just arrived on CRAN. RApiDatetime provides six entry points for C-level functions of the R API for Date and Datetime calculations. The functions asPOSIXlt and asPOSIXct conv...

## Linear regression in “The Man who counted”

March 25, 2017
Recently, I got a book by the Brazilian writer Júlio César de Mello e Souza (published under the pen name Malba Tahan), titled The Man who counted. The book is a collection of mathematical stories very similar to Scheherazade’s 1001 Nights, where mathematical story-telling is the center of the book. In story 5, “In so many words”, Malba describes...

## Perform pairwise Wilcoxon test, classify groups by significance and plot results

March 25, 2017
This post is the result of work performed in collaboration with my colleague Eleonora Paparelli (who actually did most of the work!). We wanted to compare several distributions using the Wilcoxon test and summarize the results (i.e. indicate the comparisons showing significant differences). Base R includes pairwise.wilcox.test to perform the Wilcoxon rank sum test between all pairs of...
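As a rough illustration of the function mentioned in this excerpt, here is a minimal sketch of `pairwise.wilcox.test` from base R's stats package. The simulated data, the group labels, and the choice of Holm adjustment are assumptions made for the example and are not taken from the original post.

```r
# Simulated data for three hypothetical groups (not from the original post)
set.seed(42)
values <- c(rnorm(20, mean = 0), rnorm(20, mean = 0.2), rnorm(20, mean = 3))
groups <- factor(rep(c("A", "B", "C"), each = 20))

# Wilcoxon rank sum test between all pairs of groups,
# with Holm-adjusted p-values (an assumed choice of adjustment)
res <- pairwise.wilcox.test(values, groups, p.adjust.method = "holm")

# res$p.value is a lower-triangular matrix: rows B and C, columns A and B
print(res$p.value)

# Flag the comparisons that are significant at the 5% level
# (NA marks the unused upper-triangle cell)
significant <- res$p.value < 0.05
print(significant)
```

The logical matrix `significant` is one simple starting point for summarizing which pairs differ; how the post itself classifies groups is not shown in the excerpt.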

## Debugging Pipelines in R with Bizarro Pipe and Eager Assignment

March 25, 2017
This is a note on debugging magrittr pipelines in R using Bizarro Pipe and eager assignment. Pipes in R: The magrittr R package supplies an operator called “pipe”, which is written as “%>%”. The pipe operator is partly famous due to its extensive use in dplyr and use by dplyr users. The pipe operator is...
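To make the two notations concrete, here is a minimal sketch comparing a magrittr pipeline with the same steps written using the "Bizarro pipe" idiom (`->.;`, which is ordinary right-assignment to a variable named `.` followed by a semicolon). The input vector and the variable names are assumptions chosen for the example, and the sketch assumes the magrittr package is installed.

```r
library(magrittr)

# A magrittr pipeline: each step receives the previous result
piped <- c(1, 4, 9) %>% sqrt() %>% sum()

# The same steps using the Bizarro pipe idiom: each line assigns its
# result to the variable "." so intermediate values can be inspected
c(1, 4, 9) ->.;
sqrt(.) ->.;
sum(.) -> bizarro

print(piped)
print(bizarro)
```

Because each Bizarro-pipe line is a standalone statement, it can be run one at a time in a debugger or console, and the value of `.` can be examined between steps.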

## New Zealand election forecasts

March 25, 2017
Over the weekend I released a new webpage, connected to this blog, with forecasts for the New Zealand 2017 General Election. The aim is to go beyond poll aggregation to something that takes the uncertainty of the future into account, as well as relati...

## Suggests and Vignettes

March 24, 2017
Dirk Eddelbuettel quite rightly reminded us the other day that Suggests is not Depends. I am sorry to say that I am one of those who are using Suggests… “casually”. Mea culpa. I must say that this is restricted to vignettes: there are neither tests nor examples using suggested packages. But I am not checking...

## On a First Name Basis with Statistics Sweden

March 24, 2017
Abstract: Judging from recent R-Bloggers posts, it appears that many data scientists are concerned with scraping data from various media sources (Wikipedia, Twitter, etc.). However, one should be aware that well structured and high quality datasets are available through state and national bureaus of statistics. Increasingly these are offered to the public through direct database access, e.g., using a REST...

## Comparing subreddits, with Latent Semantic Analysis in R

March 24, 2017
FiveThirtyEight published a fascinating article this week about the subreddits that provided support to Donald Trump during his campaign, and continue to do so today. Reddit, for those not in the know, is a popular online social community organized into thousands of discussion topics, called subreddits (the names all begin with "r/"). Most of the subreddits are a useful...

## Building Shiny App Exercises (part-8)

March 24, 2017
Transform your App into a Dashboard: Now that we have covered the basic stuff that you need to know in order to build your App, it is time to enhance its appearance and its functionality. The interface is very important for the user, as it must be not only friendly but also easy to use. At this...

## Superpixels in imager

March 24, 2017
Superpixels are used in image segmentation as a pre-processing step. Instead of segmenting pixels directly, we first group similar pixels into “super-pixels”, which can then be processed further (and more cheaply). The current version of imager doesn’t implement them, but it turns out that SLIC superpixels are particularly easy to implement. SLIC...

## Writing a conference abstract the data science way

March 24, 2017
by Adnan Fiaz. Conferences are an ideal platform to share your work with the wider community. However, as we all know, conferences require potential speakers to submit abstracts about their talk. And writing abstracts is not necessarily the most rewarding work...

March 24, 2017
by Joseph Rickert. Few of us have enough time to read, and most of us already have depressingly deep stacks of material that we would like to get through. However, sometimes a random encounter with something interesting is all that it takes to regenerate enthusiasm. Just in case you are not going to get to...

## R Weekly Bulletin Vol – I

March 24, 2017
We are starting R weekly bulletins, which will contain some interesting ways and methods to write code in R and solve nagging problems. We will also cover R functions and shortcut keys for beginners. We understand that there can be more than one way of writing code in R, and the solutions listed...

## Web data acquisition: parsing json objects with tidyjson (Part 3)

March 24, 2017
Part 2, on the collection of example flight data in JSON format, described the libraries and the structure of the POST request necessary to collect data in a JSON object. Although the process generated a proper response and transferred it locally, the data collected were neither in a suitable structure for data analysis nor immediately...