August 2017

Why are Racing Drivers Born on March 23

August 31, 2017 | 0 Comments

Wikipedia has pages for each day of the year (e.g., January 1, April 25). Each page contains a list of names of famous people with that birthday along with a short description of each person. I wrote an R script to scrape these lists from each wikipedi...
[Read more...]

RcppAnnoy 0.0.9

August 31, 2017 | 0 Comments

An new version 0.0.9 of RcppAnnoy, our Rcpp-based R integration of the nifty Annoy library by Erik, is now on CRAN. Annoy is a small and lightweight C++ template header library for very fast approximate nearest neighbours. This release corrects an is... [Read more...]

OpenML Workshop 2017

August 31, 2017 | 0 Comments

What is OpenML? The field of Machine Learning has grown tremendously over the last years, and is a key component of data-driven science. Data analysis algorithms are being invented and used every day, but their results and experiments are published almost exclusively in journals or separated repositories. However, data by ... [Read more...]

Mapping to a ‘t'(map)

August 31, 2017 | 0 Comments

tmap More maps of the Highlands? Yep, same as last time, but no need to install dev versions of anything, we can get awesome maps courtesy of the tmap package. Get the shapefile from the last post library(tmap) library(tmaptools) library(viridis) scot
[Read more...]

Mapping to a ‘t'(map)

August 31, 2017 | 0 Comments

tmap More maps of the Highlands? Yep, same as last time, but no need to install dev versions of anything, we can get awesome maps courtesy of the tmap package. Get the shapefile from the last post library(tmap) library(tmaptools) library(viridis) scot [Read more...]

Multiplicative Congruential Generators in R

August 31, 2017 | 0 Comments

Part 2 of 2 in the series Random Number GenerationMultiplicative congruential generators, also known as Lehmer random number generators, is a type of linear congruential generator for generating pseudorandom numbers in . The multiplicative congruential generator, often abbreviated as MLCG or MCG, is defined as a recurrence relation similar to... The post Multiplicative ...
[Read more...]

Probability functions intermediate

August 31, 2017 | 0 Comments

In this set of exercises, we are going to explore some of the probability functions in R by using practical applications. Basic probability knowledge is required. In case you are not familiarized with the function apply, check the R documentation. Note: We are going to use random numbers functions and ... [Read more...]

Text featurization with the Microsoft ML package

August 31, 2017 | 0 Comments

Last week I wrote about how you can use the MicrosoftML package in Microsoft R to featurize images: reduce an image to a vector of 4096 numbers that quantify the essential characteristics of the image, according to an AI vision model. You can perform a similar featurization process with text as ... [Read more...]

Why to use the replyr R package

August 31, 2017 | 0 Comments

Recently I noticed that the R package sparklyr had the following odd behavior: suppressPackageStartupMessages(library("dplyr")) library("sparklyr") packageVersion("dplyr") #__ [1] '0.7.2.9000' packageVersion("sparklyr") #__ [1] '0.6.2' packageVersion("dbplyr") #__ [1] '1.1.0.9000' sc * Using Spark: 2.1.0 d [1] NA ncol(d) #__ [1] NA nrow(d) #__ [1] NA … Continue reading Why to use the replyr R package
[Read more...]

Pulling Data Out of Census Spreadsheets Using R

August 31, 2017 | 0 Comments

In this post, I show a method for extracting small amounts of data from somewhat large Census Bureau Excel spreadsheets, using R.  The objects of interest are expenditures of state and local governments on hospital capital in Iowa for the years 2004 to 2014. The data can be found at http://www2.... [Read more...]

Community Call – rOpenSci Software Review and Onboarding

August 31, 2017 | 0 Comments

Are you thinking about submitting a package to rOpenSci's open peer software review? Considering volunteering to review for the first time? Maybe you're an experienced package author or reviewer and have ideas about how we can improve. Join our Community Call on Wednesday, September 13th. We want to get your ... [Read more...]

Create and Update PowerPoint Reports using R

August 30, 2017 | 0 Comments

In my sordid past, I was a data science consultant. One thing about data science that they don’t teach you at school is that senior managers in most large companies require reports to be in PowerPoint....
[Read more...]

Pacific Island Hopping using R and iGraph

August 30, 2017 | 0 Comments

Use R as your travel guide and plan your next Pacific island hopping holiday with the igraph package. This code analyses flight routes and finds routes. Continue reading → The post Pacific Island Hopping using R and iGraph appeared first on The Devil is in the Data.
[Read more...]

Pacific Island Hopping using R and iGraph

August 30, 2017 | 0 Comments

Use R as your travel guide and plan your next Pacific island hopping holiday with the igraph package. This code analyses flight routes and finds routes. Continue reading → The post Pacific Island Hopping using R and iGraph appeared first on The Devil is in the Data.
[Read more...]

Project management for scalable data analysis

August 30, 2017 | 0 Comments

Project management This post can really be seen as an extension of the last post Getting started with R and RStudio in that we will be getting to know some more advanced, but indispensable features of RStudio. These features, in combination with some organizational and programming strategies, will enable us ...
[Read more...]

Finding distinct rows of a tibble

August 30, 2017 | 0 Comments

I’ve been using R or its predecessors for about 30 years, so I tend to I know a lot about R, but I don’t necessarily know how to use modern R tools. Lately, I’ve been teaching my students the tidyverse approach to data analysis, which means that I ... [Read more...]

Community Call – rOpenSci Software Review and Onboarding

August 30, 2017 | 0 Comments

Are you thinking about submitting a package to rOpenSci’s open peer software review? Considering volunteering to review for the first time? Maybe you’re an experienced package author or reviewer and have ideas about how we can improve. Join our Community Call on Wednesday, September 13th. We want to ... [Read more...]
1 2 3 17

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)