Suggests and Vignettes

March 24, 2017
By

Dirk Eddelbuettel quite rightly reminded us the other day that Suggests is not Depends. I am sorry to say that I am one of those who are using Suggests… “casually”. Mea culpa. I must say that this is restricted to vignettes: there are no tests nor examples using suggested packages. But I am not checking… Continuar leyendo...

Read more »

On a First Name Basis with Statistics Sweden

March 24, 2017
By
On a First Name Basis with Statistics Sweden

Abstract Jugding from recent R-Bloggers posts, it appears that many data scientists are concerned with scraping data from various media sources (Wikipedia, twitter, etc.). However, one should be aware that well structured and high quality datasets are available through state's and country's bureau of statistics. Increasingly these are offered to the public through direct database access, e.g., using a REST...

Read more »

Comparing subreddits, with Latent Semantic Analysis in R

March 24, 2017
By
Comparing subreddits, with Latent Semantic Analysis in R

FiveThirtyEight published a fascinating article this week about the subreddits that provided support to Donald Trump during his campaign, and continue to do so today. Reddit, for those not in the know, is an popular online social community organized into thousands of discussion topics, called subreddits (the names all begin with "r/"). Most of the subreddits are a useful...

Read more »

Building Shiny App Exercises (part-8)

March 24, 2017
By
Building Shiny App Exercises (part-8)

Transform your App into Dashboard Now that we covered the basic staff that you need to know in order to build your App it is time to enhance its appearance and its functionality. The interface is very important fot the user as it must not only be friendly but also easy to use. At this Related exercise sets:

Read more »

Superpixels in imager

March 24, 2017
By
Superpixels in imager

Superpixels are used in image segmentation as a pre-processing step. Instead of segmenting pixels directly, we first group similar pixels into “super-pixels”, which can then be processed further (and more cheaply). (image from Wikimedia) The current version of imager doesn’t implement them, but it turns out that SLIC superpixels are particularly easy to implement. SLIC

Read more »

Writing a conference abstract the data science way

March 24, 2017
By
Writing a conference abstract the data science way

Adnan Fiaz Conferences are an ideal platform to share your work with the wider community. However, as we all know, conferences require potential speakers to submit abstracts about their talk. And writing abstracts is not necessarily the most rewarding work … Continue reading →

Read more »

Some Random Weekend Reading

March 24, 2017
By

by Joseph Rickert Few of us have enough time to read, and most of us already have depressingly deep stacks of material that we would like to get through. However, sometimes a random encounter with something interesting is all that it takes to regenerate enthusiasm. Just in case you are not going to get to

Read more »

R Weekly Bulletin Vol – I

March 24, 2017
By
R Weekly Bulletin Vol – I

We are starting with R weekly bulletins which will contain some interesting ways and methods to write codes in R and solve bugging problems. We will also cover R functions and shortcut keys for beginners. We understand that there can be more than one way of writing a code in R, and the solutions listed... The post R...

Read more »

Web data acquisition: parsing json objects with tidyjson (Part 3)

March 24, 2017
By
Web data acquisition: parsing json objects with tidyjson (Part 3)

The collection of example flight data in json format available in part 2, described the libraries and the structure of the POST request necessary to collect data in a json object. Despite the process generated and transferred locally a proper response, the data collected were neither in a suitable structure for data analysis nor immediately … Continue...

Read more »

R – Change columns names in a spatial dataframe

March 24, 2017
By
R – Change columns names in a spatial dataframe

Ordnance Survey have a great OpenRoads dataset, but unfortunately it contains a column called ‘primary’, which is a keyword in SQL. This makes it challenging/impossible to import the OpenRoads dataset into a SQL… Continue reading →

Read more »

Neural Networks for Learning Lyrics

March 24, 2017
By
Neural Networks for Learning Lyrics

I created a Twitter account which was inspired by a couple Twitter accounts that applied a particular type of machine learning technique to learn how two (at the time) presidential hopefuls spoke. I thought, why not see what a model like this could do with lyrics from my favorite rock n roll artist?Long short term memory (LSTM) is a...

Read more »

Lesser known purrr tricks

purrr is a package that extends R’s functional programming capabilities. It brings a lot of new stuff to the table and in this post I show you some of the most useful (at least to me) functions included in purrr. Getting rid of loops with map() library(purrr) numbers <- list(11, 12, 13, 14) map_dbl(numbers, sqrt) ## 3.316625 3.464102...

Read more »

Lesser known purrr tricks

purrr is package that extends R’s functional programming capabilities. It brings a lot of new stuff to the table and in this post I show you some of the most useful (at least to me) functions included in purrr. Getting rid of loops with map() library(purrr) numbers <- list(11, 12, 13, 14) map_dbl(numbers, sqrt) ## 3.316625 3.464102 3.605551...

Read more »

US Counties – Race/Ethnicity (using choroplethr R package)

US Counties – Race/Ethnicity (using choroplethr R package)

As a statistician, I’ve always had a soft spot in my heart for the US Census. I love the rich data sets that are made publicly available and I’ve often experimented with visualizing the results. A couple of months ago, Ari Lamstein (formerly a data scientist at Trulia) released the choroplethr package on CRAN (a repository

Read more »

Emojis Analysis in R

March 23, 2017
By
Emojis Analysis in R

A while ago I developed and shared an emoji decoder because I was facing problems when retrieving data from Twitter and Instragram. In a nutshell, the issue is that R encodes emojis in a way that makes it a hassle identifying them. This is where t...

Read more »

RApiDatetime 0.0.1

March 23, 2017
By

Very happy to announce a new package of mine is now up on the CRAN repository network: RApiDatetime. It provides six entry points for C-level functions of the R API for Date and Datetime calculations: asPOSIXlt and asPOSIXct convert between long and compact datetime representation, formatPOSIXlt and Rstrptime convert to and from character strings, and POSIXlt2D and D2POSIXlt...

Read more »

Some Random Weekend Reading

March 23, 2017
By

Few of us have enough time to read, and most of us already have depressingly deep stacks of material that we would like to get through. However, sometimes a random encounter with something interesting is all that it takes to regenerate enthusiasm. Just in case you are not going to get to a book store with a good technical...

Read more »

Some Random Weekend Reading

March 23, 2017
By

Few of us have enough time to read, and most of us already have depressingly deep stacks of material that we would like to get through. However, sometimes a random encounter with something interesting is all that it takes to regenerate enthusiasm. Just in case you are not going to get to a book store with a good technical...

Read more »

Mapping Racial Segregation in Chicago

Mapping Racial Segregation in Chicago

This spring break I was reading a couple of papers about segregation in the US (Cutler and Galeser 1997 and Cutler et al 1999). A paragraph on Cutler et al (1999) caught my attention: “Chicago was the most segregated city in 1890, the fourth most segregated city in 1940, the most segregated MSA in 1970, and the fourth most...

Read more »

QR Decomposition with the Gram-Schmidt Algorithm

March 23, 2017
By

QR decomposition is another technique for decomposing a matrix into a form that is easier to work with in further applications. The QR decomposition technique decomposes a square or rectangular matrix, which we will denote as , into two components, , and . Where is an orthogonal matrix, and is... The post QR Decomposition with the Gram-Schmidt Algorithm...

Read more »

Announcing R Tools 1.0 for Visual Studio 2015

March 23, 2017
By
Announcing R Tools 1.0 for Visual Studio 2015

by Shahrokh Mortazavi, Partner PM, Visual Studio Cloud Platform Tools at Microsoft I’m delighted to announce the general availability of R Tools 1.0 for Visual Studio 2015 (RTVS). This release will be shortly followed by R Tools 1.0 for Visual Studio 2017 in early May. RTVS is a free and open source plug-in that turns Visual Studio into a...

Read more »

Survminer Cheatsheet to Create Easily Survival Plots

March 23, 2017
By
Survminer Cheatsheet to Create Easily Survival Plots

We recently released the survminer verion 0.3, which includes many new features to help in visualizing and sumarizing survival analysis results. In this article, we present a cheatsheet for...

Read more »

Make the [R] Kenntnis-Tage 2017 your stage

March 23, 2017
By
Make the [R] Kenntnis-Tage 2017 your stage

At the Kenntnis-Tage 2017 on November 8 and 9, 2017 you will get the chance to benefit not only from the exchange about the programming language R in a business context and practical tutorials but also from the audience: use the Kenntnis-Tage 2017 as your platform and hand in your topic for the …

Read more »

The Tidyverse Curse

March 23, 2017
By

I’ve just finished a major overhaul to my widely read article, Why R is Hard to Learn. It describes the main complaints I’ve heard from the participants to my workshops, and how those complaints can often be mitigated. Here’s the only … Continue reading →

Read more »

10 Million Dots: Mapping European Population

March 23, 2017
By
10 Million Dots: Mapping European Population

Creating a giant dot density map using R.

Read more »

Mean trophic levels of a genera from FishBase

March 22, 2017
By
Mean trophic levels of a genera from FishBase

Mean trophic levels of a genera from FishBase How would you selectively aggregate observations using R? For instance, say you have a table of trophic level estimates by fish species, but many species are missing values. For those species missing a val...

Read more »

New mlr Logo

March 22, 2017
By

We at mlr are currently deciding on a new logo, and in the spirit of open-source, we would like to involve the community in the voting process! You can vote for your favorite logo on GitHub by reacting to the logo with a +1. Thanks to Hannah Atkin fo...

Read more »

Euler Problem 17: Number Letter Counts

March 22, 2017
By
Euler Problem 17: Number Letter Counts

A solution in the R language to Euler Problem 17: If all the numbers from 1 to 1000 inclusive were written out in words, how many letters would be used? Continue reading → The post Euler Problem 17: Number Letter Counts appeared first on The Devil is in the Data.

Read more »

Data Visualization – Part 2

March 22, 2017
By
Data Visualization – Part 2

A Quick Overview of the ggplot2 Package in R While it will be important to focus on theory, I want to explain the ggplot2 package because I will be using it throughout the rest of this series. Knowing how it works will keep the focus on the results rather than the code. It’s an incredibly

Read more »

Sponsors

Mango solutions









Zero Inflated Models and Generalized Linear Mixed Models with R

r-brain.io



www.ama.org/events-training

Quantide: statistical consulting and training

ODSC1

ODSC2

datasociety

http://www.eoda.de





CRC R books series







Six Sigma Online Training





omictools

Contact us if you wish to help support R-bloggers, and place your banner here.