Articles by David Smith

Preview of EARL London 2017

September 5, 2017 | David Smith

The next event in the Effective Applications of the R Language (EARL) conference series takes place next week, with EARL London 2017. The EARL conference series got its start in London, and the London event remains the biggest and brightest of the venues. This year's program is no exception, with an ... [Read more...]

Practical Data Science for Stats

September 1, 2017 | David Smith

PeerJ Preprints has recently published a collection of articles that focus on the practical side of statistical analysis: Practical Data Science for Stats. While the articles are not peer-reviewed, they have been selected and edited by Jennifer Bryan and Hadley Wickham, both well-respected members of the R community. And while ... [Read more...]

Text featurization with the Microsoft ML package

August 31, 2017 | David Smith

Last week I wrote about how you can use the MicrosoftML package in Microsoft R to featurize images: reduce an image to a vector of 4096 numbers that quantify the essential characteristics of the image, according to an AI vision model. You can perform a similar featurization process with text as ... [Read more...]

Probably more likely than probable

August 30, 2017 | David Smith

What kind of probability are people talking about when they say something is "highly likely" or has "almost no chance"? The chart below, created by Reddit user zonination, visualizes the responses of 46 other Reddit users to "What probability would you assign to the phase: " for various statements of probability. Each ... [Read more...]

3-D animations with R

August 30, 2017 | David Smith

R is often used to visualize and animate 2-dimensional data. (Here are just a few examples.) But did you know you can create 3-dimensional animations as well? As Thomas Lins Pedersen explains in a recent blog post, the trick is in using the persp function to translate points in 3-D ... [Read more...]

Packages to simplify mapping in R

August 28, 2017 | David Smith

Computerworld's Sharon Machlis has published a very useful tutorial on creating geographic data maps with R. (The tutorial was actually published back in March, but I only came across it recently.) While it's been possible to create maps in R for a long time, some recent packages and data APIs ... [Read more...]

Tips and tricks on using R to query data in Power BI

August 25, 2017 | David Smith

In Power BI, the dashboarding and reporting tool, you can use R to filter, transform, or restructure data via the Query Editor. For example, you could use the mice package to impute missing values, or use the tidytext package to assign sentiment scores to text inputs. As Imke Feldmann explains, ... [Read more...]

Recreating and updating Minard with ggplot2

August 23, 2017 | David Smith

Minard's chart depicting Napoleon's 1812 march on Russia is a classic of data visualization that has inspired many homages using different time-and-place data. If you'd like to recreate the original chart, or create one of your own, Andrew Heiss has created a tutorial on using the ggplot2 package to re-envision the ... [Read more...]

Gender roles in film direction, analyzed with R

August 22, 2017 | David Smith

What do women do in films? If you analyze the stage directions in film scripts — as Julia Silge, Russell Goldenberg and Amber Thomas have done for this visual essay for ThePudding — it seems that women (but not men) are written to snuggle, giggle and squeal, while men (but not women) ... [Read more...]

Obstacles to performance in parallel programming

August 18, 2017 | David Smith

Making your code run faster is often the primary goal when using parallel programming techniques in R, but sometimes the effort of converting your code to use a parallel framework leads only to disappointment, at least initially. Norman Matloff, author of Parallel Computing for Data Science: With Examples in R, ... [Read more...]

20 years of the R Core Group

August 17, 2017 | David Smith

The first "official" version of R, version 1.0.0, was released on February 29, 200. But the R Project had already been underway for several years before then. Sharing this tweet, from yesterday, from R Core member Peter Dalgaard: It was twenty years ago today, Ross Ihaka got the band to play.... #rstats pic.... [Read more...]

How to build an image recognizer in R using just a few images

August 16, 2017 | David Smith

Microsoft Cognitive Services provides several APIs for image recognition, but if you want to build your own recognizer (or create one that works offline), you can use the new Image Featurizer capabilities of Microsoft R Server. The process of training an image recognition system requires LOTS of images — millions and ... [Read more...]

Buzzfeed trains an AI to find spy planes

August 15, 2017 | David Smith

Last year, Buzzfeed broke the story that US law enforcement agencies were using small aircraft to observe points of interest in US cities, thanks to analysis of public flight-records data. With the data journalism team no doubt realizing that the Flightradar24 data set hosted many more stories of public interest, ... [Read more...]

In case you missed it: July 2017 roundup

August 11, 2017 | David Smith

In case you missed them, here are some articles from July of particular interest to R users. A tutorial on using the rsparkling package to apply H20's algorithms to data in HDInsight. Several exercises to learn parallel programming with the foreach package. A presentation on the R6 class system, ... [Read more...]

How to make best use of the byte compiler in R

August 7, 2017 | David Smith

Tomas Kalibera, the newest member of the R Core Team, has been working for the last several years with fellow Core Team member Luke Tierney implementing R's byte-code compiler and interpreter. Byte-compiling R code often improves its speed of execution, and usually happens without you having to take any explicit ... [Read more...]

Painting with Data

August 4, 2017 | David Smith

The accidental aRt tumblr (mentioned here a few years ago) continues to provide a steady stream of images that wouldn't look out of place in a modern art gallery, but which in fact are data visualizations (mostly attempted in R), gone wrong. (Here's a typical recent entry.) But now, Giora ... [Read more...]

Text categorization with deep learning, in R

August 3, 2017 | David Smith

Given a short review of a product, like "I couldn't put it down!", can you predict what the product is? In that case it's pretty easy — it's for a book — but this general problem of text categorization comes up in a lot of natural language analysis problems. In his talk ... [Read more...]
1 10 11 12 13 14 94

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)