Articles by David Smith

In case you missed it: October 2017 roundup

November 7, 2017 | David Smith

In case you missed them, here are some articles from October of particular interest to R users. A recent survey of competitors on the Kaggle platform reveals that Python (76%) and R (59%) are the preferred tools for building predictive models. Microsoft's "Team Data Science Process" has been updated with new guidelines ... [Read more...]

A history-oriented introduction to R for Excel users

November 6, 2017 | David Smith

While spreadsheets are fine tools for collecting and sharing data, the temptation is often there to also use them for in-depth analysis better suited to reproducible systems like R. Historian Jesse Sadler recently published the useful guide Excel vs R: A Brief Introduction to R, which provides useful advice to ... [Read more...]

New RStudio cheat sheet: Strings in R

November 3, 2017 | David Smith

The RStudio team has created another very useful cheat sheet for R: Working with Strings. This cheat sheet provides an example-laden menu of operations you can perform on strings (character vectors) in R using the stringr package. While base R provides a solid set of string manipulation functions, the stringr ... [Read more...]

R: the least disliked programming language

November 1, 2017 | David Smith

According to a recent analysis of Stack Overflow "Developer Stories", where programmer candidates list the technologies they would and would not like to work with, R is the least disliked programming language: This is probably related to the fact that there's high demand in the job market for fast-growing technologies, ... [Read more...]

Survey of Kagglers finds Python, R to be preferred tools

October 31, 2017 | David Smith

Competitive predictive modeling site Kaggle conducted a survey of participants in prediction competitions, and the 16,000 responses provide some insights about that user community. (Whether those trends generalize to the wider community of all data scientists is unclear, however.) One question of interest asked what tools Kagglers use at work. Python ... [Read more...]

Recent updates to the Team Data Science Process

October 30, 2017 | David Smith

It's been over a year since we first introduced the Team Data Science Process (TDSP). The data, technology and practices behind Data Science continue to evolve, and the TDSP has evolved in parallel. Over the past year, several new facets have been added, including: The IDEAR (Interactive Data Exploration, ... [Read more...]

Microsoft R Open 3.4.2 now available

October 27, 2017 | David Smith

Microsoft R Open (MRO), Microsoft's enhanced distribution of open source R, has been upgraded to version 3.4.2 and is now available for download for Windows, Mac, and Linux. This update upgrades the R language engine to the latest R 3.4.2 and updates the bundled packages. MRO is 100% compatible with all R packages. ... [Read more...]

Two upcoming webinars

October 25, 2017 | David Smith

Two new Microsoft webinars are taking place over the next week that may be of interest: AI Development in Azure using Data Science Virtual Machines The Azure Data Science Virtual Machine (DSVM) provides a comprehensive development and production environment to Data Scientists and AI-savvy developers. DSVMs are specialized virtual machine ... [Read more...]

Create editable Microsoft Office charts from R

October 24, 2017 | David Smith

R has a rich and infinitely flexible graphics system, and you can easily embed R graphics into Microsoft Office documents like PowerPoint or Word. The one thing I dread hearing after delivering such a document, though, is "how can I tweak that graphic?". I could change the colors or fonts ... [Read more...]

Statistical Machine Learning with Microsoft ML

October 23, 2017 | David Smith

MicrosoftML is an R package for machine learning that works in tandem with the RevoScaleR package. (In order to use the MicrosoftML and RevoScaleR libraries, you need an installation of Microsoft Machine Learning Server or Microsoft R Client.) A great way to see what MicrosoftML can do is to take ... [Read more...]

An Updated History of R

October 20, 2017 | David Smith

Here's a refresher on the history of the R project: 1992: R development begins as a research project in Auckland, NZ by Robert Gentleman and Ross Ihaka 1993: First binary versions of R published at Statlib [see update, below] 1995: R first distributed as open-source software, under GPL2 license 1997: R core group formed 1997: ... [Read more...]

The R manuals in bookdown format

October 19, 2017 | David Smith

While there are hundreds of excellent books and websites devoted to R, the canonical source of truth regarding the R system remains the R manuals. You can find the manuals at your local CRAN mirror and on your laptop as part of the R distribution (try Help > Manuals in RGui, ... [Read more...]

Is it faster to take a bike or taxi in NYC?

October 18, 2017 | David Smith

Taxis are plentiful and convenient in New York City, but the city is also served by a wide network of commuter bicycles (Citi Bikes). If you need to get from, say, the West Village to the Garment District, are you better off time-wise hailing a cab, or heading over to ... [Read more...]

My interview with ROpenSci

October 16, 2017 | David Smith

The ROpenSci team has started publishing a new series of interviews with the goal of “demystifying the creative and development processes of R community members”. I had the great pleasure of being interviewed by Kelly O'Briant earlier this year, and the interview was published on Friday. Thanks for being a ... [Read more...]

An AI pitches startup ideas

October 13, 2017 | David Smith

Take a look at this list of 13 hot startups, from a list compiled by Alex Bresler. Perhaps one of them is the next Juicero? FAR ATHERA: A CLINICAL AI PLATFORM THAT CAN BE ACCESSED ON DEMAND. ZAPSY: TRY-AT-HOME SERVICE FOR CONSUMER ELECTRONICS. MADESS: ON-DEMAND ACCESS TO CLEAN WATER. DEERG: AI ... [Read more...]

A cRyptic crossword with an R twist

October 12, 2017 | David Smith

Last week's R-themed crossword from R-Ladies DC was popular, so here's another R-related crossword, this time by Barry Rowlingson and published on page 39 of the June 2003 issue of R-news (now known as the R Journal). Unlike the last crossword, this one follows the conventions of a British cryptic crossword: the ... [Read more...]

Tutorial: Azure Data Lake analytics with R

October 11, 2017 | David Smith

The Azure Data Lake store is an Apache Hadoop file system compatible with HDFS, hosted and managed in the Azure Cloud. You can store and access the data within directly via the API, by connecting the filesystem directly to Azure HDInsight services, or via HDFS-compatible open-source applications. And for data ... [Read more...]

R’s remarkable growth

October 10, 2017 | David Smith

Python has been getting some attention recently for its impressive growth in usage. Since both R and Python are used for data science, I sometimes get asked if R is falling by the wayside, or if R developers should switch course and learn Python. My answer to both questions is ... [Read more...]

A cRossword about R

October 6, 2017 | David Smith

The members of the R Ladies DC user group put together an R-themed crossword for a recent networking event. It's a fun way to test out your R knowledge. (Click to enlarge, or download a printable version here.) If you get stuck, you can find the answers here or at ... [Read more...]

In case you missed it: September 2017 roundup

October 5, 2017 | David Smith

In case you missed them, here are some articles from September of particular interest to R users. The mathpix package converts images of hand-drawn equations to their LaTeX equivalent. R 3.4.2 is released. Applying image featurization to the problem of classifying wood knots in lumber. Microsoft ML Server 9.2, which provides operationalization ... [Read more...]
