March 2019

Data Science Software Reviews: Forrester vs. Gartner

March 19, 2019 | Bob Muenchen

In my previous post, I discussed Gartner's reviews of data science software companies. In this post, I show Forrester's coverage and discuss how radically different it is. As usual, this post is already integrated into my regularly updated article, The Popularity of Data Science Software.
[Read more...]

The importance of Graphing Your Data – Anscombe’s Clever Quartet!

March 19, 2019 | Francis Smart

Francis Anscombe's seminal paper "Graphs in Statistical Analysis" (American Statistician, 1973) effectively makes the case that looking at summary statistics of data is insufficient to identify the relationship between variables. He demonstrates this by generating four different data sets (Anscombe's quartet) which have nearly identical summary statistics. His data have ...
[Read more...]
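The claim in the teaser above can be checked with a few lines of code. The following is a minimal sketch, not taken from the linked post: it hard-codes the quartet values as published by Anscombe (1973) and computes the means and the Pearson correlation using only the Python standard library. The names `QUARTET`, `pearson`, `summarize`, and `SUMMARIES` are illustrative choices made for this sketch.

```python
from math import sqrt

# Anscombe's quartet, values as published in Anscombe (1973).
# Sets I to III share the same x values; set IV has its own.
X123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
QUARTET = {
    "I": (X123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (X123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (X123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
           [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def mean(values):
    """Arithmetic mean of a non-empty sequence."""
    return sum(values) / len(values)


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def summarize(x, y):
    """Summary statistics that look almost the same across the four sets."""
    return {"mean_x": mean(x), "mean_y": mean(y), "r": pearson(x, y)}


SUMMARIES = {name: summarize(x, y) for name, (x, y) in QUARTET.items()}

if __name__ == "__main__":
    for name, s in SUMMARIES.items():
        print(f"{name:>3}: mean_x={s['mean_x']:.2f} "
              f"mean_y={s['mean_y']:.2f} r={s['r']:.3f}")
```

The four rows come out nearly identical, yet plotting each set shows four very different shapes, which is the point of the quartet.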

Scooters, mapped

March 19, 2019 | Tim Black

Do you know where all the scooters are in your city? (Figure: Available devices by provider in Los Angeles.) You may have been following the scooter craze or have seen scooters or e-bikes pop up on sidewalks throughout your neighborhood. Have you ever been curious about where all the scooters are located in ...
[Read more...]

Tidyverse users: gather/spread are on the way out

March 19, 2019 | John Mount

From https://twitter.com/sharon000/status/1107771331012108288: From https://tidyr.tidyverse.org/dev/articles/pivot.html: There are two important new features inspired by other R packages that have been advancing reshaping in R: The reshaping operation can be specified with a data frame that describes precisely how metadata stored ...
[Read more...]

Learning Data Science: Predicting Income Brackets

March 19, 2019 | Learning Machines

As promised in the post Learning Data Science: Modelling Basics, we will now go a step further and try to predict income brackets with real-world data and different modelling approaches. We will learn a thing or two along the way, e.g. about the so-called Accuracy-Interpretability Trade-Off, so read ... [Read more...]

Assumptions Matter More Than Dependencies

March 18, 2019 | hrbrmstr

There’s been a lot of talk about “dependencies” in the R universe of late. This is not really a post about that but more of a “really, don’t do this” if you decide you want to poke the dependency bear by trying to build a deeply flawed model off ... [Read more...]

Using Scoped dplyr verbs

March 18, 2019 | Rstats on goonR blog

Over the past several months, I have been using scoped dplyr verbs much more often. For those of you who don’t know about these functions, they are handy variants of the normal dplyr verbs, such as filter, mutate, and summarize, that allow ... [Read more...]

The Credibility Crisis in Data Science

March 18, 2019 | Ryan Sheehy

Hugo Bowne-Anderson, the host of DataFramed, the DataCamp podcast, recently interviewed Skipper Seabold, a Director of Data Science at Civis Analytics. Introducing Skipper Seabold Hugo: Hi there, Skipper, and welcome to Data Framed. Skipper: Thanks. Happy to be here. Hugo: Great to have you on the show. I'm really excited ...
[Read more...]

RStudio Connect Quickstart

March 18, 2019 | Ben Wolstenholme

RStudio have recently announced ‘RStudio Connect QuickStart’, a VM containing a full suite of RStudio’s pro tools that is available to trial for a 45-day period. RStudio Connect QuickStart gives R users, and people exploring the idea of using R in production, a quick and easy way to ...
[Read more...]

Quantifying R Package Dependency Risk

March 18, 2019 | John Mount

We recently commented on excess package dependencies as representing risk in the R package ecosystem. The question remains: how much risk? Is low dependency a mere talisman, or is there evidence it is a good practice (or at least correlates with other good practices)? Well, it turns out we can ...
[Read more...]

Handling & Sharing PCAPs Like a Boss with PacketTotal

March 17, 2019 | hrbrmstr

The fine folks over at @PacketTotal bestowed an API token on me, so I cranked out an R package for it to enable more dynamic investigations work (RStudio makes for an amazing incident responder investigations console given that you can script in multiple languages, code in C[++], and write documentation ... [Read more...]

R meta programmation

March 17, 2019 | Neonira

Lately, R metaprogramming seems to be in vogue. Very huge promises, encompassing code to produce code, variable indirection naming schemes, and many other marvelous features, are now available through the tidyverse. Indeed, very huge promises, are g...
[Read more...]

Diary of becoming a Bayesian

March 17, 2019 | Bluecology blog

Here’s my timeline of going from frequentist stats to becoming a fully fledged Bayesian eco-statistician. 2006: Read the Ecological Detective (Hilborn and Walters), the philosophy makes sense, the history of falsificatio... [Read more...]

Are R ecosystems the future?

March 17, 2019 | Jamie Lendrum

Some random thoughts… Over the past 6 months I’ve been creating, refining, and delivering a variety of ‘Introduction to R’ training courses. The more I do this, the more I come to the view that not nearly enough is made of taking an ecosystem-oriented view of packages. A good way ... [Read more...]