October 2018

Designing Transforms for Data Reshaping with cdata

October 25, 2018 | John Mount

Authors: John Mount, and Nina Zumel 2018-10-25 As a followup to our previous post, this post goes a bit deeper into reasoning about data transforms using the cdata package. The cdata packages demonstrates the "coordinatized data" theory and includes an implementation of the "fluid data" methodology for general data ...
[Read more...]

My fi(R)st day with Jupyter Lab

October 25, 2018 | paoloeusebi

Today was my first day with Jupyter Lab . I knew about Jupyter Lab listening episode 44 of  DataFramed, the DataCamp‘s official podcast presented by Hugo Bowne-Anderson. In this episode, Project Jupiter was described in the context of interactive computing by Brian Granger, professor of physics and data science at Cal ...
[Read more...]

Getting started Stamen maps with ggmap

October 25, 2018 | kjytay

Spatial visualizations really come to life when you have a real map as a background. In R, ggmap is the package that you’ll want to use to get these maps. In what follows, we’ll demonstrate how to use ggmap with … Continue reading →
[Read more...]

How DataCamp Handles Course Quality

October 25, 2018 | Richie Cotton

At DataCamp, we pride ourselves on having the best platform and the best curriculum for learning data science. To this end, we put a lot of effort into ensuring that every exercise is both effective at educating and enjoyable. After a course is launche...
[Read more...]

Blazing Fast Financial Backtesting from R

October 24, 2018 | pivotteam

As a data scientist, whenever I am developing and testing financial models in R I’ve consistently run into data size limitations, large or distributed compute clusters, and many long waits for my results to be processed and returned. That's why I was genuinely impressed with how our recently released ...
[Read more...]

Popular Halloween Candy on US State Grid Map

October 24, 2018 | R on Chi's Impe[r]fect Blog

Halloween is coming..! Halloween is just around the corner, I am still trying to decide which candies to purchase this year for trick-or-treaters. Initially I was looking for data sets maybe comparing American chocolate bars vs Canadian chocolate bars possibly with sugar contents or lists of ingredients. I am really ...
[Read more...]

When the numbers don’t tell the whole story

October 24, 2018 | David Smith

Anscombe's Quartet is a famous collection of four small data sets — just 11 (x,y) pairs each — that was developed in the 1970s to emphasize the fact that sometimes, numerical summaries of data aren't enough. (For a modern take on this idea, see also the Datasaurus Dozen.) In this case, it ... [Read more...]

automl package: part 2/2 first steps how to

October 24, 2018 | Alex Boulangé

first steps: how to For those who will laugh at seeing deep learning with one hidden layer and the Iris data set of 150 records, I will say: you’re perfectly right ???? The goal at this stage is simply to take the first steps fit a regression model manually (hard way) ...
[Read more...]

M4 Forecasting Conference

October 23, 2018 | R on Rob J Hyndman

Following the highly successful M4 Forecasting Competition, there will be a conference held on 10-11 December at Tribeca Rooftop, New York, to discuss the results. The conference will elaborate on the findings of the M4 Competition, with prominent sp...
[Read more...]

Computer Vision for Model Assessment

October 23, 2018 | David Smith

One of the differences between statistical data scientists and machine learning engineers is that while the latter group are concerned primarily with the predictive performance of a model, the former group are also concerned with the fit of the model. A model that misses important structures in the data — for ...
[Read more...]

Create a Glossary in R Markdown

October 23, 2018 | Yongfu, Liao

I was thinking about creating a glossary in bookdown and found out that there was already an issue about it. I like Yihui’s recommendation: use Pandoc’s definition lists. This was exactly what I had been doing, but I quickly found out that there was a major drawback – the ...
[Read more...]

Introducing gratia

October 23, 2018 | Gavin L. Simpson

I use generalized additive models (GAMs) in my research work. I use them a lot! Simon Wood’s mgcv package is an excellent set of software for specifying, fitting, and visualizing GAMs for very large data sets. Despite recently dabbling with brms, mgcv is still my go-to GAM package. The ...
[Read more...]
1 2 3 4 5 6 15

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)