July 2018

K-fold cross-validation in Stan

July 30, 2018 | Lionel Hertzog

Comparing multiple models is one of the core but also one of the trickiest element of data analysis. Under a Bayesian framework the loo package in R allows you to derive (among other things) leave-one-out cross-validation metrics to compare the predictive abilities of different models. Cross-validation is basically: (i) separating ... [Read more...]

Following the Movement of Birds in the United States

July 29, 2018 | Alex Baransky

The American Birder For the millions of bird watchers in America, relevant and useful resources are always a welcome sight. Range maps and ecological histories enhance the bird watching experience by adding a layer of conservation awareness and help hobbyists become more acquainted with the birds they observe. As a ...
[Read more...]

Version 0.6-12 of NIMBLE released

July 29, 2018 | Chris Paciorek

We’ve released the newest version of NIMBLE on CRAN and on our website. Version 0.6-12 is primarily a maintenance release with various bug fixes. Changes include: a fix for the bootstrap particle filter to correctly calculate weights when particles are not resampled (the filter had been omitting the previous ...
[Read more...]

Readable code with base R (part 1)

July 29, 2018 | Roman Pahl

Producing readable R code is of great importance, especially if there is a chance that you will share your code with people other than your future self. The now widely used magrittr pipe operator and dplyr tools are great frameworks for this purpose. However, if you want to keep your ...
[Read more...]

Import Data into R – Part 1

July 29, 2018 | Rsquared Academy Blog

Introduction In this post, we will learn to: read data from flat or delimited files handle column names/header skip text/info present before data specify column/variable types read specific columns/variables Libraries, Data & Code We will use the readr package. The data sets can be downloaded from here ...
[Read more...]

About lazy evaluation

July 29, 2018 | Colin Fay

A follow-up on Thomas Lumley follow-up post on Miles McBain post about quotation. In this post, Thomas is continuing Miles exploration of the concept of quoting and evaluation in R. Thomas speaks a little bit about lazy evaluation, and I decided to continue to explore this concept. Notably I wish ...
[Read more...]

What’s the value of a college degree?

July 29, 2018 | Hayley Caddes

Shortly after commencing this immersive boot camp, I couldn't help but remark on the fact that everyone in my cohort has at least a bachelor's degree, and a huge chunk of us have graduate degrees as well; yet here we all were, investing a significant amount of time and money ...
[Read more...]

Visualizing Wine Reviews

July 29, 2018 | Erin Dugan

  Have you ever been wine shopping and wondered if the ratings actually mean anything? Do only high-priced wines get good reviews? This analysis attempts to demystify some of the confusion behind these ratings by examining a wide range of wines reviewed by a popular wine publication and showing ways to ...
[Read more...]

RcppCNPy 0.2.10

July 29, 2018 | Thinking inside the box

Another small maintenance release of the RcppCNPy package arrived on CRAN a few minutes ago. RcppCNPy provides R with read and write access to NumPy files thanks to the cnpy library by Carl Rogers. I updated and refreshed the vignettes, and also ment... [Read more...]

Co-integration and Pairs Trading

July 29, 2018 | statcompute

The co-integration is an important statistical concept behind the statistical arbitrage strategy named “Pairs Trading”. While projecting a stock price with time series models is by all means difficult, it is technically feasible to find a pair of (or even a portfolio of) stocks sharing the common trend such that ...
[Read more...]

cheatR: an R package for catching cheaters

July 29, 2018 | Mattan S.

cheatR is a mini package to help you find cheaters by comparing hand-ins. It was developed by Almog Simchon and me in response to students overheard bragging about how an assignment in an first-year undergrad course was "super easy" because "we all just copied from each other!" (though this would ...
[Read more...]

House price data cleansing and segmentation tool.

July 29, 2018 | Nacho Moreno

Project background Land Registry publishes data for each housing sale transaction that is registered in England & Wales. This data has been used extensively for many analysis, from price evolution in time to the assessment of price differences between areas. This dataset is publicly available under the government licence and dates ...
[Read more...]

A look at the Bay Area Bike Share

July 29, 2018 | Aungshuman Zaman

Introduction: Bicycles are one of the best ways to travel in an urban setting.  Because of its zero-emission and active lifestyle promoting reputation, it has increasingly become mainstream in cities across the world.  City planners have moved to make their streets more bike-friendly, and several private sector bike-sharing programs have ... [Read more...]

ggplot “Doodling” with HIBP Breaches

July 29, 2018 | hrbrmstr

After reading this interesting analysis of “How Often Are Americans’ Accounts Breached?” by Gaurav Sood (which we need more of in cyber-land) I gave in to the impulse to do some gg-doodling with the “Have I Been Pwnd” JSON data he used. It’s just some basic data manipulation with ... [Read more...]

CHAID vs. ranger vs. xgboost — a comparison

July 29, 2018 | Chuck Powell

In an earlier post, I focused on an in-depth visit with CHAID (Chi-square automatic interaction detection). Quoting myself, I said “As the name implies it is fundamentally based on the venerable Chi-square test – and while not the most powerful (in terms of detecting the smallest possible differences) or the fastest, ...
[Read more...]

But can ravens forecast?

July 29, 2018 | Carl Goodwin

Why forecast sales? Humans have the magical ability to plan for future events, for future gain. It’s not quite a uniquely human trait. Because apparently ravens can match a 4-year-old. An abundance of data, and some very nice R packages, make our ability to plan all the more powerful. ...
[Read more...]

5 Things You Should Know About The Future of Population

July 28, 2018 | Colleen Tapen

Motivation About a decade ago, I stumbled upon a TED Talk by a Swedish global health professor named Hans Rosling.  During the video I learned more about global macro-trends (infant mortality, GDP, etc.), opening my eyes to some misconceptions and piquing my interest.  Dr. Rosling’s moving bubble plots enraptured ... [Read more...]
1 2 3 4 17

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)