Highlighting with ggplot2: The Old School and New School Way

July 31, 2018
By
Highlighting with ggplot2: The Old School and New School Way

A tutorial showing how to highlight a subset of your data within the context of the full data set. We review the old school way (layering) and new school way (gghighlight).

Read more »

Business confidence and economic growth by @ellis2013nz

Business confidence and economic growth by @ellis2013nz

The ANZ bank have a nice bit of publicity for themselves each month in New Zealand with the release of the results of their monthly Business Outlook Survey. This week it caused a bit of a stir, with the ANZ’s own commentary reporting New Zealand corporate sector is “in a funk” (ANZ’s words) with a net 45 percentage...

Read more »

How to use Hierarchical Bayes for Choice Modeling In R

July 31, 2018
By
How to use Hierarchical Bayes for Choice Modeling In R

There are multiple add-on packages available in R to fit choice models in a Bayesian framework.  These include bayesm, choiceModelR, and flipChoice. In this article,...

Read more »

ALLSTATisticians in decline? A polite look at ALLSTAT email Archives

ALLSTATisticians in decline? A polite look at ALLSTAT email Archives

I was until recently subscribed to an email list, ALLSTAT, “A UK-based worldwide e-mail broadcast system for the statistical community, operated by ICSE for HEA Statistics.” created in 1998. That’s how I saw the ad for my previous job in Barcelona! Now, I dislike emails more and more so I unsubscribed, but I’d still check out the archives any...

Read more »

rstudio::conf 2019 is open for registration!

July 30, 2018
By
rstudio::conf 2019 is open for registration!

rstudio::conf, the conference for all things R and RStudio, will take place January 17 and 18, 2019 (Thursday and Friday) in Austin, Texas. It will be preceded by Training Days on January 15 and 16 (Tuesday and Wednesday). Early Bird registration is now open! Conference: Thursday January 17 - Friday January 18, 2019 Join host and RStudio Chief Scientist Hadley Wickham...

Read more »

Judging Freehand Circle Drawing Competitions

July 30, 2018
By
Judging Freehand Circle Drawing Competitions

Abstract: In 2007 Alexander Overwijk went viral with his 'Perfect Circle' video. The same year a World Freehand Circle Drawing Championship was organized, which he won. In this post we show how a mobile camera, R and the imager package can be used to develop an image analysis based method to judge future instances of the championship. This work is...

Read more »

New R package debugr – use automatic debug messages to improve your code

July 30, 2018
By

debugr is a new package designed to support debugging in R. It mainly provides the dwatch() function which prints a debug output to the console or to a file. A debug output can consist of a static text message, the values of one or more objects (p...

Read more »

Assignment in R: slings and arrows

July 30, 2018
By
Assignment in R: slings and arrows

Having recently shared my post about defensive programming in R on the r/rstats subreddit, I was blown away by the sheer number of comments as much as I was blown away by the insight many displayed. One particular comment by u/guepier struck my attention. In my previous post, I came out quite vehemently against using theRead more The post Assignment in...

Read more »

Long Running Tasks With Shiny: Challenges and Solutions

July 30, 2018
By
Long Running Tasks With Shiny: Challenges and Solutions

One of the great additions to the R ecosystem in recent years is RStudio's Shiny package. With it, you can easily whip up and share a user interface for a new statistical method in just a few hours. Today I want to share some of the methods and challenges that come up when the actual

Read more »

A Certification for R Package Quality

July 30, 2018
By
A Certification for R Package Quality

There are more than 12,000 packages for R available on CRAN, and many others available on Github and elsewhere. But how can you be sure that a given R package follows best development practices for high-quality, secure software? Based on a recent survey of R users related to challenges in selecting R packages, the R Consortium now recommends a...

Read more »

Podcast on Nonclinical Statistics

July 30, 2018
By
Podcast on Nonclinical Statistics

Hugo...

Read more »

R Consortium Proposal Accepted!

July 30, 2018
By
R Consortium Proposal Accepted!

Today I am happy to announce that my proposal to the R Consortium was accepted! I first announced that I was submitting a proposal back... The post R Consortium Proposal Accepted! appeared first on AriLamstein.com.

Read more »

Rj Editor – Analyse your data with R in jamovi

July 30, 2018
By
Rj Editor – Analyse your data with R in jamovi

tl;dr Rj Editor lets you analyse data in jamovi with R, and make use of your favourite R packages from within the jamovi statistical spreadsheet jmvconnect makes it easy to access jamovi data sets from R

Read more »

K-fold cross-validation in Stan

July 30, 2018
By

Comparing multiple models is one of the core but also one of the trickiest element of data analysis. Under a Bayesian framework the loo package in R allows you to derive (among other things) leave-one-out cross-validation metrics to compare the predictive abilities of different models. Cross-validation is basically: (i) separating the data into chunks, (ii) Related PostAutomated Text Feature...

Read more »

Following the Movement of Birds in the United States

July 29, 2018
By
Following the Movement of Birds in the United States

The American Birder For the millions of bird watchers in America, relevant and useful resources are always a welcome sight. Range maps and ecological histories enhance the bird watching experience by adding a layer of conservation awareness and help hobbyists become more acquainted with the birds they observe. As a birder myself, I am always

Read more »

Version 0.6-12 of NIMBLE released

July 29, 2018
By
Version 0.6-12 of NIMBLE released

We’ve released the newest version of NIMBLE on CRAN and on our website. Version 0.6-12 is primarily a maintenance release with various bug fixes. Changes include: a fix for the bootstrap particle filter to correctly calculate weights when particles are not resampled (the filter had been omitting the previous weights when calculating the new weights);

Read more »

About lazy evaluation

July 29, 2018
By
About lazy evaluation

A follow-up on Thomas Lumley follow-up post on Miles McBain post about quotation. In this post, Thomas is continuing Miles exploration of the concept of quoting and evaluation in R. Thomas speaks a little bit about lazy evaluation, and I decided to continue to explore this concept. Notably I wish to start over from on this quote from the...

Read more »

Beyond Basic R – Introduction and Best Practices

July 29, 2018
By

We queried more than 60 people who have taken the USGS Introduction to R class over the last two years to understand what other skills and techniques are desired, but not covered in the course. Though many people have asked for an intermediate level class, we believe that many of the skills could be best taught through existing online...

Read more »

Visualizing Wine Reviews

July 29, 2018
By
Visualizing Wine Reviews

  Have you ever been wine shopping and wondered if the ratings actually mean anything? Do only high-priced wines get good reviews? This analysis attempts to demystify some of the confusion behind these ratings by examining a wide range of wines reviewed by a popular wine publication and showing ways to choose a wine based

Read more »

RcppCNPy 0.2.10

July 29, 2018
By

Another small maintenance release of the RcppCNPy package arrived on CRAN a few minutes ago. RcppCNPy provides R with read and write access to NumPy files thanks to the cnpy library by Carl Rogers. I updated and refreshed the vignettes, and also ment...

Read more »

Co-integration and Pairs Trading

July 29, 2018
By
Co-integration and Pairs Trading

The co-integration is an important statistical concept behind the statistical arbitrage strategy named “Pairs Trading”. While projecting a stock price with time series models is by all means difficult, it is technically feasible to find a pair of (or even a portfolio of) stocks sharing the common trend such that a linear combination of two

Read more »

House price data cleansing and segmentation tool.

July 29, 2018
By
House price data cleansing and segmentation tool.

Project background Land Registry publishes data for each housing sale transaction that is registered in England & Wales. This data has been used extensively for many analysis, from price evolution in time to the assessment of price differences between areas. This dataset is publicly available under the government licence and dates back to 1995. The main

Read more »

histoRicalg: The effort to document historic and historical numerical algorithms in R

July 29, 2018
By
histoRicalg: The effort to document historic and historical numerical algorithms in R

Recently, the R-consortium accepted a new project called histoRicalg. The main goal of the project is to document and transfer knowledge of some older algorithms used by R and by other computational systems. There is a lot of R written in Fortran—much of which is in the old F77 format—and in C whose original implementations Read the full article... The...

Read more »

ggplot “Doodling” with HIBP Breaches

July 29, 2018
By

After reading this interesting analysis of “How Often Are Americans’ Accounts Breached?” by Gaurav Sood (which we need more of in cyber-land) I gave in to the impulse to do some gg-doodling with the “Have I Been Pwnd” JSON data he used. It’s just some basic data manipulation with some heavy ggplot2 styling customization, so... Continue reading →

Read more »

CHAID vs. ranger vs. xgboost — a comparison

July 29, 2018
By
CHAID vs. ranger vs. xgboost — a comparison

In an earlier post, I focused on an in-depth visit with CHAID (Chi-square automatic interaction detection). Quoting myself, I said “As the name implies it is fundamentally based on the venerable Chi-square test – and while not the most powerful (in terms of detecting the smallest possible differences) or the fastest, it really is easy Related PostCommon Mistakes to...

Read more »

Statistics Sunday: More Text Analysis – Term Frequency and Inverse Document Frequency

July 29, 2018
By
Statistics Sunday: More Text Analysis – Term Frequency and Inverse Document Frequency

Statistics Sunday: Term Frequency and Inverse Document Frequency As a mixed methods researcher, I love working with qualitative data, but I also love the idea of using quantitative methods to add some meaning and context to the words. This is the main reason I've started digging into using R for text mining, and these skills have paid off...

Read more »

But can ravens forecast?

July 29, 2018
By
But can ravens forecast?

Why forecast sales? Humans have the magical ability to plan for future events, for future gain. It’s not quite a uniquely human trait. Because apparently ravens can match a 4-year-old. An abundance of data, and some very nice R packages, make our ability to plan all the more powerful. A couple of months ago we looked … Continue reading "But...

Read more »

June 2018: Top 40 New Packages

July 28, 2018
By
June 2018: Top 40 New Packages

Approximately 144 new packages stuck to CRAN in June. That fact that 31 of these are specialized to particular scientific disciplines or analyses provides some evidence to my hypothesis that working scientists are actively adopting R. Below are my Top 40 picks for June, organized into the categories of Computational Methods, Data, Data Science, Economics, Science, Statistics, Time Series,...

Read more »

Tuning xgboost in R: Part II

July 28, 2018
By
Tuning xgboost in R: Part II

By Gabriel Vasconcelos In this previous post I discussed some of the parameters we have to tune to estimate a boosting model using the xgboost package. In this post I will discuss the two parameters that were left out in … Continue reading →

Read more »

Search R-bloggers


Sponsors

Mango solutions





Zero Inflated Models and Generalized Linear Mixed Models with R



datasciencego.com

Quantide: statistical consulting and training

ODSC2 west

ODSC1_london

datasociety

http://www.eoda.de

max kuhn









Six Sigma Online Training



mljar.com

computationalanalytics.com

Our ads respect your privacy. Read our Privacy Policy page to learn more.

Contact us if you wish to help support R-bloggers, and place your banner here.