July 2019

Clean, Consistent Column Names

July 7, 2019 | William Doane

I like to standardize the column names of data I’m reading into R so that I don’t have to match column names from one dataset that has an i.d. column and another that has an id column or maybe an ID column. Keep it simple: lower case ... [Read more...]

swephR v0.2.1

July 7, 2019 | R on Ralf Stubner

This morning swephR version 0.2.1 made it onto CRAN and is now propagating to the mirrors. The goal of swephR is to provide an R interface to the Swiss Ephemeris, a high-precision ephemeris based upon the DE431 ephemeris from NASA’s JPL. It covers the time range 13201 BCE to 17191 CE. ... [Read more...]

Introducing tidylo

July 7, 2019 | Rstats on Julia Silge

Today I am so pleased to introduce a new package for calculating weighted log odds ratios, tidylo. Often in data analysis, we want to measure how the usage or frequency of some feature, such as words, differs across some group or set, such as documents. One statistic often used to ...
[Read more...]

Chunk Averaging of GLM

July 7, 2019 | statcompute

Chunk Average (CA) is an interesting concept proposed by Matloff in chapter 13 of his book “Parallel Computing for Data Science”. The basic idea is to partition the entire model estimation sample into chunks and then to estimate a glm for each chunk. Under the i.i.d. assumption, the ... [Read more...]

Sampling paths from a Gaussian process

July 7, 2019 | kjytay

Gaussian processes are a widely employed statistical tool because of their flexibility and computational tractability. (For instance, one recent area where Gaussian processes are used is in machine learning for hyperparameter optimization.) A stochastic process is a Gaussian process if ...
[Read more...]

Le Monde puzzle [#1105]

July 7, 2019 | xi'an

Another token game as Le Monde mathematical puzzle: Archibald and Beatrix play with a pile of n≤100 tokens, sequentially picking m tokens from the pile with m being a prime number [including m=1] or a multiple of 6, the winner taking the last tokens. If Beatrix knows n and proposes to ...
[Read more...]

CRAN Release of R/exams 2.3-3 and 2.3-4

July 7, 2019 | R/exams

New minor releases of the R/exams package to CRAN, containing a new dedicated function for online quizzes/exams in the Canvas learning management system. Moreover, the update provides a range of smaller improvements and bug fixes. ... [Read more...]

Link Functions versus Data Transforms

July 7, 2019 | Nina Zumel

In the linear regression section of our book Practical Data Science in R, we use the example of predicting income from a number of demographic variables (age, sex, education and employment type). In the text, we choose to regress against log10(income) rather than directly against income. One obvious reason ...
[Read more...]

Latin Hypercube Sampling in Hyper-Parameter Optimization

July 6, 2019 | statcompute

In my previous post (https://statcompute.wordpress.com/2019/02/03/sobol-sequence-vs-uniform-random-in-hyper-parameter-optimization/), I’ve shown the difference between the uniform pseudo-random and the quasi-random number generators in the hyper-parameter optimization of machine learning. Latin Hypercube Sampling (LHS) is another interesting way to generate near-random sequences with a very simple idea. Let’...
[Read more...]

Visualize monthly precipitation anomalies

July 6, 2019 | R on Dominic Royé

Normally when we visualize monthly precipitation anomalies, we simply use a bar graph indicating negative and positive values with red and blue. However, it does not explain the general context of these anomalies. For example, what was the highest or lowest anomaly in each month? In principle, we could use ...
[Read more...]

Hacking dbplyr for CKAN

July 6, 2019 | R on Alan Yeung

Contents: Aim; Create a dummy database; Test dbplyr’s SQL translation; Modify dbplyr’s SQL translation; Testing the dbplyr hack; Concluding notes. At the end of my first post on CKAN discussing how to use the CKAN API to extract data from the NHS open data platform directly into R, I ... [Read more...]

Glue magic Part I

July 6, 2019 | Blog on Credibly Curious

Lately I’ve found myself using Jim Hester’s glue package instead of paste0 or sprintf. This post marks the start of an ongoing series of little magic spells using the glue package. The back story I’ve been through a few stages of discovery for combining strings of text ... [Read more...]

Making a Cheat Sheet with Rmarkdown

July 6, 2019 | r on Tony ElHabr

Unfortunately, I haven’t had as much time to make blog posts in the past year or so. I started taking classes as part of Georgia Tech’s Online Master of Science in Analytics (OMSA) program last summer (2018) while continuing to work full-time, so extra time to code and write ...
[Read more...]

Use the k-means clustering, Luke

July 6, 2019 | Jamie Lendrum

In my last post I scraped some character statistics from the mobile game Star Wars: Galaxy of Heroes. In this post, I’ll be aiming to try out k-means clustering in order to see if it comes out with an intuitive result, and to learn how to integrate this kind ...
[Read more...]

Automatic differentiation in pqR

July 6, 2019 | Radford Neal

I’ve released a version of my pqR implementation of R that has extensions for automatic differentiation. This is not a stable release, but it can be downloaded from pqR-project.org — look for the test version at the bottom — and installed the same as other pqR versions (from source, so ...
[Read more...]