RProtoBuf 0.4.15: One fix, some updates, depcrecation coming

February 8, 2020
By

A new release 0.4.15 of RProtoBuf just arrived at CRAN. RProtoBuf provides R with bindings for the Google Protocol Buffers (“ProtoBuf”) data encoding and serialization library used and released by Google, and deployed very widely in numerous proj...

Read more »

Replicating Plots: Oil Pathways

February 7, 2020
By
Replicating Plots:  Oil Pathways

Intro I was inspired by this graphic on Yahoo Finance. In the TidyTuesday spirit, I set out to recreate it using ggplot2. There were a few things I would have to figure out - how to apply ggtext to an annotation, how to draw a curved arrow, and how to create a pinhole like border around the graphic. That last...

Read more »

Three ways to calculate distances in R

February 7, 2020
By
Three ways to calculate distances in R

Three ways to calculate distances in R Calculating a distance on a map sounds straightforward, but it can be confusing how many different ways there are to do this in R. This complexity arises because there are different ways of defining ‘distance?...

Read more »

Generating correlation matrix for AR(1) model

February 7, 2020
By
Generating correlation matrix for AR(1) model

Assume that we are in the time series data setting, where we have data at equally-spaced times which we denote by random variables . The AR(1) model, commonly used in econometrics, assumes that the correlation between and is , where is … Continue reading →

Read more »

Bio7 3.1 Released

February 7, 2020
By
Bio7 3.1 Released

07.02.2020 A new release of Bio7 is available. This update comes with a plethora of new features, improvements and bugfixes. For those who don’t know Bio7. The application Bio7 is an integrated development environment for ecological modeling, scientific image analysis and statistical analysis. It also contains a feature complete development environment for R with an

Read more »

Feedback forms for contributing

February 6, 2020
By

Feedback forms for those who are not using Version Control yet

Read more »

Feedback forms for contributing

February 6, 2020
By

To those who are not comfortable with using Git/Version Control yet: I’ve created detailed feedback forms for the tools that I’m actively maintaining.

Read more »

Money, market value and competition in modern football

February 6, 2020
By
Money, market value and competition in modern football

In this blog post, I use the market value estimations from the Transfermrkt website to evaluate the influence of money in the top-tier of European football. I trace the evolution of market value for the top 5 European leagues, compute a measure of the market concentration as a proxy for competitiveness within each league, and then estimate the relationship...

Read more »

Electricity demand data in tsibble format

February 6, 2020
By

The tsibbledata packages contains the vic_elec data set, containing half-hourly electricity demand for the state of Victoria, along with corresponding temperatures from the capital city, Melbourne. These data cover the period 2012-2014. Other similar data sets are also available, and these may be of interest to researchers in the area. For people new to tsibbles, please read my introductory post.   Australian state-level...

Read more »

SHARPEn your portfolio

February 6, 2020
By
SHARPEn your portfolio

In our last post, we started building the intuition around constructing a reasonable portfolio to achieve an acceptable return. The hero of our story had built up a small nest egg and then decided to invest it equally across the three major asset classes: stocks, bonds, and real assets. For that we used three liquid ETFs (SPY, SHY, and...

Read more »

Comparing Ensembl GTF and cDNA

February 6, 2020
By
Comparing Ensembl GTF and cDNA

It seems that most people think Ensembl’s GTF file and cDNA fasta file mean the same transcripts: Watch out! @ensembl's Fasta and GTF annotation files available via https://t.co/2AhCSnL7py do not match (there are transcripts in the GTF not found in the Fasta file. Anyone else expected...

Read more »

“Clearing the Confusion” series

February 6, 2020
By

In recent weeks, I’ve posted three tutorials with Clearing the Confusion titles, all in my regtools GitHub repo. Topics have been unbalanced classification data; k-fold cross validation; and scaling in PCA. Comments welcome!

Read more »

Le Monde puzzle [#1130]

February 6, 2020
By
Le Monde puzzle [#1130]

A two-player game as Le weekly Monde current mathematical puzzle: Abishag and Caleb fill in alternance a row of N boxes in a row by picking one then two then three &tc. consecutive boxes. When a player is unable to find enough consecutive boxes, the player has lost. Who is winning when N=29? When N=30?

Read more »

Prying “.R” Script Files Away from Xcode (et al) on macOS

February 6, 2020
By

As the maintainer of RSwitch — and developer of my own (for personal use) macOS, iOS, watchOS, iPadOS and tvOS apps — I need the full Apple Xcode install around (more R-focused macOS folk can get away with just the command-line tools being installed). As an Apple Developer who insanely runs the macOS & Xcode... Continue reading →

Read more »

Function to download biotic interaction datasets

February 6, 2020
By

I work in ecology, biogeography, etc… Biotic interactions (interactions between species) and its repercussions on species distributions is my main research interest. As such, I had, at some point, to download datasets on species interactions. I wanted to be able to produce a uniform (more or less, not as much as I would like) R … Continue reading Function...

Read more »

How to use bootstraplib’s Live Theme Previewer to customize Shiny apps?

February 6, 2020
By
How to use bootstraplib’s Live Theme Previewer to customize Shiny apps?

One of the announcements of RStudio conf 2020 that caught my eyes is a brand new package {bootstraplib} - https://github.com/rstudio/bootstraplib/ . It’s another open-source contribution from RStudio (a PBC). {bootstraplib} basically provides tools for theming shiny and rmarkdown from R via Bootstrap (3 or 4) Sass. If you’re not aware of Bootstrap, it’s one of the most popular (open-source) css...

Read more »

New Data Scientist Stickers

February 5, 2020
By
New Data Scientist Stickers

We have a new data scientist sticker! If you see Nina or John at a conference/MeetUp, please ask us for a sticker!

Read more »

The simplest tidy machine learning workflow

February 5, 2020
By

caret is a magical package for doing machine learning in R. Look at this code for running a regularized regression: library(caret) inTrain % mutate(Sale_Price = log10(Sale_Price)) %__% select(Sale_Price, .pred) %__% rmse(Sale_Price, .pred) and here’s what I think it should look like in pseudocode: ############################# Pseudocode ###################################### ############################################################################### library(AmesHousing) # devtools::install_github("tidymodels/tidymodels") library(tidymodels) ames % # Split test/train initial_split(prop = .75) %__% ...

Read more »

Introduction to the forecastLM package

February 5, 2020
By

I am pleased to announce a new R package - forecastLM. The package, as the name implies, provides applications for forecasting regular time series data with a linear regression model (based on the lm function from the stats package). It supports both ts and tsibble objects as inputs and enables simple extractions of features from the input object on...

Read more »

Visualization of the Debt/GDP ratio and national debt level

February 5, 2020
By
Visualization of the Debt/GDP ratio and national debt level

I saw this graph on Twitter a few days ago: Short googling revealed that this is a relatively old graph from October 2017. On one hand, this is a really cool visualization. On the other hand, it also belongs… Continue reading →

Read more »

Please, somebody create an ETF that buys EU CO2 emission allowances! I’d like the gamble to earn money by fighting climate change.

February 5, 2020
By
Please, somebody create an ETF that buys EU CO2 emission allowances! I’d like the gamble to earn money by fighting climate change.

I really would like to buy now EU CO2 allowances and keep them for 5-20 years before selling them back. This transaction is likely to reduce total EU CO2 emissions and I would even have the chance to make some money out of it. That one can actually reduce emission (rather than only postpone them) just by holding allowances for...

Read more »

Shiny: Load testing and horizontal scaling

February 5, 2020
By
Shiny: Load testing and horizontal scaling

„Money can’t buy you happiness, but it can buy you more EC2 Instances…“ – With this quote Sean Lopp, Product Manager at RStudio, PBC, rang in his „Scaling Shiny“ showcase. In this showcase, he uses a load-testing approach to show how a Shiny application can be scaled for 10,000 users. RStudio’s shiny WebApp framework is an R

Read more »

#TidyTuesday and tidymodels

February 4, 2020
By
#TidyTuesday and tidymodels

This week I started my new job as a software engineer at RStudio, working with Max Kuhn and other folks on tidymodels. I am really excited about tidymodels because my own experience as a practicing data scientist has shown me some of the areas for growth that still exist in open source software when it comes to modeling and...

Read more »

Some 2020 R Conferences

February 4, 2020
By
Some 2020 R Conferences

rstudio::conf kicked off the 2020 season for R conferences last week with record attendance somewhere north of twenty-one hundred. Session topics ranged from business to science, marketing to medicine and attracted R users with very varied backgrounds including DevOps professionals, data scientists, journalists, physicians, statisticians, R package developers, Shiny developers and more. Although it is true that the San...

Read more »

The Fun of Building Things and the Challenge of Learning – the rOpenSci OzUnconf 2019

The Fun of Building Things and the Challenge of Learning – the rOpenSci OzUnconf 2019

It was the best of times, it was the worst of times. Dickens might have meant it figuratively, but in the case of the rOpenSci OzUnconf 2019, we mean it literally. Set to the backdrop of a national emergency that is still ongoing from 11-13 December, our participants came from across Australia as well as New Zealand, Japan, India and...

Read more »

wrapr Update: Removing Some Under-Used Functions and Classes

February 4, 2020
By

For the next version of the R package wrapr we are going to be removing a number of under-used functions/methods and classes. This update will likely happen in March 2020, and is the start of the wrapr 2.* series. Most of the items being removed are different abstractions for helping with function composition. We ended … Continue reading wrapr...

Read more »

Consensus clustering in R

February 4, 2020
By
Consensus clustering in R

The logic behind the Monti consensus clustering algorithm is that in the face of resampling the ideal clusters should be stable, thus any pair of samples should either always or never cluster together. We can use this principle to infer the optimal number of clusters (K). This works by examining cluster stability from K=2 to

Read more »

RStudio::conf 2020 San Francisco Recap

February 4, 2020
By
RStudio::conf 2020 San Francisco Recap

RStudio::conf 2020 is a wrap! What a tremendous experience. It was quite a production to send four Appsilon team members to San Francisco, California from Warsaw with nearly 100 kg of swag, but it was absolutely worthwhile. We were a proud sponsor of the event, and having a booth set up in the main lobby Article RStudio::conf 2020 San...

Read more »

Epidemiology: How contagious is Novel Coronavirus (2019-nCoV)?

February 4, 2020
By
Epidemiology: How contagious is Novel Coronavirus (2019-nCoV)?

A new invisible enemy, only 30kb in size, has emerged and is on a killing spree around the world: 2019-nCoV, the Novel Coronavirus! It has already killed more people than the SARS pandemic and its outbreak has been declared a Public Health Emergency of International Concern (PHEIC) by the World Health Organization (WHO). If you … Continue reading "Epidemiology:...

Read more »

Search R-bloggers

Sponsors