Tidy time series data using tsibbles

August 28, 2019

There is a new suite of packages for tidy time series analysis that integrates easily into the tidyverse way of working. We call these the tidyverts packages, and they are available at tidyverts.org. Much of the work on these packages has been done by Earo Wang and Mitchell O’Hara-Wild. The first of the packages to make it to CRAN was...

July 2019 “Top 40” R Packages

August 28, 2019

One hundred seventy-six new packages made it to CRAN in July. Here are my “Top 40” picks organized into twelve categories: Data, Data Science, Finance, Genomics, Machine Learning, Mathematics, Medicine, Statistics, Time Series, Topological Data Analysis, Utilities and Visualization. Data: eia v0.3.2 provides API access to data from the US Energy Information Administration (EIA). Use of the API requires a free...

Errors and Debugging in RStudio

August 28, 2019

Diagnosing and fixing errors in your code can be time-consuming and frustrating. There are two ways you can make your life easier. The first is knowing the tools at your disposal in RStudio to debug errors. RStudio provides a variety of tools to help you diagnose the problem at its source and come up with a solution as quick...

PostcodesioR 0.1.1 is on CRAN

August 28, 2019

Introduction: The latest stable version of my UK geocoder package has finally made it to CRAN. PostcodesioR is a wrapper for postcodes.io and it provides multiple functions to work with UK geospatial data. This package is based exclusively on open data provided by Ordnance Survey and Office for National Statistics and turned into an API

R, Docker and Checkpoint: A Route to Reproducibility

August 28, 2019

I need to deploy Shiny on a Windows machine. I also need to use {checkpoint} for package management. Using Docker seems to be the only reasonable approach to Shiny on Windows. But how easy would it be to also factor {checkpoint} into this setup? Only one reasonable way to find out: give it a try. Below is the simple Dockerfile I...

R Tensorflow Multiple Linear Regression

August 27, 2019

In the previous three posts I used multiple linear regression, decision trees, gradient boosting, and support vector machines to predict miles per gallon for 2019 vehicles. The SVM turned out to produce the best model. In this post, I am going to run TensorFlow through R and fit a multiple linear regression model using the same data to predict...

Find Insights with Ranked Cross-Correlations

August 27, 2019

A well-done correlation analysis will lead us to a greater understanding of our data and empower us with valuable insights. A correlation analysis is a statistical technique that can show whether and how strongly pairs of variables are related, but all features must...

swephR v0.3.0

August 27, 2019

This afternoon swephR version 0.3.0 made it onto CRAN and is now propagating to the mirrors. The goal of swephR is to provide an R interface to the Swiss Ephemeris (SE), a high-precision ephemeris based upon the DE431 ephemeris from NASA’s JPL. It covers the time range 13201 BCE to 17191 CE. This new version comes with two important changes. First, Victor has finished the...

How tennis has changed over time

August 27, 2019

Over the last few years I have become a huge fan of tennis (both on court and on TV). This year I followed Roland Garros and Wimbledon closely on TV. Although clay and grass are very different surfaces, I found the game very similar on both. What happened to the serve-and-volley game? I think it is my duty (as a data freak) to...

The R Factor

August 27, 2019

To the uninitiated, entering UCL’s packed Darwin lecture theatre on Monday evening knowing you held a golden ticket so coveted that 400 names remained on a waitlist, you could be forgiven for thinking this was the most popular meetup on the planet. And perhaps, on this occasion at least, LondonR was the focal point of the R universe. Because...

Data science courses with R in Frankfurt!

August 27, 2019

R is one of the leading programming languages for data analysis. In our R trainings, we teach you the manifold possibilities of the free programming language in a practically orientated and comprehensive way. Unlock the potential of data science with the free R programming language for advanced analytics

R Journal July Issue

August 26, 2019

As the current Editor-in-Chief of the R Journal, I must apologize for the delay in getting the July issue online, due to technical and other matters. In the meantime, though, please take a look at the many interesting articles slated for publication in this and upcoming issues. Various improvements in technical documentation, as well as …

How to scrape Zomato Restaurants Data in R

August 26, 2019

Zomato is a popular restaurant listing website in India (similar to Yelp), and people are always interested in seeing how to download or scrape Zomato restaurant data for data science and visualizations. In this post, we’ll learn how to scrape or download Zomato restaurant (buffet) data using R. I also hope this post will serve as a basic web scraping framework...

Approximating small probabilities using importance sampling

Box plots are often used. They are not always the best visualisation (e.g. bi-modality is not visible), but I will not discuss that in this post. Here, I will use them as an example of importance sampling, which is a technique to estimate tail probabi...

Building a Recommendation System with Beer Data

Beer culture in the United States has changed dramatically in the past decade or so. This trend is reflected in the development of a vibrant community of people who rate, review, and share information about beers online. Websites like BeerAdvocate, RateBeer, and Untappd give beer drinkers a place to share their beer tastes with others. Surprisingly, despite the large...

How much pizza and how much frozen yogurt? …with Gröbner bases

In a recent blog post I tried to get yacas to solve a system of polynomial equations. Unfortunately it could not do that, so I solved it numerically instead. Now it is possible – together with many other systems of polynomial equations thanks to fixing a small error in yacas. It has now been fixed, also in Ryacas (development version), so hurry up and...

Improving AFL articles

August 26, 2019

So you might have seen an article from someone who has a weird Simpsons display picture about the sorry state of AFL analytics. This isn’t saying that AFL work is poor as a whole or that people within AFL circles aren’t capable of good work. This just points to a framework in place by the powers that be, that...

Using rOpenSci Software Peer Review Guidelines for Teaching

Teaching collaborative software development: In the University of British Columbia’s Master of Data Science program, one of the courses we teach is called Collaborative Software Development, DSCI 524. In this course we focus on teaching how to exploit practices from collaborative software development techniques in data scientific workflows. This includes appropriate use of the software life cycle, unit testing and...

Polished – Modern Authentication for Shiny

August 26, 2019

We are excited to announce polished. Polished is a new R package that adds modern user authentication and user administration to your Shiny apps. Polished comes with many of the authentication features required by today’s web apps (e.g. user registr...

Introducing data_algebra

August 26, 2019

This article introduces the data_algebra project: a data processing tool family available in R and Python. These tools are designed to transform data either in-memory or on remote databases. In particular we will discuss the Python implementation (also called data_algebra) and its relation to the mature R implementations (rquery and rqdatatable). Introduction: Parts of the …

Behind the Scenes of an R Consortium Project

August 26, 2019

Next Tuesday (September 3) I will be giving a talk at the Bay Area R User Group titled Behind the Scenes of an R Consortium Project. This will be my first time speaking about my work with the R Consortium, and I encourage you to attend! Over the last few years the R Consortium has emerged …

Combining the power of R and Python with reticulate

August 26, 2019

R + Py: In the world of R vs Python fights, this is a simple (some might call it naive) attempt to show how we can combine the power of Python with R and create a new superpower. Like this one, if you have watched The Incredibles before! About this Dataset: This dataset contains a bunch of tweets that came with...

RMarkdown Template that Manages Academic Affiliations – docx or PDF output

August 26, 2019

Background: I like writing my academic papers in RMarkdown because it allows reproducible research. The cleanest way to submit a manuscript made in RMarkdown is using the LaTeX code that it generates using the YAML switch keep_tex: true. A minimalist YAML header would look like so: … Introduction: However, when you want multiple authors …

Click n click (GC606J1)

August 26, 2019

Today’s puzzle is a single part of a mystery that comprises several smaller puzzles, styled on the cult classic TV show The Crystal Maze. To get the coordinates there is a “mental” test for the northings, two easy puzzles that I won’t go into here. The westings are the more interesting part, with a “skill”

RSwitch 1.4.0 Released

August 26, 2019

Swift 5 has been so much fun to hack on that there’s a new update to the macOS R-focused menubar utility RSwitch available. Along with the app comes a new dedicated RSwitch landing page and a new user’s guide, since it has enough features to warrant such documentation. Here’s the new menu. The core changes/additions include:...

Mixing up R markdown shortcut keys in RStudio, or how to unfold all chunks

August 25, 2019

When using R markdown in RStudio, I like to insert a new chunk using the shortcut Cmd+Option+I. Unfortunately I often press the wrong key instead of “I” and end up folding all the chunks, getting something like this: It often takes …

Practice using lubridate… THEATRICALLY

August 25, 2019

I am so pleased to now be an RStudio-certified tidyverse trainer! 🎉 I have been teaching technical content for decades, whether in a university classroom, developing online courses, or leading workshops, but I still found this program valuable for my own professional development. I learned a lot that is going to make my teaching better, and I am happy...

Maximum likelihood estimation from scratch

Maximum likelihood estimates of a distribution: Maximum likelihood estimation (MLE) is a method to estimate the parameters of a random population given a sample. I described what this population means and its relationship to the sample in a previous post. Before we can look into MLE, we first need to understand the difference between probability and probability density for continuous variables....

Studying Politics on and with Wikipedia

August 25, 2019

The online encyclopedia Wikipedia, together with its sibling, the collaboratively edited knowledge base Wikidata, provides incredibly rich yet largely untapped sources for political research. In this Methods Bites Tutorial, Denis Cohen and Nick Baumann offer a hands-on recap of Simon Munzert’s (Hertie School of Governance) workshop materials to show how these platforms can inform research on public attention dynamics,...
