Analysing tweets from the Virtual Madrid Open

With the world in lockdown and no live tennis on the horizon, the Madrid Open decided to organise an online tournament with male and female players facing each other in the Tennis World Tour video game. I was curious to see how fans reacted to the event, so I downloaded Twitter data using the rtweet package. I...

Greg Wilson Wins ACM Influential Educator Award

May 3, 2020

Recently, the Association for Computing Machinery’s (ACM’s) Special Interest Group on Software Engineering (SIGSOFT) recognized Greg Wilson as the 2020 recipient of its prestigious Influential Educator Award, which is given annually to individuals or groups who have made significant contributions to software engineering through education, mentoring or policy. JBR: Greg, congratulations, and thank you for agreeing to this interview. Greg, you...

missingHE update

May 3, 2020

Andrea has just released a major update for missingHE (this is my own fork of the main project — but now available on CRAN too), the package we (well, mostly he!) have (has) developed to handle missing data in Bayesian models for cost-effectiveness analysis. Additions to the basic structure of the package include multilevel models and new plotting facilities, revolving...

Le Monde puzzle [#1141]

May 3, 2020

The weekly puzzle from Le Monde is in honour of John Conway, who just passed away, ending his own game of life: On an 8×8 checker-board, Alice picks n squares as “infected”. She then propagates the disease by having each square with at least two infected neighbours become infected as well. What is the...
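
The propagation rule described in this teaser can be sketched in base R as follows. This is only an illustration, not the post's own solution: the function name spread_infection is hypothetical, and it assumes that "neighbours" means the up-to-four squares sharing an edge, which the excerpt does not state.

```r
# Minimal sketch of the infection rule on a square board.
# Assumption: neighbours are the squares sharing an edge (up, down, left, right).
spread_infection <- function(board) {
  n <- nrow(board)
  repeat {
    # Pad the board with a healthy border so edge squares need no special case.
    padded <- matrix(FALSE, n + 2, n + 2)
    padded[2:(n + 1), 2:(n + 1)] <- board
    # Count infected edge-sharing neighbours for every square at once.
    counts <- padded[1:n, 2:(n + 1)] + padded[3:(n + 2), 2:(n + 1)] +
      padded[2:(n + 1), 1:n] + padded[2:(n + 1), 3:(n + 2)]
    # A square becomes infected once at least two neighbours are infected.
    updated <- board | (counts >= 2)
    # Stop when a full pass changes nothing.
    if (identical(updated, board)) return(board)
    board <- updated
  }
}

# Example: start with the 8 squares of the main diagonal infected.
start <- diag(8) == 1
final <- spread_infection(start)
sum(final)  # number of infected squares once propagation stops
```

Under the edge-sharing assumption, the diagonal start eventually infects all 64 squares, since each off-diagonal square next to the infected band always touches two infected squares.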

#0: Introducing T^4: Tips, Tricks, Tools, and Toys

For way too long now something I had meant to start was a little series about tips, tricks, tools, and toys. I had mentioned the idea a few times to a friend or two, and generally received a thumbs up or a ‘go for it’. But it takes a little to ge...

Securing and monitoring ShinyProxy deployment of R Shiny apps

May 3, 2020

Table of Contents: Introduction; Prerequisites; Step 1: Setting up Docker; Step 2: Building the demo Shiny app and containerised ShinyProxy Docker images; Step 3: Launching AWS EC2 instances; Step 4: Setting up a domain name (Optional); Step 5: Setting up AWS Cognito (Optional); Setting up HTTPS (SSL / TLS); Step 1: Preparing configuration files; Step 2: Setting up ShinyProxy Machine; Setting up InfluxDB, Telegraf and Grafana for usage statistics logging; Step...

How to Become a Data Scientist in 2020

May 3, 2020

Despite the spike in interest in Data Science and Machine Learning roles and courses, it is still possible to become a fully functional data scientist with minimal resources. Some caveats: (1) be committed to investing hours of effort in building your expertise; (2) the job market has become quite competitive, so be mentally prepared. The post How to...

Statistics Sunday: My 2019 Reading

May 3, 2020

I've spent the month of April blogging my way through the tidyverse, using my reading dataset from 2019 as the example. Today, I thought I'd bring many of those analyses and data manipulation techniques together to do a post about my reading habits for the year. library(tidyverse) ...

Revitalizing R package yorkr

May 3, 2020

“There is nothing so useless as doing efficiently that which should not be done at all.” (Peter Drucker) “The most important thing in communication is to hear what isn’t being said.” (Peter Drucker) “Work expands to fill the time available for its completion.” Corollary: “Expenditure rises to meet income.” (Parkinson’s law) Introduction: “Operation successful!!!, the Programmer Surgeon in me, … Continue reading Revitalizing...

Analyzing the ggcharts CRAN Downloads. Part 2: Data Visualization

Introduction: In part 1 of this post I described how I got data about the ggcharts CRAN downloads. In this follow-up post I will walk you through how to turn this data into an informative visualization using ggplot2 and patchwork, loaded with library(ggplot2) and library(patchwork). Daily Downloads: The first plot will show the daily downloads over time. I will use a line chart for that purpose. Remember...

Updating packages on a drat repo

May 2, 2020

This is just a small note (mainly for myself, but hopefully of some use to a few others!) as a reminder of how to update a package on a drat repo. (1) Create the source file for the package you want to host on the drat repo using devtools::build(). (2) Clone the drat repo hosting the package (in my case https://github.com/alan-y/drat). (3) Use drat::insertPackage("package-source.tar.gz",...

An update to “An adventure in downloading books”

May 2, 2020

I received an email from Bernardo Lares as feedback on my previous article. You can also view some of the other cool work done by him in this link. His script is provided below. He uses the rvest package and the %>% operator to keep it really short and simple. library(rvest) library(dplyr) library(stringr) list [...] %>% html_text() %>% str_split("http://link.springer.com/openurl\\?genre=book&isbn=") ids ...

Installation of R 4.0 on Ubuntu 20.04 LTS and tips for spatial packages

May 2, 2020

Planning to upgrade your server installation from Ubuntu 18.04 to Ubuntu 20.04 LTS? It is also the best time to migrate to R 4.0! By the way, if you have always found it difficult to install R packages for geographical data on Ubuntu, those days are over. Just ...

Text Plots

May 2, 2020

A few weeks ago, we pushed the R package textplot to CRAN and it was accepted for release last week. The package contains straightforward functionality for the visualisation of text, namely text co-occurrences, text clusters (in this case biterm clusters), dependency parsing results, text correlations, and text frequencies. Some examples of these plots are shown in the gif. More details can be found in the pdf presentation shown below...

Riddler: Can You Flip Your Way To Freedom?

May 2, 2020

FiveThirtyEight’s Riddler Express: Dakota Jones is back in action. In her quest to locate the Temple of Diametra, she has found another highly symmetric crystal. However, nefarious agents have again gotten wind of her plans, and now Dakota and the crystal are nowhere to be found. And so, you must once again recreate the crystal using the data from Dakota’s laser scanner. As a reminder,...

Europe COVID-19 death map

May 2, 2020

COVID-19 deaths in Europe

# Europe COVID-19 deaths animated map
# http://r.iresmi.net/
# data European Centre for Disease Prevention and Control

# packages ----------------------------------------------------------------
library(tidyverse)
library(httr)
library(fs)
library(sf)
library(readxl)
library(janitor)
library(glue)
library(tmap)
library(grid)
library(classInt)
library(magick)
# + btb, raster, fasterize, plyr

# sources -----------------------------------------------------------------
# https://data.europa.eu/euodp/en/data/dataset/covid-19-coronavirus-data
covid_file [...] %>%
  complete(countryterritory_code, date_rep) %>%
  replace_na(list(deaths = 0)) %>%
  group_by(countryterritory_code) %>%
  arrange(date_rep) %>%
  mutate(deaths_cum = cumsum(deaths))

# keep only European countries minus Russia and adding TUR...

Generative art: Let your computer design you a painting

May 2, 2020

I really like generative art, or so-called algorithmic art. Basically, it means you take a pattern or a complex system of rules, and apply it to create something new following those patterns/rules. When I finished my PhD, I got a beautiful poster in which the k-nearest neighbors algorithm was used to generate a set of … Continue reading Generative...

Checking your Package for Compatibility with R 4.0.0

As a package maintainer you might be seeing an increasing number of questions from people who have recently migrated to R 4.0.0 and are now trying to get your package to work. Yet rhub::check_with_rrelease() currently still uses R 3.6.3 as its test base. While migrating to a new R version is always tempting, maybe you don’t feel like disrupting...

Combine values into a vector

May 1, 2020

Everything is a vector. Get to know different data types in R; learn how to create vectors; use the : operator to create numeric sequences; use the concatenate function c() to create vectors of different data types. 1:100 c(...
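
As a small illustration of the two tools named in this teaser, the : operator and c(), here is a base R sketch. The variable names are made up for the example and do not come from the original post.

```r
# The : operator creates a sequence of consecutive integers.
one_to_hundred <- 1:100
length(one_to_hundred)

# c() combines individual values into a single vector.
primes <- c(2, 3, 5, 7)
first_names <- c("Ada", "Grace")

# A vector holds one data type only, so mixed inputs are coerced
# to the most general type present (here: character).
mixed <- c(1, "a", TRUE)
class(mixed)
```

The last two lines show why "everything is a vector" matters in practice: combining a number, a string and a logical does not fail, it silently converts every element to a string.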

Asynchronous background execution in Shiny using callr

April 30, 2020

When designing Shiny applications we commonly associate asynchronous execution with multiple concurrently running sessions of an application. In such cases, when one user has requested a longer computation or a database query, the other users have to wait for this task to finish before they can see their plots and tables. These types of problems are elegantly solved with...

Time sinks for data wrangling

April 30, 2020

Estimating the time it will take to complete a project is an important part of being a successful PhD student, researcher or consultant. A major time sink on many analytical projects is data wrangling. Often the wranglin...

Animations in the time of Coronavirus

The first four months of 2020 have been dominated by the Coronavirus pandemic (aka COVID-19), which has transformed global life in an unprecedented way. Societies and economies struggle to adapt to the new conditions and necessary constraints. A reassuringly large fraction of governments around the world continue to take evidence-based approaches to this crisis that are grounded in large...

Rendering your README with GitHub Actions

April 30, 2020

There’s one thing that has bugged me for a while about developing R packages. We have all these nice, modern tools for tracking our code, producing web sites from the roxygen documentation, and so on. Yet for every code commit I make to the master branch of a package repo, there are often two or more additional steps...

Automated Programmatic Website Screenshots in R with {webshot} [Video Tutorial]

April 30, 2020

In this video tutorial, we explore the R package {webshot} by Winston Chang. This package internally uses PhantomJS to capture screenshots of web pages, Shiny applications, and R Markdown documents. {webshot} also lets you take a screenshot of a particular viewport or of a section of a website selected by a CSS selector. YouTube: https://youtu.be/oQKwd1cgiq4 Please subscribe and leave a comment if you...

Which Technology Should I Learn?

April 30, 2020

Knowing where to start can be challenging, but we're here to help. Read on to learn more about where to begin on your data science and analytics journey. Data science and analytics languages: If you’re new to data science and analytics, or your organization is, you’ll need to pick a language to analyze your data and a thoughtful way to make...

Why R? Webinar – Development pipeline for R production – rZYPAD

April 30, 2020

April 30th (8:00 pm GMT+2) is the date of the next webinar on the Why R? Foundation YouTube channel, featuring a talk by Lorenzo Braschi from Roche IT. The title of the meeting is rZYPAD: Development pipeline for R production. See you at the webinar...

Z is for Additional Axes

April 30, 2020

Here we are at the last post in Blogging A to Z! Today, I want to talk about adding additional axes to your ggplot, using the options for fill or color. While these aren't true z-axes in the geometric sense, I think of them as a third, z, axis. Some of ...

Vignette: Simulating a minimal SPSS dataset from R

April 29, 2020

What this is about 📖: I will simulate a minimal labelled survey dataset in R that can be exported as an SPSS (.SAV) file with full variable and value labels. I will also attempt to fabricate ‘meaningful patterns’ in the dataset so that it can be used more effectively for creating demo examples. Background: Simulating data is one of the...
