This is the third part of the of the post where we are replicating the figures from a paper and in this part we are going to create figure 2 the Chronohorogram. Part 2 of this series we created temporal plot for understanding seasonality of the data records (Figure 1b). If you ...
Geospatial data - data embedded in a spatial context - is used across disciplines, whether it be history, biology, business, tech, public health, etc. Along with community contributors, we're working on a suite of tools to make working with spatial data in R as easy as possible.
If you're not ... [Read more...]
In the past couple of weeks I’ve noticed a flurry of visualizations of global sea ice extent on social media.
If you’re like me, and curious to see what the data look like yourself, here’s a bit of R code to fetch and visualize the most recent ... [Read more...]
So, in my last post, I showed how to create two histograms from a certain data set and then how to plot the two variables to see if there is any relationship. Visually, it was easy to tell that there was a negative relationship between the weight of an automobile ...
The magrittr pipe (%__%) has revolutionised the way many people now write R code. I’ve been using R for over 7 years and the pipe has become a staple of my programming conventions. However it was recently brought to my attention that you can actually use pipes within function calls, which ...
Scientists spend a lot of time “munging” data. Finding, cleaning, and managing datasets can take up the majority of the time it takes to complete an analysis. Tools that make the munging process easier can save scientists a lot of time.
We are tackling a small part of this problem ... [Read more...]
Scientists spend a lot of time “munging” data. Finding, cleaning, and managing datasets can take up the majority of the time it takes to complete an analysis. Tools that make the munging process easier can save scientists a lot of time.
We are tackling a small part of this problem ... [Read more...]
I am very pleased to announce the first release of the ggforce package.
ggforce is a general ggplot2 extension package in the same vein as ggalt with no overarching goal
other than to provide additional functionality to the ggplot2 universe. The
inc... [Read more...]
Last week’s post showed how to create a Gilmore Girls character network.
In this week’s short post, I want to explore the Gilmore Girls’ famous coffee addiction by analysing the same episode transcripts that were also used last week.
I am also ...
On my trip to Japan, I took this photo of the stairs leading to the “Rucker Park of Tokyo.” I crossed up some Tokyo cats, they were garbage. That one girl behind the blue pillar was practicing her hip hop power moves. She thought no one could see, but I ...
Descriptive Analytics is the examination of data or content, usually manually performed, to answer the question “What happened?”. In order to be able to solve this set of exercises you should have solved the part 0, part 1, part 2 ,and part 3 of this series but also you should run this script which ... [Read more...]
The often demand in the biostatistical research is to group patients depending on explanatory variables that are continuous. In some cases the requirement is to test overall survival of the subjects that suffer on a mutation in specific gene and have ... [Read more...]
Fusion and Runtime Compilation
Today’s deep learning models perform tens of thousands of operations on GPU. The input and output of each GPU kernel has to be stored in the global memory, but read and write on global memory is much slower than on on-chip register. When some special ...
In an article called A Paradox in the Interpretation of Group Comparisons published in Psychological Bulletin, Lord (1967) made famous the following controversial story:A university is interested in investigating the effects of the nutritional diet its...
A decision tree for function minimization What R calls “optimization” is more generally known as function minimization. The tools in the stats package function optim() are all essentially function mimizers, as are those in the package optimrx found at https://r-forge.r-project.org/projects/optimizer/. optimrx tries to make it ... [Read more...]
Basis and principle of D3partitionR D3partitionR is to plot sequential and hierarchical data using treemap (and circle Tree Map), sunburst and partition chart, collapsible trees (indented or not). The package only has one all-in function called D3partitionR(…) to create a partition chart. Two others functions as renderD3... [Read more...]
Introduction There are many packages available in R like data.table, tables, psych etc. to provide descriptive statistics like mean, standard deviation etc. group-wise(factor-wise) for number variables. In this article, an attempt is made to generate similar type of tabulated results utilizing the functions available in the base package ... [Read more...]
In this set of exercises you will be presented with real-life problems in marketing. Your task will be to choose appropriate nonparametric statistical technique and solve the problem using appropriate R functions. Answers to the exercises are available here. Exercise 1 A company wants to learn if sales income is equaly ... [Read more...]
A while back I spent some time figuring out how to serve interactive shiny apps through my website, but I haven't had a chance to build anything until recently. I set out to create a few simple shiny apps in R that I could use as a sort of test ... [Read more...]