Blog Archives

Single Word Analysis of Early 19th Century Poetry Using tidytext

June 12, 2017

When reading poetry, it feels as if you are reading emotion. A good poet can use words to evoke in their reader whichever emotions they choose. To do this, these poets often have to look deep within themselves, using their own lives as fuel. For this r...

SatRday and visual inference of vine copulas

February 19, 2017

SatRday: From the 16th to the 18th of February, satRday was held in the City of Cape Town in South Africa. The programme kicked off with two days of workshops, followed by the conference on Saturday. The workshops were divided up into three large sections:...

Rewiring replyr with dplyr

February 16, 2017

Introduction of Parameterized dplyr expression: The usefulness of any small function you write will eventually be judged on its ability to be applied generically across arbitrary data. As I explored a blog post from Dec 2016, I became a lot more...

Data Scientist with a Wine hobby (Part II)

October 9, 2016

Getting introduced to Google and their API: Previously we constructed a data frame that consisted of the following variables that we might want to analyse: ## "wine_farms" "wine_year" "wine_name" "points" "price" ## ...

Data Scientist with a wine hobby (Part I)

September 11, 2016

After high school, I made my way from Johannesburg, situated in the northern part of South Africa, to the famous wine country of Stellenbosch in the south. Here, for the first time, I got a ton of exposure to wine and the countless varietals that ma...

Mirror, mirror on the wall

September 4, 2016

Introduction: Saving your R dataframe to a .csv can be useful; being able to view the data all at once can help to see the bigger picture. Often though, multiple dataframes, all pieces of the same project, need to be viewed this way and related back to...

Gotta catch them all

August 21, 2016

Introduction: When data becomes high-dimensional, the inherent relational structure between the variables can sometimes become unclear or indistinct. One might want to find clusters for any number of reasons; me, I want to use it to better unde...

Untangling overlapping cellphone usage segments

February 22, 2016

Our behaviour is often highly variable, and reducing it to a single number such as an average might be comforting but is ultimately misleading. For instance, I generally use 500 MB of data on my cellphone each month, but this can be as little as 200 MB a...

Cleaning up messy R code

February 15, 2016

Are you the type of person who likes your code to be consistently indented, the spacing to be uniform throughout your script, and everything to be clear and aligned, easy to read and just nice to look at? Well, I am. Often I get tremendously untidy code from o...

Automated parameter selection for LOESS regression

February 11, 2016

Typically, when we want to understand the relationship between two variables, we simply regress one on the other, plot the points and fit a linear trend line. To illustrate this, we use the EuStockMarkets dataset pre-loaded in R. The dataset contains th...
