Blog Archives

Experimenting with Hierarchical Clustering in a galaxy far far away…

July 14, 2019
By
Experimenting with Hierarchical Clustering in a galaxy far far away…

Introduction This post will be taking a bit of an unexpected diversion. As I was experimenting with hierarchical clustering I ran into the issue of how many clusters to assume. From that point I went deep into the rabbit hole and found out some really useful stuff that I wish I’d have known when I wrote my previous post. I’ve discovered...

Read more »

Use the k-means clustering, Luke

July 6, 2019
By
Use the k-means clustering, Luke

In my last post I scraped some character statistics from the mobile game Star Wars: Galaxy of Heroes. In this post, I’ll be aiming to try out k-means clustering in order to see if it comes out with an intuitive result, and to learn how to integrate this kind of analysis into a tidy workflow using broom. First I’ll load...

Read more »

Experimentation with Unsupervised Learning

July 3, 2019
By
Experimentation with Unsupervised Learning

Motivation I’ve written before about my learning plans, which always seem to be in a state of flux, and in particular learning about machine learning. Part of the reason why I’m so reticent is because I’m a mathematician and statistics does not come natural or easy for me. My limited past experience has exposed to me just how much I don’t...

Read more »

Are R ecosystems the future?

March 17, 2019
By

Some random thoughts… Over the past 6 months I’ve been creating, refining, and delivering a variety of ‘Introduction to R’ training courses. The more I do this, the more I come to the view that not nearly enough is made of taking an ecosystem-oriented view to packages. A good way of talking about #rstats functionality is in terms of ecosystems, rather...

Read more »

Let’s call it tidysearch

January 23, 2019
By

R became 25 years old last year, and yet it’s only in relatively recent years that the language has really taken off with numerous conferences every year driven by a passionate and vibrant community of users. A large part of this has been driven by a...

Read more »

Mapping homelessness in England

September 12, 2018
By
Mapping homelessness in England

Introduction Data wrangling Initial analysis The painful part Introduction For this blog post, I decided to try to find a dataset covering an issue I feel quite strongly about - homelessness. I managed to find a fairly large dataset from the Cambridgeshire Insight website. For a while I’ve wanted to try out R’s mapping potential and hopefully generate a heatmap, so I’ve deliberately tried to...

Read more »

Two years in Data Science and not yet a Data Scientist

July 25, 2018
By

What’s in a name? Despite the potentially grumpy sounding title of this post, this is more a positive reflection of the past two years since I started working in Data Science. I think I’ve come a long way, but there is still so far to to go if I am to confidently call myself a Data Scientist. Why does a...

Read more »

Portsmouth R User Group – 2nd Meeting

July 24, 2018
By
Portsmouth R User Group – 2nd Meeting

Last month I attended my first ever R User Group meeting, which was held at the University of Portsmouth in their impressive Future Technology Centre. I’d been itching to go to one of these meetups for a while, but unfortunately there was nothing in the South of England, so when this opportunity came around I couldn’t miss it, especially...

Read more »

Learning some tough lessons!

March 20, 2018
By

It’s been a while since I’ve posted - mainly because I got myself into a mess with Git and I’ve been putting off trying to sort it out. I’ve been wanting to post about htmlwidgets for a while now, and in my naivity I thought I’d retrospectively try to use the DT package on one of my older posts...

Read more »

Embarking on nested dataframes

January 20, 2018
By

In a recent sprint, I was faced with the problem of carrying out analysis on data extracted from a database where there were several instances of the same table type and I wanted to do the same tasks on each of them. I know enough about the tidyverse to realise that this was a good opportunity to use functions...

Read more »

Search R-bloggers

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)