Analysing the US election using Youtube data

September 18, 2016
By
Analysing the US election using Youtube data

Youtube is one of the channels the candidates for the US election use extensively to promote themself. Using the public Youtube API and the R package tuber it is pretty straightforward to create a snapshop of the online discussion and sentiment. First...

Read more »

vecpack: an R package for packing stuff into vectors

September 18, 2016
By
vecpack: an R package for packing stuff into vectors

Here’s a problem I’ve had again and again: let’s say you’ve defined a statistical model with several parameters. One of them is a scalar. Another is a matrix. The third one is a vector, and so on. When fitting the model the natural thing to do is to write a likelihood function that takes as

Read more »

Analyzing World Bank data with WDI, googleVis Motion Charts

September 17, 2016
By
Analyzing World Bank data with  WDI, googleVis Motion Charts

Recently I was surfing the web, when I came across a real cool post New R package to access World Bank data, by Markus Gesmann on using googleVis and motion charts with World Bank Data. The post also introduced me to Hans Rosling, Professor of Sweden’s Karolinska Institute. Hans Rosling, the creator of the famous … Continue...

Read more »

A few thoughts on the existing code parallelization

September 17, 2016
By

A few weeks ago I worked on some old code parallelization. The whole process made me think about how efficient parallelization of the existing code in R can really be and what should be considered efficient. There is a lot … Continue reading →

Read more »

New Zealand Election Study individual level data

September 17, 2016
By
New Zealand Election Study individual level data

Individual level data is essential to understand voting behaviour My previous analysis has occasionally come up against the problem “only individual level data could resolve that,”. Since I last wrote that, the New Zealand Election Study data for ...

Read more »

Reflections on EARL London 2016

September 16, 2016
By
Reflections on EARL London 2016

The Mango Solutions team have done it again: another excellent Effective Applications of R (EARL) conference just wrapped up here in London. The conference was attended by almost 400 R users from companies all around the world, and was a really fun experience. I was honored to deliver a keynote presentation, alonside keynotes from Joe Cheng and Garrett Grolemund...

Read more »

Fixing “Peer certificate cannot be authenticated”

September 16, 2016
By

I’m currently getting the following error on a Windows machine: The machine in question is sitting behind a gnarly firewall and proxy, which I suspect are the source of the problem. I also need to use --ignore-certificate-errors when running chromium-browser, which points to the same issue. This seems to resolve the issue: The post Fixing “Peer certificate cannot...

Read more »

How to add pbapply to R packages

September 16, 2016
By
How to add pbapply to R packages

As of today, there are 20 R packages that reverse depend/import/suggest (3/14/3) the pbapply package. Current and future package developers who decide to incorporate the progress bar using pbapply might want to customize the type and style of the progress bar in their packages to better suit the needs of certain functions or to create a distinctive look. Here is a quick guide to help...

Read more »

anytime 0.0.2: Added functionality

September 15, 2016
By

anytime arrived on CRAN via release 0.0.1 a good two days ago. anytime aims to convert anything in integer, numeric, character, factor, ordered, ... format to POSIXct (or Date) objects. This new release 0.0.2 adds two new functions to gather conversi...

Read more »

Collapsing a bipartite co-occurrence network

Collapsing a bipartite co-occurrence network

This note is a follow-up to the previous one. It shows how to use student-submitted keywords to find clusters of shared interests between the students. Dear students If you enjoyed my previous note, this one might also entertain you. And since your real first names are used in the data, you should be able to tell me later if...

Read more »

tidyverse 1.0.0

September 15, 2016
By
tidyverse 1.0.0

The tidyverse is a set of packages that work in harmony because they share common data representations and API design. The tidyverse package is designed to make it easy to install and load core packages from the tidyverse in a single command. The best place to learn about all the packages in the tidyverse and how

Read more »

lubridate 1.6.0

September 15, 2016
By
lubridate 1.6.0

I am pleased to announced lubridate 1.6.0. Lubridate is designed to make working with dates and times as pleasant as possible, and is maintained by Vitalie Spinu. You can install the latest version with: install.packages("lubridate") This release includes a range of bug fixes and minor improvements. Some highlights from this release include: period() and duration() constructors now accept character

Read more »

Network Analysis Part 1 Exercises

September 15, 2016
By
Network Analysis Part 1 Exercises

In this set of exercises we shall create an empty graph and practice the functions for basic manipulation with vertices and edges, using the package igraph. If you don’t have the package already installed, install it using the following code: install.packages("igraph") and load it into the session using the following code: library("igraph") before proceeding. You

Read more »

EARL 2016 talk

September 15, 2016
By
EARL 2016 talk

I spoke on 14th September at the EARL (Effective Applications of the R Language) conference in London. This is event is concerned exclusively with the R programming language and it was the first… Continue reading →

Read more »

2016 Data Science Salary Survey results

September 15, 2016
By
2016 Data Science Salary Survey results

O'Reilly has released the results of the 2016 Data Science Salary Survey. This survey is based on data from over 900 respondents to a 64-question survey about data-related tasks, tools, and the salary they receive from doing/using them. The median salary reported in the survey was US$87,000; amongst data scientists in the US, the median salary was US$106,000. Appropriately...

Read more »

Why you need version control

September 15, 2016
By

I recently had an email exchange with a seasoned, well respected analytical professional which included the following (from them, not me): “… my versioning is to have multiple versions of files and to use naming conventions… it works really well....

Read more »

HIBPwned updated on CRAN

September 15, 2016
By

Haveibeenpwned.com is a fantastic service that helps people find out if they’ve been involved in a data breach. HIBPwned is an R wrapper for that service. Recently, due to abuse of the system, Troy Hunt had to add a limit of one request per 1.5s. The new version published on CRAN last night adds a The post

Read more »

Data Science 101, now online

September 14, 2016
By

We are delighted to note that IBM's BigDataUniversity.com has launched the quintessential introductory course on data science aptly named Data Science 101.The target audience for the course is the uninitiated cohort that is curious about data science and would like to take the baby steps to a career in data and analytics. Needless to say, the course is for...

Read more »

Monitoring R Applications with RZabbix

September 14, 2016
By
Monitoring R Applications with RZabbix

As R users we mostly perform analysis, produce reports and create interactive shiny applications. Those are rather one-time performances. Sometimes, however, the R developer enters the world of the real software development, where R applications shoul...

Read more »

How I made some Pokémon Business Cards

September 14, 2016
By
How I made some Pokémon Business Cards

As I’m in the industry now I figured I needed some business cards and as it seems the 90s never left us and Japanese monsters are hip again, I decided to make them Pokémon themed. I think they turned out pretty well, and here I’m just going ...

Read more »

GoodReads: Exploratory data analysis and sentiment analysis (Part 2)

September 14, 2016
By
GoodReads: Exploratory data analysis and sentiment analysis (Part 2)

After scraping reviews from Goodreads in the first installment of this series, we are now ready to do some exploratory data analysis to get a better sense of the data we have. This will also allow us to create features that we will use in future analyses. Setup and data preparation We start by loading Related Post

Read more »

2016-12 ‘DOM’ Version 0.2

September 13, 2016
By

This report describes changes in version 0.2 of the ‘DOM’ package for R. Version 0.1 of ‘DOM’ allowed HTML content to be added to a web page (or removed or modified); version 0.2 adds the ability to append SVG content … Continue reading →

Read more »

Forecasting Opportunities

September 13, 2016
By
Forecasting Opportunities

The previous post in this series, showed a way to identify trading opportunities. The approach I implemented used time series daily data to identify good entry points in terms of risk-reward. The natural next step is to try to make use of these opportunities using machine learning. To refresh: the output of the previous post The post

Read more »

Announcing the simputation package: make imputation simple

September 13, 2016
By

I am happy to announce that my simputation package has appeared on CRAN this weekend. This package aims to simplify missing value imputation. In particular it offers standardized interfaces that make it easy to define both imputation method and imputation … Continue reading →

Read more »

New Version of the OpenStreetMap R Pacakge

September 13, 2016
By
New Version of the OpenStreetMap R Pacakge

A new version of the OpenStreetMap package has been released to CRAN. OpenStreetMap 0.3.3 contains several minor improvements. I've removed the CloudMade tile set types, as they seem to have gone out of business. MapQuest has also been removed as they have moved to a new API. The mapbox type has been updated to use their

Read more »

A predictive maintenance solution template with SQL Server R Services

September 13, 2016
By
A predictive maintenance solution template with SQL Server R Services

by Jaya Mathew, Data Scientist at Microsoft By using R Services within SQL Server 2016, users can leverage the power of R at scale without having to move their data around. Such a solution is beneficial for organizations with very sensitive, big data which cannot be hosted on any public cloud but does most of their coding in R....

Read more »

Independent t test in R

September 13, 2016
By
Independent t test in R

The independent t test is used to test if there is any statistically significant difference between two means. Use of an independent t test requires several assumptions to be satisfied. The assumptions are listed below The variables are continuous and independent The variables are normally distributed The variances in each group are equal When these

Read more »

New features in imager 0.30

September 13, 2016
By
New features in imager 0.30

imager is an R package for image processing, based on CImg. This new release brings many new features, including: Support for automatic parallel processing using OpenMP. A new S3 class, imlist, which makes it easy to work with image lists New functions for interactively selecting image regions (grabRect,grabPoint,grabLine) Experimental support for CImg’s byte-compiled DSL via

Read more »

anytime 0.0.1: New package for ‘anything’ to POSIXct (or Date)

September 13, 2016
By
anytime 0.0.1: New package for ‘anything’ to POSIXct (or Date)

anytime just arrived on CRAN as a very first release 0.0.1. So why (yet another) package dealing with dates and times? R excels at computing with dates, and times. By using typed representation we not only get all that functionality but also of the a...

Read more »

Sponsors

Mango solutions



plotly webpage

dominolab webpage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

datasociety

http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training









Contact us if you wish to help support R-bloggers, and place your banner here.