Blog Archives

Hot news detection using Wikipedia

June 29, 2015
By
Hot news detection using Wikipedia

Wikipedia is, indeed, one of the largest free access crowdsourcing sources of information in today's world. Every day, thousands of people land on its pages to get information about different topics. Moreover, many machine learning algorithms (including text mining, semantic web, etc.) get their input from Wikipedia; for example, Google’s Knowledge Graph is mainly built around Wikipedia .Every...

Read more »

Simple Regime Change Detection with t-test

May 21, 2015
By
Simple Regime Change Detection with t-test

It is always fun to find trend in time series data. But what about the scenarios where the trend in the time series changes. Detecting the point of this trend change can be quite beneficial. For example, if you can immediately detect the change in revenue regime of a company it can be very valuable to that company. Or...

Read more »

Bayesian network in R: Introduction

February 15, 2015
By
Bayesian network in R: Introduction

Bayesian networks (BNs) are a type of graphical model that encode the conditional probability between different learning variables in a directed acyclic graph. There are benefits to using BNs compared to other unsupervised machine learning techniques. A few of these benefits are:It is easy to exploit expert knowledge in BN models. BN models have been found to be very robust...

Read more »

Goodness of fit test in R

January 26, 2015
By
Goodness of fit test in R

As a data scientist, occasionally, you receive a dataset and you would like to know what is the generative distribution for that dataset. In this post, I aim to show how we can answer that question in R. To do that let's make an arbitrary dataset that we sample from a Gamma distribution. To make the problem a little...

Read more »

stacked plot in R

January 18, 2015
By
stacked plot in R

Consider the following example: there is a three-stage truck maintenance pipeline. Initially, when a Truck comes to the maintenance service, it is added to the first stage and its status in the pipeline is set to "New". When the technicians start working on it, for diagnosis the problem and removing the issue, its status is changed to "in progress"....

Read more »

Search R-bloggers


Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)