Figure 1: Synthetic data and fitted curves. S-shaped data can be found in many applications. Such data can be approximated with the logistic distribution function. The cumulative distribution function of the logistic distribution is a...

KL divergence (Kullback-Leibler), or KL distance, is a non-symmetric measure of the difference between two probability distributions. It is related to mutual information and can be used to measure the association between two random variables. In this short tutorial, I show how to compute KL divergence and mutual information for two categorical variables, interpreted as discrete random variables. ${\bf Definition}$: Kullback-Leibler (KL) Distance...
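As a rough illustration of the two quantities named in this excerpt, here is a minimal Python sketch. It assumes the standard textbook definitions, $KL(P \| Q) = \sum_i p_i \log(p_i / q_i)$ and mutual information as the KL divergence between the joint distribution and the product of the marginals. The function names `kl_divergence` and `mutual_information` are hypothetical and are not taken from the original post.

```python
import math
from collections import Counter


def kl_divergence(p, q):
    """KL(p || q) in nats for two discrete distributions.

    p and q are aligned sequences of probabilities. Assumption: q[i] > 0
    wherever p[i] > 0; otherwise the divergence is infinite and this
    sketch raises ZeroDivisionError.
    """
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def mutual_information(xs, ys):
    """Empirical mutual information (in nats) of two categorical samples.

    xs and ys are equal-length sequences of observed category labels.
    """
    n = len(xs)
    joint = Counter(zip(xs, ys))  # counts of each (x, y) pair
    count_x = Counter(xs)         # marginal counts of x
    count_y = Counter(ys)         # marginal counts of y
    # p(x, y) * log( p(x, y) / (p(x) * p(y)) ), written with raw counts
    return sum(
        (c / n) * math.log(c * n / (count_x[x] * count_y[y]))
        for (x, y), c in joint.items()
    )
```

Swapping the arguments of `kl_divergence` generally changes the result, which is the non-symmetry mentioned above. For two identical samples, `mutual_information` reduces to the entropy of the sample.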

Summary: In data science and machine learning applications, different features or predictors commonly appear on different scales. This can make it difficult to interpret the resulting coefficients of a linear regression, such as one featur...

The Euclid Algorithm provides a solution to the greatest common divisor (GCD) of two natural numbers $x_{1}$ and $x_{2}$, denoted by $GCD(x_{1}, x_{2})$. This is the largest integer that divides both $x_{1}$ and $x_{2}$. The solution is proposed by ...
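The idea can be sketched in a few lines of Python. This is the common remainder-based form of Euclid's algorithm, which may differ from the variant the original post presents. The function name `gcd` is chosen here for illustration.

```python
def gcd(x1, x2):
    """Greatest common divisor of two non-negative integers.

    Repeatedly replaces (x1, x2) with (x2, x1 mod x2); when the
    remainder reaches zero, the last non-zero value is the GCD.
    """
    while x2 != 0:
        x1, x2 = x2, x1 % x2
    return x1


print(gcd(48, 18))  # 6
```

Each step preserves the set of common divisors of the pair, so the value left when the remainder vanishes divides both original numbers and no larger integer does.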

Ensuring the quality and correctness of statistical software, or scientific software in general, is one of the main responsibilities of scientific software developers and scientists who provide code to solve a specific computational task. Sometimes t...

Functional languages allow us to treat functions as types. This brings the distinct advantage of being able to write code that generates further code, a practice generally known as metaprogramming. As a functional language, the R project provides ...

John Chambers, the creator of the S language from which R is derived, said in one of his books, Software for Data Analysis: Programming with R: ...This places an obligation on all creators of software to program in such a way that the computations ca...

Normally, this blog contains only technical and scientific posts. But this time I would like to share with you a very interesting phenomenon I came across on the R mailing list(s). I call it 'Ripley Facts' after the prolific statistician, ...

