Blog Archives

Salaries by alma mater – an interactive visualization with R and plotly

April 27, 2017
By
Salaries by alma mater – an interactive visualization with R and plotly

Based on an interesting dataset from the Wall Street Journal I made the above visualization of the median starting salary for US college graduates from different undergraduate institutions (I have also looked at the mid-career salaries, and the salar...

Read more »

Understanding the Tucker decomposition, and compressing tensor-valued data (with R code)

April 4, 2017
By
Understanding the Tucker decomposition, and compressing tensor-valued data (with R code)

In many applications, data naturally form an n-way tensor with n > 2, rather than a “tidy” table. As mentioned in the beginning of my last blog post, a tensor is essentially a multi-dimensional array: a tensor of order one is a vector, which simply is a column of numbers, a tensor of order two is a matrix, which...

Read more »

Understanding the CANDECOMP/PARAFAC Tensor Decomposition, aka CP; with R code

April 2, 2017
By
Understanding the CANDECOMP/PARAFAC Tensor Decomposition, aka CP; with R code

A tensor is essentially a multi-dimensional array: a tensor of order one is a vector, which simply is a column of numbers, a tensor of order two is a matrix, which is basically numbers arranged in a rectangle, a tensor of order three looks like numbers arranged in rectangular box (or a cube, if all modes have the...

Read more »

Contours of statistical penalty functions as GIF images

March 17, 2017
By
Contours of statistical penalty functions as GIF images

Many statistical modeling problems reduce to a minimization problem of the general form: or where $f$ is some type of loss function, $\mathbf{X}$ denotes the data, and $g$ is a penalty, also referred to by other names, such as “regularization term” (problems (1) and (2-3) are often equivalent by the way). Of course both, $f$ and $g$, may depend on further...

Read more »

2D contours of several penalty functions in statistics as GIF images

March 13, 2017
By
2D contours of several penalty functions in statistics as GIF images

Many statistical modeling problems reduce to a minimization problem of the general form: or where $f$ is some type of loss function, $\mathbf{X}$ denotes the data, and $g$ is a penalty, also referred to by other names, such as “regularization term” (problems (1) and (2-3) are often equivalent by the way). Of course both, $f$ and $g$, may depend on further...

Read more »

Tired of doing real math 2 — grad school and coffee consumption

February 15, 2017
By
Tired of doing real math 2 — grad school and coffee consumption

Lately I notice a sharp increase in my coffee consumption (reading Howard Schultz’s Starbucks book, which is actually quite good by the way, does not help either :grimacing:). Having recently transitioned into a new PhD program I started wondering wh...

Read more »

Visualization of MRI data in R

January 27, 2017
By
Visualization of MRI data in R

Lately I was getting a little bored with genomic data (and then TCGA2STAT started to give me a segfault on my university’s high performance computing facility too :stuck_out_tongue:). So I decided to analyze some brain imaging data that I had lying a...

Read more »

Visualization of MRI data in R

January 27, 2017
By
Visualization of MRI data in R

Lately I was getting a little bored with genomic data (and then TCGA2STAT started to give me a segfault on my university’s high performance computing facility too :stuck_out_tongue:). So I decided to analyze some brain imaging data that I had lying a...

Read more »

Tired of doing real math 1 — some visualizations of Hillary Clinton and Donald Trump tweets

October 24, 2016
By
Tired of doing real math 1 — some visualizations of Hillary Clinton and Donald Trump tweets

As a grad student working primarily on statistical methodology, I regularly experience phases of total disillusionment with math/stats. Recently I realized that when I don’t feel like doing “real” math for prolonged periods of time, I instead ...

Read more »

dplyr basics

July 18, 2016
By
dplyr basics

This blog post demonstrates the usage of the R package dplyr. It turns out that dplyr is intuitive to the point where I probably won’t ever need to look back at this summary. A nice and very concise dplyr and tidyr cheat sheet is available from RSt...

Read more »

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)