May 2016

the random variable that was always less than its mean…

May 29, 2016 | xi'an

Although this is far from a paradox when realising why the phenomenon occurred, it took me a few lines to understand why the empirical average of a log-normal sample is apparently a biased estimator of its mean. And why the biased plug-in estimator does not appear to present a bias. ...
[Read more...]

Reminiscences of R in Finance 2016

May 29, 2016 | Pat

When I announced R in Finance 2016 I talked about 2 days of conference and 50 speakers.  I missed out the 3 days of sleep deprivation. But a pleasant 3 days of sleep deprivation it was — seeing old friends and making new ones. I’m not sure that Mother Mary believed me that in our … ... [Read more...]

Introduction to k-Means clustering in R

May 29, 2016 | ujjwalkarn

k-means clustering aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean, serving as a prototype of the cluster. I have provided below the R code to get started with k-means clustering in R. The dataset can be downloaded from ... [Read more...]

End to end Logistic Regression in R

May 29, 2016 | ujjwalkarn

Logistic regression, or logit regression is a regression model where the dependent variable is categorical. I have provided code below to perform end-to-end logistic regression in R including data preprocessing, training and evaluation. The dataset used can be downloaded from here. [Read more...]

Codes for common Data Frame operations in R

May 29, 2016 | ujjwalkarn

Listed below are codes for some data frame operations that are good to have at your fingertips: Create an empty data.frame Sort a dataframe by column(s) Merge/Join data frames (inner, outer, left, right) Drop data frame columns by name Remove rows with NAs in data.frame Quickly ... [Read more...]

Useful functions in R!

May 29, 2016 | ujjwalkarn

I have listed some useful functions below: with() The with( ) function applys an expression to a dataset. It is similar to DATA= in SAS. # with(data, expression) # example applying a t-test to a data frame mydata with(mydata, t.test(y ~ group)) Please look at other examples here and here. ... [Read more...]

Some useful R cheatsheets

May 29, 2016 | ujjwalkarn

Given below is a list of useful cheatsheets for R: Data Wrangling in R ggplot2 Cheatsheet Shiny Cheatsheet devtools Cheatsheet markdown Cheatsheet,  reference Data Exploration Cheatsheet   [Read more...]

Arbitrage in Euro’16 soccer odds?

May 28, 2016 | Florian Teschner

With the Euro’16 coming up in 2 weeks, I thought it would be great to look up the odds for each team. Using a small R-Script, I got the data from this site. As previously discussed, I cleaned the data (calculate the probability from the odds and then normalize the probability ... [Read more...]

RcppArmadillo 0.7.100.3.0

May 28, 2016 | Thinking inside the box

The first Armadillo release of the 7.* series is out: a new version 7.100.3. We uploaded RcppArmadillo 0.7.100.3.0 to CRAN and Debian. This followed the usual thorough reverse-dependecy checking of by now 230 packages using it. This release now requ... [Read more...]

Introduction to R for Data Science :: Session 4

May 28, 2016 | The Exactness of Mind

Welcome to Introduction to R for Data Science Session 4! The course is co-organized by Data Science Serbia and Startit. You will find all course material (R scripts, data sets, SlideShare presentations, readings) on these pages. Welcome to the fourth session of Introduction to R for Data Science! Check out the ... [Read more...]

An object has no name

May 27, 2016 | David Smith

No, it's not a Jaqen H'ghar quote. Recently, Hadley Wickham tweeted the following image: While this image isn't included in Hadley's Advanced R book, he does discuss many of the implications there. The most significant of these is that creating a copy of an object in R doesn't consume any ... [Read more...]

Integrating R with Apache Hadoop

May 27, 2016 | Sunila Gollapudi

Integrating R to work on Hadoop is to address the requirement to scale R program to work with petabyte scale data. The primary goal of this post is to elaborate different techniques for integrating R with Hadoop. Approach 1: Using R and Streaming APIs in Hadoop In order to integrate an ...
[Read more...]

Some Impressions from R Finance 2016

May 27, 2016 | Joseph Rickert

by Joseph Rickert R / Finance 2016 lived up to expectations and provided the quality networking and learning experience that longtime participants have come to value. Eight years is a long time for a conference to keep its sparkle and pizzazz. But, the conference organizers and the UIC have managed to create ... [Read more...]

Jeff Meisel’s Workshop @ ODSC

May 27, 2016 | Ari Lamstein

Today I’d like to draw attention to a workshop that Jeffrey Meisel gave at the Open Data Science Conference East (ODSC East) in Boston last weekend. Jeff is the Chief Marketing Officer of the US Census Bureau, and for a few months we’ve been talking about how to ... [Read more...]

#satRdays voting closes May 31st

May 27, 2016 | Steph

Voting for 2 of the 3 locations for satRday conferences will be closing at the end of May 31st (GMT). It’s been a phenomenal turnout with more than 1,500 votes so far. You can still vote if you haven’t already! EU status Budapest, Hungary, is where Gergely will be throwing the ... [Read more...]
1 2 3 4 15

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)