Sentiment analysis finds trouble in the Enron emails

May 24, 2013
By
Sentiment analysis finds trouble in the Enron emails

The Enron email dataset, collected during the FERC investigation of the Enron financial scandal, represents the largest publicly available set of emails. This makes theman ideal testbed for sentiment analysis algorithms. Ikanow's Andrew Strite used the open-source Infinit.e framework and a Hadoop cluster to generate sentiment scores for all of the Enron emails, and then used R to manipulate...

Read more »

Down and Dirty Forecasting: Part 2

May 24, 2013
By
Down and Dirty Forecasting: Part 2

This is the second part of the forecasting exercise, where I am looking at a multiple regression. To keep it simple I chose the states that boarder WI and...

Read more »

What is probabilistic truth? Part 2 – Everything is conditional

May 24, 2013
By
What is probabilistic truth? Part 2 – Everything is conditional

Read Part 1 When making a statement of the form “1/2 is the correct probability that this coin will land tails”, there are a few things which are left...

Read more »

Down and Dirty Forecasting: Part 1

May 24, 2013
By
Down and Dirty Forecasting: Part 1

I wanted to see what I could do in a hurry using the commands found at Forecasting: Principles and Practice . I chose a simple enough data set of Wisconsin Unemployment...

Read more »

Shiny + Concerto = YES !!!

May 23, 2013
By
Shiny + Concerto = YES !!!

So I have finally gotten beta access to the two most powerful R controlled web application makers in existence and produced very exciting experimental productsA few posts ago I posted...

Read more »

7th R/Rmetrics workshop in Switzerland, June 30-July 4

May 23, 2013
By

The 7th annual R/Rmetrics Workshop om Computational Finance and Financial Engineering will take place June 30-July 4 in the beatiful alpine setting of Lake Thune, Switzerland. This is an...

Read more »

Highlights of the Milwaukee Workshop on R and Bioinformatics

May 23, 2013
By
Highlights of the Milwaukee Workshop on R and Bioinformatics

by Joseph Rickert On May 10th and 11th, in honor of this being the International Year of Statistics, the Milwaukee Chapter of the American Statistical Association (MILWASA) held a...

Read more »

Veterinary Epidemiologic Research: Modelling Survival Data – Non-Parametric Analyses

May 23, 2013
By
Veterinary Epidemiologic Research: Modelling Survival Data – Non-Parametric Analyses

Next topic from Veterinary Epidemiologic Research: chapter 19, modelling survival data. We start with non-parametric analyses where we make no assumptions about either the distribution of survival times or...

Read more »

Generating a Markov chain vs. computing the transition matrix

May 23, 2013
By
Generating a Markov chain vs. computing the transition matrix

A couple of days ago, we had a quick chat on Karl Broman‘s blog, about snakes and ladders (see http://kbroman.wordpress.com/…) with Karl and Corey (see http://bayesianbiologist.com/….), and the use of...

Read more »

The R-Podcast Episode 13: Interview with Yihui Xie

May 23, 2013
By

It’s an episode of firsts on the R-Podcast! In this episode recorded on location I had the honor and privilege of interviewing Yihui Xie, author of many innovative packages...

Read more »

Vote in the KDnuggets poll on Analytics Software

May 22, 2013
By

The 14th annual KDnuggets poll measuring use of analytics software is open for voting. The poll asks, "What Predictive Analytics, Big Data, Data mining, Data Science software you used...

Read more »

How Important is Variable Selection?

May 22, 2013
By
How Important is Variable Selection?

Very. If you have 10 possible independent regressors, and none of which matter, you have a good chance to find at least one is important. A good chance being...

Read more »

Operating on files with R: copy and rename

Nowadays, routinary operations on files, such as renaming or copying, are performed with some mouse clicks. Sometimes, it is useful perform this operations in batch. Linux users perform this...

Read more »

What happened to six million voters?

May 22, 2013
By
What happened to six million voters?

The recent elections in Pakistan on May 11 were a great success by all means. In spite of the threats for violence by Al-Qaeda and its local franchises in...

Read more »

My Prime Sieve – Homage to Yitan Zhang

May 22, 2013
By
My Prime Sieve – Homage to Yitan Zhang

# As a homage to Yitang Zhang who has proven a mind-bending property of Prime Pairs, I have written a prime Sieve to detect all of the prime numbers from...

Read more »

Video: R, ProjectTemplate, RStudio and GitHub: Automate the boring bits and get on with the fun stuff

May 22, 2013
By

This post shares the video from the talk presented on 15th May 2013 by Dr Kendra Vant on ProjectTemplate, github and Rstudio at Melbourne R Users. Overview: Want to...

Read more »

Get your questions answered about Open Data

May 21, 2013
By

The OpenData StackExchange site has just launched in beta, and looks to be a great resource for open data sources. Like StackOverflow for programming and CrossValidated for statistics, OpenData...

Read more »

Getting to the point – an alternative to the bezier arrow

May 21, 2013
By
Getting to the point – an alternative to the bezier arrow

(This article was first published on G-Forge » R, and kindly contributed to R-bloggers) An alternative bezier arrow to the regular grid-bezier. Apart from a cool gradient it has...

Read more »

Spatial correlograms in R: a mini overview

May 21, 2013
By
Spatial correlograms in R: a mini overview

Spatial correlograms are great to examine patterns of spatial autocorrelation in your data or model residuals. They show how correlated are pairs of spatial observations when you increase the...

Read more »

Slide: one function for lag/lead variables in data frames, including time-series cross-sectional data

May 21, 2013
By

I often want to quickly create a lag or lead variable in an R data frame. Sometimes I also want to create the lag or lead variable for different...

Read more »

An R debugging example

May 21, 2013
By

The steps taken to fix an R problem. Task To prepare for the Portfolio Probe blog post called “Implied alpha and minimum variance”, I tried to update a matrix...

Read more »

R programming challenge: Escape the zombie horde

May 20, 2013
By
R programming challenge: Escape the zombie horde

So when the world is taken over by a Zombie horde, you're going to want to figure out a way to get the human population to safety. This R...

Read more »

Solving Multiple Supplier Selection Problem using R and LP Solve

May 20, 2013
By
Solving Multiple Supplier Selection Problem using R and LP Solve

(This article was first published on Enterprise Software Doesn't Have to Suck, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and...

Read more »

R 3.0.1 is released

May 20, 2013
By
R 3.0.1 is released

R 3.0.1 (codename “Good Sport”) was released last week. As mentioned earlier by David, this version improves serialization performance with big objects, improves reliability for parallel programming and fixes...

Read more »

Non-Verbal Reasoning Test – Concerto

May 20, 2013
By
Non-Verbal Reasoning Test – Concerto

I have just released my first complete test of non-verbal problem solving skills.  It is run on Concerto (an R-based application development platform targeted at primarily test developers)  Try...

Read more »

More on Chutes & Ladders

May 20, 2013
By
More on Chutes & Ladders

Matt Maenner asked about the sawtooth pattern in the figure in my last post on Chutes & Ladders. Damn you, Matt! I thought I was done with this. Don’t...

Read more »

Model fitting exam problem

May 20, 2013
By

Recently I have run an exam where the following question had risen many problems for students (here I give its shortened formulation). You are given the data generating process...

Read more »

qdap 0.2.2 released

May 20, 2013
By
qdap 0.2.2 released

I’m very pleased to announce the release of qdap 0.2.2 This is the third installment of the qdap package available at CRAN. The qdap package automates many of the...

Read more »

Implied alpha and minimum variance

May 20, 2013
By
Implied alpha and minimum variance

Under the covers of strange bedfellows. Previously The idea of implied alpha was introduced in “Implied alpha — almost wordless”. In a comment to that post Jeff noticed that...

Read more »

Contributing Blogs