September 2011

Datasets to Practice Your Data Mining

September 16, 2011 | Yanchang Zhao

There are many datasets available online for free for research use. Some of them are listed below.
- The R Datasets Package: There are around 90 datasets available in the package. Most of them are small and easy to feed … [Read more...]

Project Euler: problem 2

September 16, 2011 | MK

Each new term in the Fibonacci sequence is generated by adding the previous two terms. By starting with 1 and 2, the first 10 terms will be: 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, ... By considering the terms in the Fibonacci sequence whose values do not exc...
[Read more...]
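The rule quoted in the teaser (each new term is the sum of the previous two, starting from 1 and 2) can be sketched in a few lines. This is only an illustration in Python, not the code from the linked post; the function name `fibonacci_terms` and its parameters are hypothetical, and the sketch deliberately stops at listing terms because the rest of the problem statement is cut off above.

```python
def fibonacci_terms(count, first=1, second=2):
    """Return the first `count` terms of the sequence.

    Each new term is the sum of the previous two; the starting
    values 1 and 2 follow the teaser text.
    """
    terms = []
    a, b = first, second
    for _ in range(count):
        terms.append(a)
        # Slide the window forward: the next term is a + b.
        a, b = b, a + b
    return terms


print(fibonacci_terms(10))
# → [1, 2, 3, 5, 8, 13, 21, 34, 55, 89]
```

The printed list matches the ten terms given in the excerpt.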

How Lloyd’s of London uses R for Insurance

September 15, 2011 | David Smith

Lloyd's is the world's leading specialist insurance market, and is often the first to insure new, unusual or complex risks. So it's no surprise that Lloyd's is one of the many companies that use R and its advanced capabilities for data analysis to help manage its insurance risks. At the ... [Read more...]

Using Google Spreadsheets with R: an update

September 15, 2011 | David Smith

Prompted by a rush of visitors from Andrew Gelman's blog, I went back and updated the details of my post from 2009 on reading data from Google Spreadsheets into R. Since then, Google had switched to using a secure (https) connection for Google Docs, which required some tweaks to the code. ... [Read more...]

Reporting Good Enough to Share

September 15, 2011 | klr

Sorry to all my faithful readers for my absence recently. I started a new job at a new firm, so my blogging has moved down the priority list but only temporarily. I am still committed to documenting my thoughts, especially finance and R thoughts as dis...
[Read more...]

littler 0.1.4

September 15, 2011 | Thinking inside the box

Matthias Klose, the tireless force behind the Debian / Ubuntu gcc, python, and what have you packages, sent me a minimal patch to let littler build when the ld linker uses the --as-needed option (as Ubuntu builds now do): all it took was a little reor... [Read more...]

Project Euler: problem 1

September 15, 2011 | MK

To be fairly honest (assuming there are degrees of honesty), I do know a little about math and programming but I don't know much math or any programming. I've loved math for a long time, but started to learn and understand fairly recently. So during th...
[Read more...]

R Fork Bomb

September 14, 2011 | wrathematics

So maybe I’m a strange guy, but I think fork bombs are really funny. What’s a fork bomb? The basic premise is that you spawn a process that spawns a process that spawns a process…, ad infinitum. The most beautiful example of a fork bomb, and really one ... [Read more...]

Shortest paths to/from nodes of a certain type

September 14, 2011 | Michał

Elijah asked the following via the SOCNET mailing list: I was wondering if anyone knew of a script or tool which would give me the network distance of nodes to a particular class of nodes. I think of this as an Erdos number, except instead of getting the distance to one ... [Read more...]
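One common way to get the distance from every node to the nearest member of a class of nodes is a breadth-first search started from all members of that class at once. The sketch below is in Python and is not the solution from the linked post; it assumes an unweighted, undirected graph stored as an adjacency dictionary, and the names `distance_to_class`, `adjacency`, and `targets` are hypothetical.

```python
from collections import deque


def distance_to_class(adjacency, targets):
    """Return {node: hops to the nearest target node}.

    `adjacency` maps each node to an iterable of its neighbours.
    Nodes that cannot reach any target are left out of the result.
    """
    # Every target starts at distance 0, so the search expands
    # from all of them simultaneously (multi-source BFS).
    dist = {node: 0 for node in targets}
    queue = deque(dist)
    while queue:
        node = queue.popleft()
        for neighbour in adjacency.get(node, ()):
            if neighbour not in dist:
                dist[neighbour] = dist[node] + 1
                queue.append(neighbour)
    return dist


# Small made-up path graph a - b - c - d, plus an isolated node e.
graph = {
    "a": ["b"],
    "b": ["a", "c"],
    "c": ["b", "d"],
    "d": ["c"],
    "e": [],
}
print(distance_to_class(graph, ["a", "d"]))
```

With a single target node this reduces to the Erdos-number style distance mentioned in the question; with several targets each node gets the distance to whichever target is closest.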

Revolution Analytics Fall Webinar Series

September 14, 2011 | David Smith

We've lined up what we think is an amazing series of R-related webinars over the next couple of months. These free 30-60 minute webinars will cover a wide range of topics: big-data analysis in R with the RevoScaleR package, Hadoop and Netezza; introductions to R for SAS users and for ... [Read more...]

How to program MapReduce jobs in Hadoop with R

September 13, 2011 | David Smith

MapReduce is a powerful programming framework for efficiently processing very large amounts of data stored in the Hadoop distributed filesystem. But while several programming frameworks for Hadoop exist, few are tuned to the needs of data analysts who typically work in the R environment as opposed to general-purpose languages like ... [Read more...]

More sas7bdat progress

September 13, 2011 | BioStatMatt

The development version of the read.sas7bdat function (in the sas7bdat package) now reads field labels and formats. In addition, errors of the type "found subheaders where 1 expected" are now a thing of the past. These improvements are largely due to work by Clint Cummins. The function also ... [Read more...]

Backtesting a Simple Stock Trading Strategy

September 13, 2011 | Zach Mayer

Note: This post is NOT financial advice! This is just a fun way to explore some of the capabilities R has for importing and manipulating data. I recently read a post on ETF Prophet that explored an interesting stock trading strategy in Ex... [Read more...]

Speed up recursion in R 600-fold with Rcpp

September 12, 2011 | David Smith

Rcpp package co-author Dirk Eddelbuettel provides another case study in speeding up R code by rewriting repeatedly-called R code as inline C++ functions, using the classic Fibonacci recursion algorithm as an example. The speed gains here are impressive -- over 600x compared to native recursive R code -- but you ... [Read more...]

Why you should care about reproducible research

September 12, 2011 | David Smith

This week's Economist has an in-depth article on the consequences of failures in reproducible research, adding more detail to the report in the New York Times in July. Errors in data analysis by researchers at Duke University led to patients in clinical trials being assigned the wrong drug: Dr Potti and ... [Read more...]

Testing and significance

September 12, 2011 | xi'an

Julien Cornebise pointed me to this Guardian article that itself summarises the findings of a Nature Neuroscience article I cannot access. The core of the paper is that a large portion of comparative studies conclude that there is a significant difference between protocols when one protocol result is significantly different from zero ... [Read more...]