R in the Press

November 1, 2012
By
R in the Press

Here is the list of press reports and news about R Bits (A bog under The New York Times) R you ready for R? by Ashlee Vance Published: January 8, 2009, 1:52 PM The New York Times Data Analysts Captivated by R’s Power by Ashlee Vance Published: January 6, 2009  InfoWorld The BI battle isn’t

Read more »

Variable probability Bernoulli outcomes – Fast and Slow

November 1, 2012
By
Variable probability Bernoulli outcomes – Fast and Slow

I am working on a project that requires the generation of Bernoulli outcomes. Typically, I would go about this using the built in sample() function like so: This works great and is fast, even for large n. Problem is, I want to generate each sample with its own unique probability. Seems straight forward enough, I

Read more »

Correlation: Easy as 1-2-3?

November 1, 2012
By

I recently had a task to take a look at some assessment (audit) data. I was assuming, rather hoping for data with a normal distribution and thought it would be a quick case of Pearson correlation between two columns: "Duration" and "Score". Just conjecture at this point as I did not understand what the assessment process

Read more »

Upcoming R training by Hadley Wickham: SF Dec 3-4, DC Dec 10-11

November 1, 2012
By

(By Hadley Wickham) Hi all, I’d like to let you know about four R training courses that RStudio will be offering in December: * Effective data visualization (http://bit.ly/TY2ONI) Dec 3. San Francisco, CA * Reports and reproducible research (http://bit.ly/RsZmYr) Dec 4. San Francisco, CA * Advanced R programming (http://bit.ly/RvZDsd) Dec 10. Washington, DC * Package development (http://bit.ly/UhTIWz) Dec 11....

Read more »

New version of RStudio (v0.97)

November 1, 2012
By
New version of RStudio (v0.97)

Today a new version of RStudio (v0.97) is available for download from our website.  The principal focus of this release was creating comprehensive tools for R package development. We also implemented many other frequently requested enhancements including a new Vim editing mode and a much improved Find and Replace pane. Here’s a summary of what’s

Read more »

GGtutorial: Day 4 – More Colors

November 1, 2012
By
GGtutorial: Day 4 – More Colors

So far we’ve covered Melting and Casting data using the reshape() package and today we’re going to look at different ways of coloring and selecting palettes for plots. For these plots, we’re going to use the built in diamonds data...

Read more »

Why pictures are so important when modeling data?

October 31, 2012
By
Why pictures are so important when modeling data?

(bis repetita) Consider the following regression summary,Call: lm(formula = y1 ~ x1)   Coefficients: Estimate Std. Error t value Pr(>|t|) (Intercept) 3.0001 1.1247 2.667 0.02573 * x1 0.5001 0.1179 4.241 0.00217 **...

Read more »

Regime Detection

October 31, 2012
By
Regime Detection

Regime Detection comes handy when you are trying to decide which strategy to deploy. For example there are periods (regimes) when Trend Following strategies work better and there are periods when Mean Reversion strategies work better. Today I want to show you one way to detect market Regimes. To detect market Regimes, I will fit

Read more »

More data apps spawned by Sandy

October 31, 2012
By
More data apps spawned by Sandy

As the clean-up continues on the eastern seaboard, I wanted to follow up on Monday's post on tracking Hurricane Sandy with Open Data with a couple of other R-based data applications spawned by the storm. Josef Fruehwald created an R script to tap into local weather sensors to keep track of air pressure, wind speed and rainfall near his...

Read more »

draw figures in CMYK mode in R

October 31, 2012
By
draw figures in CMYK mode in R

Print publication usually ask to use CMYK (instead of RGB) color mode for figures (because not every color can be print out), while we usually use RGB for screen reading (because screen has larger range of color scale). Of course we can convert RGB to ...

Read more »

Using R with Routino to provide road network paths between random Tweets and an iconic Smiths landmark

October 31, 2012
By
Using R with Routino to provide road network paths between random Tweets and an iconic Smiths landmark

A couple of days ago I posted how you can go about installing Routino on OSX; and now I have just finished writing a quick post over on my Rpubs blog about how you go about using it from within R. I also wanted to know a bit more about how R and Twitter play

Read more »

Hierarchical linear models and lmer

October 31, 2012
By
Hierarchical linear models and lmer

Hierarchical linear models and lmer Article by Ben Ogorek Graphics by Bob Forrest Background My last article featured linear models with random slopes. For estimation and prediction, we used the lmer function from the lme4 package. Today we'll consider another level in the hierarchy, one...

Read more »

GGtutorial: Day 3 – Introduction to Colors

October 31, 2012
By
GGtutorial: Day 3 – Introduction to Colors

So, where does ggplot get its colors? If you’ve ever asked ggplot to color on the basis of a factor, you might have beeen surprised by the default color choices.  The fact is, ggplot colors factors on the basis of finding evenly spaced colors a...

Read more »

Using R with Routino to provide road network paths between random Tweets and an iconic Smiths landmark

October 31, 2012
By

A couple of days ago I posted how you can go about installing Routino on OSX; and now I have just finished writing a quick post over on my Rpubs blog about how you go about using it from within R. I also wanted to know a bit more about how R and Twitte...

Read more »

Fitting Distributions to Data with R

October 31, 2012
By

In “Fitting Distributions with R” Vito Ricci writes; “Fitting distributions consists in finding a mathematical function which represents in a good way a statistical variable. A statistician often is facing with this problem: he has some observations of a quantitative character and he wishes to test if those observations, being a sample of an unknown population, belong from a...

Read more »

Edmonton R User Group is going live

October 30, 2012
By
Edmonton R User Group is going live

Edmonton has made a name for itself as the City of Champions, The Gateway to the North and the most northern city in North America with a population of over 1

Read more »

Makefiles for R/LaTeX projects

October 30, 2012
By

Updated: 21 November 2012 Make is a marvellous tool used by programmers to build software, but it can be used for much more than that. I use make whenever I have a large project involving R files and LaTeX files, which means I use it for almost all of the papers I write, and almost of the consulting reports...

Read more »

Using R with Routino to provide road network paths between random Tweets and an iconic Smiths landmark

October 30, 2012
By

A couple of days ago I posted how you can go about installing Routino on OSX; and now I have just finished writing a quick post over on my Rpubs blog about how you go about using it from within R. I also wanted to know a bit more about how R and Twitte...

Read more »

R among TechCrunch’s 5 Trendy Open-Source Techs for Big Data

October 30, 2012
By

Tim Gasper (Product Manager at Big Data platform Infochimps) has an informative article at TechCrunch that provides an overview of five open-source technologies trending now for Big Data applications. They are: Storm and Kafka (for processing stream data) Drill and Dremel (for ad-hoc queries of big data) R (for data science with big data) Gremlin and Giraph (for graph...

Read more »

visit to ISU

October 30, 2012
By
visit to ISU

  A short visit to ISU but and therefore a busy and proftable day! About ten appointments in Snedecor Hall after a nice morning run, a highly attended Zyskind Lecture, and many interesting discussions all over the day: e.g., I had a great time discussing using null recurrent Markov chains for integral approximations with Krishna

Read more »

DINEOF (Data Interpolating Empirical Orthogonal Functions)

October 30, 2012
By
DINEOF (Data Interpolating Empirical Orthogonal Functions)

I finally got around to reproducing the DINEOF method (Beckers and Rixon, 2003) for optimizing EOF analysis on gappy data fields - it is especially useful for remote sensing data where cloud cover can result in large gaps in data. Their paper gives a nice overview of some of the various methods...

Read more »

Make your data famous!

October 30, 2012
By
Make your data famous!

I’m writing a book on R for O’Reilly, and I need interesting datasets for the examples. Any data that you provide will get you a mention in the book and in the publicity material, so it’s a great opportunity to publicise your work or your organisation. Datasets from any area or industry are suitable; the

Read more »

Speed up R by using a different BLAS implementation

October 30, 2012
By
Speed up R by using a different BLAS implementation

It is no news that R’s default BLAS is much slower that other available BLAS implementations. In A trick to speed up R matrix calculation/ Yu-Sung Su recommends using the ATLAS BLAS which is available on CRAN. When I learned about the possible speed-up  a while ago I tried several BLAS libraries and I found that GotoBLAS2 was

Read more »

Happy SAP HANA Friends

October 30, 2012
By

This is a presentation that I did on the Community Theatre at SAP TechEd Las Vegas 2012. Happy sap hana friends from Alvaro Tejada In this presentation, Blagbert helps his friends Nerdbert to set up his first SAP HANA project using different technol...

Read more »

On weather forecasts, Nate Silver, and the politicization of statistical illiteracy

October 30, 2012
By
On weather forecasts, Nate Silver, and the politicization of statistical illiteracy

As you know, we have a thing for statistical literacy here at Simply Stats. So of course this column over at Politico got our attention (via Chris V. and others). The column is an attack on Nate Silver, who has … Continue reading →

Read more »

On weather forecasts, Nate Silver, and the politicization of statistical illiteracy

October 30, 2012
By
On weather forecasts, Nate Silver, and the politicization of statistical illiteracy

As you know, we have a thing for statistical literacy here at Simply Stats. So of course this column over at Politico got our attention (via Chris V. and others). The column is an attack on Nate Silver, who has a blog where he tries to predict the outc...

Read more »

analyze the national health and nutrition examination survey (nhanes) with r

October 30, 2012
By

nhanes is this fascinating survey where doctors and dentists accompany survey interviewers in a little mobile medical center that drives around the country.  while the survey folks are interviewing people, the medical professionals administer labo...

Read more »

"Advanced R" Course – November 15-16, 2012

October 30, 2012
By

This is the last post about the course. As places are limited, please register as soon as possible! Milano R net, in collaboration with Quantide, organizes "Advanced R" Course November 15-16, 2012 Course description This course is designed for those … Continue reading →

Read more »

Introducing R and Biostatistics to first year LCG students (2012 version)

October 30, 2012
By
Introducing R and Biostatistics to first year LCG students (2012 version)

On Friday November 9th I’ll be giving a talk to the first year students from the Undergraduate Program on Genomic Sciences (LCG in Spanish) during their “Seminar 1: Introduction to Bioinformatics” course. It’s just like I did a year ago as I documented in my post Introducing Biostatistics to first year LCG students. Well, this time I’ll change things...

Read more »

Sponsors