Analytics with SAP and R (Windows version)

Analytics with SAP and R (Windows version)

My good friend and programming guru Piers Harding wrote a blog called Analytics with SAP and R where he showed us how to link the wonderful worlds of R and SAP. Yes...SAP...not SAP HANA...but the good old NetWeaver...Piers build the RSAP extension usin...

Read more »

When the going gets tough…

June 22, 2012
By
When the going gets tough…

Getting closer to my personal Euro2012 derby: England v Italy. I find amusing that both sets of media think that their respective team have been gifted a good tie. The English are very happy to have avoided Spain, while the Italians don't mind not...

Read more »

eoda publishes interactiveGGPLOT – interactive graphics with ggplot2

June 22, 2012
By

One of Rs great strengths compared to other statistic solutions or programming languages ​​is the amount of possibilities for creating well-designed publication-quality plots. Almost all plot types can be created with any amount of fine tuning. R works on small data sets as well as on big data. In addition to Rs base-graphics various add-on

Read more »

Two new, important books on R

June 22, 2012
By
Two new, important books on R

Two books were recently published that are sure to help R grow even faster. R has a reputation, partially deserved, for being hard to learn.  These books will help.  The first makes learning easier, the second can make learning less necessary for initiates. I have not yet touched either book. R for Dummies The authors … Continue reading...

Read more »

Video: Getting staRted with R: An accelerated primer by Lyndon Walker – Melbourne R Users

June 22, 2012
By
Video: Getting staRted with R: An accelerated primer by Lyndon Walker – Melbourne R Users

This post shares the video from a talk presented on June 20 2012 by Dr Lyndon Walker (see Meetup page). The talk was titled “Getting staRted with R: An accelerated primer”. To quote the outline of the talk : R … Continue reading →

Read more »

Nonlinear systems

June 21, 2012
By
Nonlinear systems

There is a long standing debate if financial systems are truly random or contain some structure. From the study of non-linear dynamical systems and chaos one finds it is possible that even perfectly deterministic systems can appear to be random. … Continue reading →

Read more »

Learning a new language

June 21, 2012
By
Learning a new language

It had been a very long time since I’d tried to learn a new programming language. I started C in 1987, S in 1992, and Perl in 1997, but nothing really new in the subsequent 15 years. A friend now has me doing D, wanting to find time to learn ruby, and, most recently, playing

Read more »

Background to my book project “Empirical Software Engineering with R”

June 21, 2012
By

This post provides background information that can be referenced by future posts. For the last 18 months I have been working in fits and starts on a book that has the working title “Empirical Software Engineering with R”. The idea is to provide broad coverage of software engineering issues from an empirical perspective (i.e., the

Read more »

Confidence intervals with tiers: functions for between-subjects (independent measures) ANOVA

June 21, 2012
By
Confidence intervals with tiers: functions for between-subjects (independent measures) ANOVA

In a previous post I showed how to plot difference-adjusted CIs for between-subjects (independent measures) ANOVA designs (see here). The rationale behind this kind of graphical display is introduced in Chapter 3 of Serious stats (and summarized in my earlier blog post). In a between-subjects – or in indeed in a within-subjects (repeated measures) – design

Read more »

My first "ChemoSpec" spectra

June 21, 2012
By
My first "ChemoSpec" spectra

I’ve been this week trying to import some spectra to ChemoSpec (R Package), at the beginning I had problems generating the “csv”  files, and the function getManyCsv did not recognize the files. I did not have properly configured the regional...

Read more »

Plotting differentially methylated bases on an ideogram

June 21, 2012
By

(This article was first published on Recipes, scripts and genomics, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on his blog: Recipes, scripts and genomics. R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL,...

Read more »

R and the web (for beginners), Part I: How is the local nuclear plant doing?

June 21, 2012
By
R and the web (for beginners), Part I: How is the local nuclear plant doing?

One of the things I especially like about R is its ability to easily access and process data from the web. If you are new to R, or never have used it to access data from the Internet, here is the first part of a little series of posts with examples to ...

Read more »

An example on sentiment analysis with R

June 21, 2012
By
An example on sentiment analysis with R

by Yanchang Zhao, RDataMining.com There is a nice example on sentiment analysis with R at <http://viksalgorithms.blogspot.com.au/2012/06/tracking-us-sentiments-over-time-in.html>. In the example, the Wikileaks cable corpus is analyzed to track US sentiments of other countries and their presidents over time. The example describes … Continue reading →

Read more »

FDA: R OK for drug trials

June 21, 2012
By
FDA: R OK for drug trials

In a poster (PDF) presented at the UseR 2012 conference, FDA biostatistician Jae Brodsky reiterated the FDA policy regarding software used to prepare submissions for drug approvals with clinical trials: Sponsors may use R in their submissions. The FDA does not endorse or require any particular software to be used for clinical trial submissions, and there are no regulations...

Read more »

Normalising data within groups

June 21, 2012
By
Normalising data within groups

Occasionally it proves useful to normalise data. By this I mean to scale it between zero and one. Admittedly, most people frown of this but there are papers out there with this method in use*. How do we go about this? Its a very simple formula to calculate: y' = y/sqrt(sum(y^2)) So we square all

Read more »

The stimuli-as-fixed-effect fallacy

June 21, 2012
By

Neuroskeptic has just blogged on a new paper by Judd, Westfall and Kenny on Treating stimuli as a random factor in social psychology: A new and comprehensive solution to a pervasive but largely ignored problem. I can't access the original pap...

Read more »

Solving Big Problems with Oracle R Enterprise, Part I

June 21, 2012
By
Solving Big Problems with Oracle R Enterprise, Part I

Abstract: This blog post will show how we used Oracle R Enterprise to tackle a customer’s big calculation problem across a big data set. Overview: Databases are great for managing large amounts of data in a central place with rigorous enterprise-level controls.  R is great for doing advanced computations.  Sometimes you need to...

Read more »

Experimental Design: Problem Set

June 21, 2012
By
Experimental Design: Problem Set

QUESTIONSThe tensile strength of Portland cement is being studied. Four different mixing techniques can be used economically. The following data have been collected: MixingTechniques Tensile Strength (lb/in­­2) ...

Read more »

The Great Julia RNG Refactor

June 21, 2012
By

Many readers of this blog will know that I’m a big fan of Bayesian methods, in large part because automated inference tools like JAGS allow modelers to focus on the types of structure they want to extract from data rather than worry about the algorithmic details of how they will fit their models to data.

Read more »

Will Tiger Woods catch Jack Nicklaus? And a discussion of the virtues of using continuous data even if your goal is discrete prediction

June 21, 2012
By

I know next to nothing about golf. My mini-golf scores typically approach the maximum of 7 per hole, and I’ve never actually played macro-golf. I did publish a paper on golf once (A Probability Model for Golf Putting, with Deb Nolan), but it’s not so rare for people to publish papers on topics they know The post Will...

Read more »

To R or not to R, and other events

June 21, 2012
By
To R or not to R, and other events

New events To R, or not to R, that is the question The Statistical Computing Section of the Royal Statistical Society presents a one-day event on 2012 June 29. The details of the day.  See in particular the abstract for “Teaching statistics: a pain in the R?” by Andy Field — it involves a sheepdog … Continue reading...

Read more »

Body Weight in the United States – Part 3, "Contributing Factors"

June 20, 2012
By
Body Weight in the United States – Part 3, "Contributing Factors"

Carbs In Part 2 of this series, micro-nutrients were cited as a non-factor for weight gain. This is not the case with macro-nutrients (carbohydrates, fats, proteins, water). While fats, proteins and water are essential (without them you could no...

Read more »

R Workshop: Introducing Slidify – HTML5 slides from R markdown

June 20, 2012
By
R Workshop: Introducing Slidify – HTML5 slides from R markdown

Thursday, June 28th, 2012  19h. <–  new evening time! Tomson House: 650 McTavish, H3A 1Y2, Montréal, QC <– new social setting! guRu: Ramnath Vaidyanathan (McGill University) Ramnath Vaidyanathan will introduce the group to slidify, his brand new R package. From the slidify website: “The objective of slidify is to make it easy to create reproducible

Read more »

UseR 2012 highlights

June 20, 2012
By
UseR 2012 highlights

The eighth annual R user conference, UseR! 2012, has come and gone — and what an event it was! I've been to five useR! conferences so far, and each one improves upon the last. This year's conference at Vanderbilt was the best so far: an outstanding location (my first visit to Nashville, a great city), excellent facilities (the lecture...

Read more »

Simulation and resampling

June 20, 2012
By
Simulation and resampling

In financial applications one frequently comes across the need to draw samples according to an assumed distribution. This could be because one wants to simulate stock prices for a Monte Carlo simulation, to price an option payout or to generate … Continue reading →

Read more »

The R-Podcast Episode 8: Visualization with ggplot2

June 20, 2012
By

I’m happy to present this jam-packed episode of the R-Podcast dedicated to using the ggplot2 package for visualization. This episode will have a companion screencast released in the next few days. I use data from the Hockey Summary Project to demonstrate how to create a series of boxplots of NHL regular season attendance for each

Read more »

Color Palettes in RGB Space

June 20, 2012
By
Color Palettes in RGB Space

Introduction I've recently been interested in how to communicate information using color. I don't know much about the field of Color Theory, but it's an interesting topic to me. The selection of color palettes, in particular, has been a topic I've been faced with lately. I downloaded 18 different sequential color palettes from Cynthia Brewer's

Read more »

Euro 2012: End of Group Stage

June 20, 2012
By
Euro 2012: End of Group Stage

Time for an update of the plots. Here are the teams still left in the competition. This is the group stratification. Finally, the busy plot.

Read more »

Factor Attribution

June 19, 2012
By
Factor Attribution

I came across a very descriptive visualization of the Factor Attribution that I will replicate today. There is the Three Factor Rolling Regression Viewer at the mas financial tools web site that performs rolling window Factor Analysis of the “three-factor model” of Fama and French. The factor returns are available from the Kenneth R French:

Read more »