Interval Estimation of the Population Mean

June 14, 2013
By

Interval estimation of the population mean can be computed from the functions of the following R packages:stats - contains the t.testTeachingDemos - contains the z.testBSDA - contains the zsum.test and tsum.testThe t.test of the stats package is a stud...

Read more »

Modeling an Infant’s Feeding Schedule with Periodic Smoothing Splines

June 13, 2013
By
Modeling an Infant’s Feeding Schedule with Periodic Smoothing Splines

While on paternity leave I had an opportunity to test out periodic smoothing splines (within the framework of generalized additive models) on an interesting time-series-- an infant's feeding schedule. load / format data and fit GAMs

Read more »

Practicing static typing in R: Prime directive on trusting our functions with object oriented programming

June 13, 2013
By

The creator of S language which R is derived from John Chambers said in one of his books  Software for data analysis programming with R: ...This places an obligation on all creators of software to program in such away that the computations ca...

Read more »

Win Your Fantasy Football Auction Draft: Calculate the Optimal Players to Draft with this Shiny App in R

June 13, 2013
By

In this post, I use a Shiny app in R to determine the best possible players to pick in a fantasy football auction draft.  The app takes projections from FantasyPros, The post Win Your Fantasy Football Auction Draft: Calculate the Optimal Players to Draft with this Shiny App in R appeared first on Fantasy Football Analytics.

Read more »

How big data and statistical modeling are changing video games

June 13, 2013
By

Bill Grosso presented a fascinating webinar about the video gaming industry today, Knowing How People are Playing Your Game Gives You the Winning Hand. He described how over the past three years, game studios have switched from viewing analytics as a primarily descriptive tool to deploying modern data collection practices, machine learning toolkits, and statistical methods to gain a...

Read more »

ANOVA and Tukey’s test on R

June 13, 2013
By
ANOVA and Tukey’s test on R

OBS: This is a full translation of a portuguese version. In many different types of experiments, with one or more treatments, one of the most widely used statistical methods is analysis of variance or simply ANOVA . The simplest ANOVA can be called “one way” or “single-classification” and involves the analysis of data sampled from The post ANOVA...

Read more »

Big in Japan

June 13, 2013
By
Big in Japan

Inspired by this post on R-bloggers, I decided to check how BCEA was doing. Unfortunately, it does not feature in the top 100 most downloaded R packages. However, I think it's doing well $-$ considering the book (which is the main medium of advertising of the package) has...

Read more »

Getting started with twitteR in R

June 13, 2013
By
Getting started with twitteR in R

I have asked by a few people lately to help walk them through using twitter API in R, and I’ve always just directed them to the blog post I wrote last year during the US presidential debates not knowing that Twitter had changed a few things. Having my interest peaked through a potential project at

Read more »

Top 100 R packages for 2013 (Jan-May)!

June 13, 2013
By
Top 100 R packages for 2013 (Jan-May)!

(This article was first published on R-statistics blog » RR-statistics blog, and kindly contributed to R-bloggers) What are the top 100 (most downloaded) R packages in 2013? Thanks to the recent release of RStudio of their “0-cloud” CRAN log files (but without including downloads from the primary CRAN mirror or any of the 88 other CRAN mirrors), we can now answer this question...

Read more »

Google Prediction API example

June 13, 2013
By

Google's prediction API offers a blackbox way of doing some prediction. They had advertised an R package, but it doesn't seem to work with the new version of the prediction API or their OAuth2 authentication mechanism. So, in an effort to check out the...

Read more »

type = “what?”

June 13, 2013
By
type = “what?”

One great thing about R is that has a wide diversity of packages written by many different people of many different viewpoints on how software should be designed. However, this does tend to bite us periodically.  When I teach newcomers about R and...

Read more »

Serious stats: using multilevel models to get accurate inferences for repeated measures ANOVA

June 13, 2013
By

This article from my other blog may be of interest to readers of this blog: http://seriousstats.wordpress.com/2013/04/18/using-multilevel-models-to-get-accurate-inferences-for-repeated-measures-anova-designs/

Read more »

Le Monde puzzle [#824]

June 13, 2013
By
Le Monde puzzle [#824]

A rather dull puzzle this week: Show that, for any integer y, (√3-1)2y+(√3+1)2y is an integer multiple of a power of two. I just have to apply Newton’s binomial theorem to obtain the result. What’s the point?! Filed under: Books, Kids, R Tagged: Binomial theorem, Isaac Newton, Le Monde, mathematical puzzle

Read more »

More code and images from “Displaying time series, spatial and space-time data with R”

More code and images from “Displaying time series, spatial and space-time data with R”

My book continues growing. I have recently completed the first version of the Spatio-Temporal visualization chapters. Moreover, a significant part …Continuar leyendo »

Read more »

New version of solaR

New version of solaR

I have updated my package solaR. This package provides calculation methods of solar radiation and performance of photovoltaic systems from …Continuar leyendo »

Read more »

Advanced settings in ggplot2 (R)

June 13, 2013
By
Advanced settings in ggplot2 (R)

Here is a post with some information to control the layout of your graph produced by the ggplot library: http://rpubs.com/Lionel/6557 Enjoy!Filed under: R and Stat Tagged: ggplot, R

Read more »

inline 0.3.12

June 12, 2013
By

Another maintenance release of inline is now on CRAN and in already included in Debian. This release was triggered by a change in the development version of R which removed an argument to package.skeleton(). The complete NEWS entry is below. Chan...

Read more »

The null model for age effects with overdispersed infection

June 12, 2013
By
The null model for age effects with overdispersed infection

How does overdispersion of infections affect the behavior of the multiple-infection model? I redefine the model to account for overdispersion, assuming the same overdispersion occurs in both age classes. The parameter varies inversely with the degree of overdispersion. Again, the classes are demographically identical, and infection affects mortality but not growth: \[\begin{aligned} \frac{dJ}{dt}...

Read more »

Mini-tutorial for Quandl: How to access financial data with R

June 12, 2013
By
Mini-tutorial for Quandl: How to access financial data with R

by Joseph Rickert Quandl.com, the open source website for financial data, made rapid progress earlier this year in becoming an R friendly source for financial time series data. Tammer Kamel, Quandl’s founder introduced the site on Revolutions blog in late February as a “search engine” for numerical data and explained how Quandl’s “Q-bot” can take data from almost any...

Read more »

cran2deb4ubuntu Updated for R 3.0.1 and Ubuntu 13.04

June 12, 2013
By

It has taken a long time, but cran2deb4ubuntu has been updated for R 3.0.1. Over 1000 R packages are available as .deb files (with dependicies) for Ubutnu 13.04 (raring), 12.10 (quantal) and 12.04 (precise). These packages can be found at the c2d4u PPA. Instructions on how to install the PPA can be found on this...

Read more »

Cluster NHL Teams Based on 2012/13 Regular Season Performance

June 12, 2013
By
Cluster NHL Teams Based on 2012/13 Regular Season Performance

Since tonight kicks off Game 1 of the Stanley Cup Finals, I thought it would be fun to do a very quick and dirty cluster analysis of the league based on regular season performance. Tonight, the Chicago Blackhawks square off against my hometown team, the Boston Bruins.  Even though it was a lockout-shortened season, the

Read more »

Twitter Twitter on the Web, Who is the Most Popular of All? Interactively Determining Popularity of Two Entitites on Twitter

June 12, 2013
By
Twitter Twitter on the Web, Who is the Most Popular of All? Interactively Determining Popularity of Two Entitites on Twitter

Code updated based on feedback (see list of changes at the very end)Okay, that was a take on the mirror mirror on the wall quote from Snow White. This continues my saga of learning from the superb work done by the R-community and building on their...

Read more »

The Reorderable Data Matrix and the Promise of Pattern Discovery

June 12, 2013
By
The Reorderable Data Matrix and the Promise of Pattern Discovery

We typically start with the data matrix, a rectangular array of rows and columns.  If we type its name on the R command line, it will show itself.  But the data matrix is hard to read, even when there are not many rows or columns.  The heat map is a visual alternative.  All you need is the R function...

Read more »

Data imputation I

June 12, 2013
By

I recently entered kaggle titanic learning competition for fun and to see where my out of the box utilization of random forest would rank me (303 out of 5,882). It was interesting to see that much of the scoring differentiation came from score imputation, that is filling missing values based on other data. For example, we might have

Read more »

Using Quandl in R

June 12, 2013
By
Using Quandl in R

Image by Jan Zander Our mantra here at Quandl is making data easy to find and easy to use. Following that goal we (and subsequently the community) have created packages that integrate Quandl’s API into a number of software platforms. Today we’ll take a look at R. R is a free statistical computing language created

Read more »

More fun with data frames

June 12, 2013
By
More fun with data frames

Data frames are such a straightforward and essential element of R that it’s easy to lose sight of some of their peculiarities. Last week, I developed some code which would tear apart some data frames and create new ones based on columns specified by the user. This would allow me to dynamically create new data

Read more »

R to Oracle Database Connectivity: Use ROracle for both Performance and Scalability

June 12, 2013
By
R to Oracle Database Connectivity: Use ROracle for both Performance and Scalability

Normal 0 false false false EN-US X-NONE X-NONE ...

Read more »

Plotting average read and write operation size by ASM disk for Oracle

June 12, 2013
By
Plotting average read and write operation size by ASM disk for Oracle

  Throughput, throughput, throughput – for many databases, this is the performance measure of importance.  When you are working with a fixed number of IOPS but see mixed workload types, system health can be assessed through the average read and…Read more ›

Read more »

Introducing GTrendsR

June 12, 2013
By

Just another R blog has beed added to r-bloggers!In a paper, to be soon published in Conservation Biology and entitle Googling trends in conservation biology, we developed a package named GTrendsR that provides an interface for retrieving and displaying the information returned online...

Read more »

Sponsors