Blog Archives

Data Mining the California Solar Statistics with R: Part IV

May 18, 2015
By
Data Mining the California Solar Statistics with R: Part IV

Predicting the residential solar power installations by county by quarter in CA from 2009-2013 So far I have gathered three data sets and combined them into one which I will now use to try to predict the number of solar installations by county by quarter in CA from 2009-2013. The three data sets I am

Read more »

Data Mining the California Solar Statistics with R: Part III

May 11, 2015
By
Data Mining the California Solar Statistics with R: Part III

Data Mining the California Solar Statistics with R: Part III Today I want to combine the California solar statistics with information about the annual solar insolation in each county as well as information about the population and median income. These can then be used as predictors in the models I'll build in the next post.

Read more »

Data Mining the California Solar Statistics with R: Part II

May 4, 2015
By
Data Mining the California Solar Statistics with R: Part II

Data Mining the California Solar Statistics with R: Part II In today's post I'll be working some more with the working data set from California Solar Statistics. Last time I imported the data, cleaned it up a bit, grouped it by county and year, and made some plots to look at how residential solar installations

Read more »

Data Mining the California Solar Statistics with R: Part I

April 24, 2015
By
Data Mining the California Solar Statistics with R: Part I

Data Mining the California Solar Statistics with R: Part I Intro Today I’m taking a look at the data set available from California Solar Statistics availalbe from https://www.californiasolarstatistics.ca.gov/. This data set lists all the applications for state incentives for both residential and commercial systems, it contains information about the PV (Photovoltaic) system size, location, cost,

Read more »

Automatic drug utilization reports with R and ggplot2

September 18, 2012
By
Automatic drug utilization reports with R and ggplot2

This program takes a data set of drug utilisation of 4 fictional drugs in 10 fictional hospitals and plots each time-series with a locally weighted regression (Lowess) trend line. It also places an time-series trend of the usage for each … Continue reading →

Read more »

R script to manipulate health data

June 3, 2012
By

Here is the code that fixed up the World Bank data export for use in Tableau. The databank spits out everything in an untidy format for grouping and aggregating. The reshape2 and plyr packages  make it easy to manipulate the whole set … Continue reading →

Read more »

R Tutorial Series: Two-Way ANOVA with Pairwise Comparisons

January 31, 2011
By
R Tutorial Series: Two-Way ANOVA with Pairwise Comparisons

By extending our one-way ANOVA procedure, we can test the pairwise comparisons between the levels of several independent variables. This tutorial will demonstrate how to conduct pairwise comparisons in a two-way ANOVA. Tutorial FilesBefore we begin, yo...

Read more »

Search R-bloggers

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)