566 search results for "sql"

Big data (useR! 2011)

August 18, 2011
By
Big data (useR! 2011)

Unfortunatley, I missed the first and last talks. My notes from a session on Thursday morning J. Demmler – Challenges of working with a large database of routinely collected health data The SAIL data bank holds over 1.9 billion (anonymous) entries. To use the data for research, they need to ensure that proper data security is

Read more »

Kaleidoscope Ic (useR! 2011)

August 16, 2011
By
Kaleidoscope Ic (useR! 2011)

These are my rough notes on the Kaleidoscope Ic session. David Smith – The R Ecosystem (useR! 2011) David Smith works for Revolution Analytics. Quick overview of the R project – useR, r-journal, and r-forge. Social media starting to play a part in R – Google+, twitter, stackoverflow, and the traditional R mailing list. The

Read more »

GDAT 2011 in Review

August 13, 2011
By
GDAT 2011 in Review

As usual, the Guerrilla Data Analysis Techniques (GDAT) class was a total blast. Motivated students always guarantee that. It would really help our scheduling, however, if people didn't wait until the last nanosecond to register for the class. But give...

Read more »

CHCN: Canadian Historical Climate Network

August 4, 2011
By
CHCN: Canadian Historical Climate Network

A reader asked a question about data from   environment canada.  He wanted to know if that data could somehow be integrated into the RGhcnV3 package.  That turned out to be a bit more challenging that I expected.  In short order I’d found a couple other people who had done something similar.  DrJ of course was

Read more »

Q-Q Plots for Multi-modal Performance Data

August 3, 2011
By
Q-Q Plots for Multi-modal Performance Data

I'm in the process of putting together some slides on how to apply Quantile-Quantile plots to performance data. Q-Q plots are a handy tool for visually inspecting how well your data matches a known probability distribution (prob dsn). If the match is g...

Read more »

WordPress WordCloud with R

August 3, 2011
By
WordPress WordCloud with R

These days one can frequently read about wordclouds created with R, initiated by the release of the wordcloud package by Ian Fellows on July 23rd. So here I am to put in my two cents. I thought about creating a wordcloud of a complete blog history, so I build a script that connects to a

Read more »

Merging Two Different Datasets Containing a Common Column With R and R-Studio

August 2, 2011
By
Merging Two Different Datasets Containing a Common Column With R and R-Studio

Another way for the database challenged (such as myself!) for merging two datasets that share at least one common column… This recipe using the cross-platform stats analysis package, R. I use R via the R-Studio client, which provides an IDE wrapper around the R environment. So for example, here’s how to merge a couple of

Read more »

Your Data is Never the Right Shape

July 31, 2011
By
Your Data is Never the Right Shape

One of the recurring frustrations in data analytics is that your data is never in the right shape. Worst case: you are not aware of this and every step you attempt is more expensive, less reliable and less informative than you would want. Best case: you notice this and have the tools to reshape your Related posts:

Read more »

Shorting Mebane Faber

July 19, 2011
By
Shorting Mebane Faber

Although I do not personally know Mebane Faber, I know enough that I do not want to short him. However, I thought it would be insightful to see how the short side of his “A Quantitative Approach To Tactical Asset Allocation” might look.  Once ...

Read more »

Drawdown Control Can Also Determine Ending Wealth

July 11, 2011
By
Drawdown Control Can Also Determine Ending Wealth

As an extension to yesterday’s post Just Arriving is Not Enough, I wanted to show how minimizing drawdown is a much better technique to help control comfort and potentially increase ending wealth.  CHTTX was one of the best performers of the fou...

Read more »