591 search results for "sql"

Importing Data Into R from Different Sources

December 6, 2012

I have found that I get data from many different sources. These sources range from simple .csv files to more complex relational databases to structured XML or JSON files. I have compiled the different approaches that one can use to easily access these datasets. Local Column Delimited Files: This is probably the most common and

R FAQs for the fresh starters

December 4, 2012

R, which was largely confined to the academic world, has started gaining traction in businesses as well. At least that is what I am witnessing among my colleagues. A lot of people have started experimenting with R, choosing the path to enlightenment. ...

analyze the basic stand alone medicare claims public use files (bsapufs) with r and monetdb

December 3, 2012

the centers for medicare and medicaid services (cms) took the plunge.  the famous medicare 5% sample has been released to the public, free of charge.  jfyi - medicare is the u.s. government program that provides health insurance to 50 million...

Quick Shiny Demo – Exploring NHS Winter Sit Rep Data

November 28, 2012

Having spent a chunk of the weekend and a piece of yesterday trying to pull NHS Winter sitrep data into some sort of shape in Scraperwiki (described, in part, here: When Machine Readable Data Still Causes “Issues” – Wrangling Dates…), I couldn’t help myself last night and had a quick go at using RStudio’s

why and how to install monetdb with r on windows

November 26, 2012

warning: the instructions below are obsolete.  please check this page for the latest version.  <3 anthony. why: a speed test of three sql queries on sixty-seven million records using my personal computer -- # calculate the sum, mean, median, a...

Make a Graphical Figure of your SEM model in OpenMx

November 19, 2012

In this post, I made an SEM model and showed the results in a table. It’s a great feature of SEM that you can sketch your ideas about how the world works, and being able to get such a sketch back out of OpenMx is very helpful. Importantly, a figure can help readers understand what you’ve done, and it is a...

Bottom-up creation of data-driven capabilities: automate your work

November 15, 2012

My previous post on how to transform an organization into a more data-driven version of itself made a pretty big assumption that often doesn’t hold true. I assumed that people in the organization wanted their company or agency to become more data-driven. I think almost everyone says they want that if asked. I even think

Big Data ETL and Big Data Analysis

November 14, 2012

I was at Strata New York 2012 last month. Great conference! Thanks to O'Reilly Media for assembling the industry leaders and running it well. I understand it was too crowded for some of my out-of-town friends. Stepping out to the streets of mid-town Manhat...

SAP CodeJam Montreal

November 13, 2012

Thanks to an initiative of Krista Elkin, Jonathan Druker and myself (with a lot of support from Craig Cmehil and Helena Losada), SAP CodeJam Montreal is going live on Thursday, December 13, 2012 from 3 to 9 pm in the SAP Labs Montreal offices. This is...

Benchmarking bigglm

November 13, 2012

By Joseph Rickert. In a recent blog post, David Smith reported on a talk that Steve Yun and I gave at STRATA in NYC about building and benchmarking Poisson GLM models on various platforms. The results presented showed that the rxGlm function from Revolution Analytics’ RevoScaleR package running on a five-node cluster outperformed a MapReduce/Hadoop implementation...
