Blog Archives

analyze the united states decennial census public use microdata sample (pums) with r and monetdb

July 1, 2013
By

during his tenure as secretary of state, thomas jefferson oversaw the first american census way back in 1790.  some of my countrymen express pride that we're the oldest democracy, but my heart swells with the knowledge that we've got the world's o...

Read more »

analyze the pesquisa de orcamentos familiares (pof) with r

June 17, 2013
By

for the unlucky among us born without a portuguese mother tongue, the pesquisa de orcamentos familiares (pof) translates to survey of household budgets.  this data set captures brazilian family consumption habits, allocation of expenses, and incom...

Read more »

analyze the new york city housing and vacancy survey (nychvs) with r

May 19, 2013
By

for those interested in the real estate and rental markets of the big apple, the census bureau's nyc housing and vacancy survey might be your key to the city.  if you care about how many new york residents live more than one person per room (a lot...

Read more »

analyze the social security administration public use microdata files (ssapumf) with r

May 5, 2013
By

the social security administration (ssa) must be overflowing with quiet heroes, because their public-use microdata files are as inconspicuous as they are thorough.  sure, ssa publishes enough great statistical research of their own that outside re...

Read more »

analyze the medical large claims experience study (mlces) with r

April 21, 2013
By

not a survey, not even remotely current, the society of actuaries' medical large claims experience study (mlces) might be the best private health insurance claims data available to the public.  this data should be used to calibrate other data sets...

Read more »

analyze the pesquisa nacional por amostra de domicilios (pnad) with r

April 7, 2013
By

think of the pesquisa nacional por amostra de domicilios (pnad) as the brazilian census for off-years - the ones that don't end in zero.  the principal household survey for the nation of brazil, pnad measures general education, labor, income, and ...

Read more »

column-store R or: how i learned to stop worrying and love monetdb

March 18, 2013
By

"Combining R's sophisticated calculations and MonetDB's excellent data access performance is a no-brainer. One gets the best of two (open source) worlds with minimal hassle." - Dr. Hannes Mühleisen"oh wow that was fast like a cheetah with a jetpack or something" - anthony damicowhy try monetdb + ra speed test of four analysis commands on sixty-seven million...

Read more »

analyze health professional shortage areas (hpsa) with r

February 11, 2013
By

a health professional shortage area (hpsa) is a geographic area, population group, or health care facility that has been designated by the united states government as having an insufficient supply of medical providers, based on certain provider-to-popu...

Read more »

analyze the survey of income and program participation (sipp) with r

February 4, 2013
By

if the census bureau's budget was gutted and only one complex sample survey survived, pray it's the survey of income and program participation (sipp).  it's giant.  it's rich with variables.  it's monthly.  it follows households over three, four, now five year panels.  the congressional budget office uses it for their health insurance simulation.  analysts read that sipp has...

Read more »

analyze the survey of consumer finances (scf) with r

January 28, 2013
By

the survey of consumer finances (scf) tracks the wealth of american families.  every three years, more than five thousand households answer a battery of questions about income, net worth, credit card debt, pensions, mortgages, even the lease on th...

Read more »