Blog Archives

analyze the social security administration public use microdata files (ssapumf) with r

May 5, 2013

the social security administration (ssa) must be overflowing with quiet heroes, because their public-use microdata files are as inconspicuous as they are thorough.  sure, ssa publishes enough great statistical research of their own that outside re...

Read more »

analyze the medical large claims experience study (mlces) with r

April 21, 2013

not a survey, not even remotely current, the society of actuaries' medical large claims experience study (mlces) might be the best private health insurance claims data available to the public.  this data should be used to calibrate other data sets...

Read more »

analyze the pesquisa nacional por amostra de domicilios (pnad) with r

April 7, 2013

think of the pesquisa nacional por amostra de domicilios (pnad) as the brazilian census for off-years - the ones that don't end in zero.  the principal household survey for the nation of brazil, pnad measures general education, labor, income, and ...

Read more »

column-store R or: how i learned to stop worrying and love monetdb

March 18, 2013

"Combining R's sophisticated calculations and MonetDB's excellent data access performance is a no-brainer. One gets the best of two (open source) worlds with minimal hassle." - Dr. Hannes Mühleisen
"oh wow that was fast like a cheetah with a jetpack or something" - anthony damico
why try monetdb + r
a speed test of four analysis commands on sixty-seven million...

Read more »

analyze health professional shortage areas (hpsa) with r

February 11, 2013

a health professional shortage area (hpsa) is a geographic area, population group, or health care facility that has been designated by the united states government as having an insufficient supply of medical providers, based on certain provider-to-popu...

Read more »

analyze the survey of income and program participation (sipp) with r

February 4, 2013

if the census bureau's budget were gutted and only one complex sample survey survived, pray it's the survey of income and program participation (sipp).  it's giant.  it's rich with variables.  it's monthly.  it follows households over three-, four-, and now five-year panels.  the congressional budget office uses it for their health insurance simulation.  analysts read that sipp has...

Read more »

analyze the survey of consumer finances (scf) with r

January 28, 2013

the survey of consumer finances (scf) tracks the wealth of american families.  every three years, more than five thousand households answer a battery of questions about income, net worth, credit card debt, pensions, mortgages, even the lease on th...

Read more »

analyze the health and retirement study (hrs) with r

January 14, 2013

the hrs is the one and only longitudinal survey of american seniors.  with a panel starting its third decade, the current pool of respondents includes older folks who have been interviewed every two years as far back as 1992.  unlike cross-se...

Read more »

analyze the medical expenditure panel survey (meps) with r

January 7, 2013

the meps household component leads the pack for examining individual-level medical expenditures by payor and type of service.  total expenditures captured by the survey tend to be low, but unbiased across the board and can be adjusted to match the...

Read more »

analyze the behavioral risk factor surveillance system (brfss) with r and monetdb

December 17, 2012

experimental.  the behavioral risk factor surveillance system (brfss) aggregates behavioral health data from 400,000 adults via telephone every year.  it's um *clears throat* the largest telephone survey in the world and it's gotta lotta uses...

Read more »