534 search results for "sql"

analyze the health and retirement study (hrs) with r

January 14, 2013

the hrs is the one and only longitudinal survey of american seniors.  with a panel starting its third decade, the current pool of respondents includes older folks who have been interviewed every two years as far back as 1992.  unlike cross-se...

R for actuarial science

January 10, 2013

As mentioned in the Appendix of Modern Actuarial Risk Theory, “R (and S) is the ‘lingua franca’ of data analysis and statistical computing, used in academia, climate research, computer science, bioinformatics, pharmaceutical industry, customer analytics, data mining, finance and by some insurers. Apart from being stable, fast, always up-to-date and very versatile, the chief advantage of R is that...

Getting Access data into R

January 9, 2013

The myth of the missing Data Scientist

January 7, 2013

Much has been said about the dire shortage of Data Scientists looming on the horizon. With the spectre of Big …

analyze the medical expenditure panel survey (meps) with r

January 7, 2013

the meps household component leads the pack for examining individual-level medical expenditures by payor and type of service.  total expenditures captured by the survey tend to be low, but unbiased across the board and can be adjusted to match the...

Batch forecasting in R

January 6, 2013

I sometimes get asked about forecasting many time series automatically. Here is a recent email, for example: I have looked but cannot find any info on generating forecasts on multiple data sets in sequence. I have been using analysis services for sql server to generate fitted time series but it is too much of a black box (or I...

Search and replace: Are you tired of nested `ifelse`?

January 6, 2013

It happens all the time: you have a vector of fruits and you want to replace all bananas with apples, all oranges with pineapples, and leave all the other fruits as-is, or maybe change them all to figs. The usual solution? A big old nested `ifelse`: ...

100 most read R posts in 2012 (stats from R-bloggers) – big data, visualization, data manipulation, and other languages

January 2, 2013

R-bloggers.com is now three years young. The site is an (unofficial) online journal of the R statistical programming environment, written by bloggers who agreed to contribute their R articles to the site. Last year, I posted on the top 24...

The (near) Future of Data Analysis – A Review

January 2, 2013

Sean Murphy co-organizes Data Business DC, among many other things. Hadley Wickham, having just taught workshops in DC for RStudio, shared with the DC R Meetup his view on the future, or at least the near future, of Data Analysis. …

Efficiency of Extracting Rows from A Data Frame in R

January 1, 2013

In the example below, 552 rows are extracted from a data frame with 10 million rows using six different methods. The results show a significant disparity in CPU time between the least and the most efficient methods. As in my previous post, the method using the data.table package is the most efficient.
