My favorite R bug

May 23, 2015

In this note I am going to recount “my favorite R bug.” It isn’t a bug in R. It is a bug in some code I wrote in R. I call it my favorite bug because it is easy to commit and (thanks to R’s overly helpful nature) takes longer than it should to find. The …

Parametric Inference: Likelihood Ratio Test Problem 2

May 23, 2015

More on the likelihood ratio test: the following problem is originally from Casella and Berger (2001), Exercise 8.12. Problem: For samples of size $n=1,4,16,64,100$ from a normal population with mean $\mu$...

My New Book and Other Matters

May 22, 2015

I haven’t posted for a while, so here are some news items: My new book, Parallel Computation for Data Science, will be out in June or July. I believe...

Simulation-based power analysis using proportional odds logistic regression

May 22, 2015

Consider planning a clinical trial where patients are randomized in permuted blocks of size four to either a 'control' or 'treatment' group. The outcome is measured on an 11-point...

visNetwork, Currencies, and Minimum Spanning Trees

May 22, 2015

Just because I’m ignorant doesn’t mean I won’t try things.  Feel free to correct any ignorance that follows.  More than anything I would like to feature the new htmlwidget...

R Now Contains 150 Times as Many Commands as SAS

May 22, 2015

by Bob Muenchen In my ongoing quest to analyze the world of analytics, I’ve updated the Growth in Capability section of The Popularity of Data Analysis Software. To save...

Old is New: XML and rvest

May 22, 2015

Huh… I didn’t realize just how similar rvest was to XML until I did a bit of digging. After my wonderful experience using dplyr and tidyr...

Tutorial Recap: Analyzing Census Data in R

May 22, 2015

A big thanks to Gabriela de Queiroz for organizing the San Francisco R-Ladies Meetup, where I spent a few hours yesterday introducing people to my census-related R packages. A special thanks to Sharethrough...

CONCOR in R

May 22, 2015

In network analysis, blockmodels provide a simplified representation of a more complex relational structure. The basic idea is to assign each actor to a position and then depict the...

sjmisc – package for working with (labelled) data #rstats

May 22, 2015

The sjmisc package: My last posting was about reading and writing data between R and other statistical packages like SPSS, Stata or SAS. After that, I decided to bundle all...

Revolution R Open 3.2.0 now available for download

May 22, 2015

The latest update to Revolution R Open, RRO 3.2.0, is now available for download from MRAN. In addition to new features, this release tracks the version number of the...

The R Foundation announces new mailing list ‘R-package-devel’

May 22, 2015

At last week’s monthly meeting, the R Foundation decided to create a new mailing list to help R package authors with package development and testing....

Exact computation of sums and means

May 21, 2015

A while ago, I came across a mention of the Python math.fsum function, which sums a set of floating-point values exactly, then rounds to the closest floating point value. This...

BH release 1.58.0-1

A new release of BH is now on CRAN. BH provides a large part of the Boost C++ libraries as a set of template headers for use by...

Vega.jl Rebooted – Now with 100% More Pie and Donut Charts!

May 21, 2015

Mmmmm, chartjunk! Rebooting Vega.jl: Recently, I’ve found myself without a project to hack on, and I’ve always been interested in learning more about browser-based...

First Day Highlights from the Extremely Large Databases Conference

May 21, 2015

by Joseph Rickert. The 8th XLDB (Extremely Large Databases) Conference opened at Stanford on Tuesday with an outstanding program. This conference has been providing leadership in the "Big Data"...

Introductory Point Pattern Analysis of Open Crime Data in London

May 21, 2015

Introduction: Police in Britain (http://data.police.uk/) not only register every single crime they encounter, including its coordinates, but also distribute their data free on the web. They have two ways...

Course on using Oracle R Enterprise

From June 08 to June 12, BNOSAC will give a 5-day crash course on using R with Oracle R Enterprise. The course is given...

How To Analyze Data: 21 Graphs that Explain the Same-Sex Marriage Case, Public Opinion, & Supreme Court

May 21, 2015

The nine Justices on the United States Supreme Court recently took up a case about same-sex marriage. The question in Obergefell v. Hodges is whether states are required to...

RInside 0.2.13

A new release 0.2.13 of RInside is now on CRAN. RInside provides a set of convenience classes which facilitate embedding of R inside of C++ applications and programs,...

Open source software has changed the way we do business

May 20, 2015

Earlier this month TechCrunch published an article of mine, "The Business Economics And Opportunity Of Open-Source Data Science". With this article I wanted to share how open-source software has...

Teaching R course? Use analogsea to run your customized RStudio in Digital Ocean!

May 20, 2015

Two years ago I taught an introductory R/Shiny course here at The Jackson Lab. We all learnt a lot. Unfortunately not about Shiny itself, but rather about incompatibilities between...

Put Google Scholar citations on your personal website with R, scholar, ggplot2 and cron

May 20, 2015

I have been looking for a solution to put my Google Scholar citations on my personal website for quite some time now. Some apps/gadgets seem to have existed to do...

Are Canadian newspapers painting false pictures with data?

May 20, 2015

The Canadian newspaper, Globe and Mail, is a leader in diction and style, but it may need improvement in the ‘grammar of graphics’. Globe’s recent depiction of...

First Steps with Structural Equation Modeling

May 20, 2015

Last Friday at the Davis R Users’ Group, Grace Charles gave a presentation on structural equation modeling in R using the Lavaan package. Here’s...

Databases – an Ideal Application for R6?

May 20, 2015

Over the years I have tried to simplify and streamline my access to historical financial data. All the different solutions I have tried so far (see here, for example) have been...

More postdoctoral opportunities at IARC

May 20, 2015

If you want to work with R and JAGS on a hard multivariate measurement error problem in nutritional epidemiology, then take a look at this advert. http://www.iarc.fr/en/vacancies/postdoc/postdoc_NEP_June2015.pdf...

Clusters Powerful Enough to Generate Their Own Subspaces

May 20, 2015

Clusters are groupings that have no external label. We start with entities described by a set of measurements but no rule for sorting them by type. Mixture modeling makes...

Benchmarking Random Forest Implementations

May 19, 2015

I currently have the need for machine learning tools that can deal with observations of...
