Blog Archives

Minding the zoo[keeper] with R

December 16, 2016
By

I’ve been drafting a new R package — sergeant — to work with Apache Drill and have found it much easier to manage having Drill operating in a single node cluster vs drill-embedded mode (esp when I need to add a couple more nodes for additional capacity). That means running Apache Zookeeper and I’ve had... Continue reading...

Read more »

Interacting With Amazon Athena from R

December 5, 2016
By

This is a short post for those looking to test out Amazon Athena with R. Amazon makes Athena available via JDBC, so you can use RJDBC to query data. All you need is their JAR file and some setup information. Here’s how to get the JAR file to the current working directory: To avoid putting... Continue reading...

Read more »

Survey on Data Science In Two Year Colleges

December 3, 2016
By

The ASA (American Statistical Association) has been working in collaboration with the ACM (Association for Computing Machinery) on developing a data science curriculum for Two Year Colleges. Part of this development is the need to understand the private-sector demand for two-year college data science graduates and the prevalence of the need to invest in the... Continue reading...

Read more »

Counting [U.S.] Expatriation with R (a.k.a. a Decade of Desertion)

August 8, 2016
By
Counting [U.S.] Expatriation with R (a.k.a. a Decade of Desertion)

If you’re even remotely following the super insane U.S. 2016 POTUS circus election you’ve no doubt seen a resurgence of “if X gets elected, I’m moving to Y” claims by folks who are “anti” one candidate or another. The Washington Examiner did a story on last quarter’s U.S. expatriation numbers. I didn’t realize we had... Continue reading...

Read more »

QuickLookR – A macOS QuickLook plugin for R Data files

August 6, 2016
By
QuickLookR – A macOS QuickLook plugin for R Data files

I had tried to convert my data-saving workflows to feather but there have been issues with it supporting large files (that seem to be near resolution), so I’ve been continuing to use R Data files for local saving of processed/cleaned data. I make many of these files and sometimes I do it as a one-off... Continue reading...

Read more »

U.S. Drought Animations with the “Witch’s Brew” (purrr + broom + magick)

July 27, 2016
By
U.S. Drought Animations with the “Witch’s Brew” (purrr + broom + magick)

This is another purrr-focused post but it’s also an homage to the nascent magick package (R interface to ImageMagick) by @opencpu. We’re starting to see/feel the impact of the increasing drought up here in southern Maine. I’ve used the data from the U.S. Drought Monitor before on the blog, but they also provide shapefiles and... Continue reading...

Read more »

Use quick formula functions in purrr::map (+ base vs tidtyverse idiom comparisons/examples)

July 26, 2016
By

I’ve converted the vast majority of my *apply usage over to purrr functions. In an attempt to make this a quick post, I’ll refrain from going into all the benefits of the purrr package. Instead, I’ll show just one thing that’s super helpful: formula functions. After seeing this Quartz article using a visualization to compare... Continue reading...

Read more »

Mid-year R Packages Update Summary

July 24, 2016
By

I been updating some existing packages and github-releasing new ones (before a CRAN push). Most are “cyber”-related, but there are some general purpose ones. Here’s a quick overview: docxtractr (CRAN, now, v0.2.0) was initially designed to make it easy to get data tables out of MS Word (docx) documents. The update removes use of a... Continue reading...

Read more »

Slaying CIDR Orcs with Triebeard (a.k.a. fast trie-based ‘IPv4-in-CIDR’ lookups in R)

July 12, 2016
By

The insanely productive elf-lord, @quominus put together a small package (triebeard) that exposes an API for radix/prefix tries at both the R and Rcpp levels. I know he had some personal needs for this and we both kinda need these to augment some functions in our iptools package. Despite triebeard having both a vignette and... Continue reading →

Read more »

CRAN Packages on GitHub (and some CRAN DESCRIPTION observations)

July 10, 2016
By

Just about a week ago @thosjleeper posited something on twitter w/r/t how many CRAN packages had associations with GitHub (i.e. how many used GitHub for development). The DESCRIPTION file (that comes with all R packages) has some fields that can house this information and most folks who do use GitHub for development of R seem... Continue reading →

Read more »

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)