Articles by hrbrmstr

Diving Into Dynamic Website Content with splashr

February 9, 2017 | hrbrmstr

If you do enough web scraping, you’ll eventually hit a wall that the trusty httr verbs (that sit beneath rvest) cannot really overcome: dynamically created content (via javascript) on a site. If the site was nice enough to use XHR requests to load the dynamic content, you can generally ...
[Read more...]

Candy Coated Confidence Intervals

February 3, 2017 | hrbrmstr

@mrshrbrmstr hinted that she would like this post by @RickWicklin translated into R for her stats class. She’s quite capable of cranking out the translation of the core component of that post — a call to chisq.test — but she wanted to show the entire post (in R) and really ... [Read more...]

Exploring News Coverage With newsflash

February 1, 2017 | hrbrmstr

I was enthused to see a mention of this on the GDELT blog since I’ve been working on an R package dubbed newsflash to work with the API that the form front-ends. Given the current climate, I feel compelled to note that I’m neither a Clinton supporter/defender/...
[Read more...]

Workout Wednesday Redux (2017 Week 3)

January 18, 2017 | hrbrmstr

I had started a “52 Vis” initiative back in 2016 to encourage folks to get practice making visualizations since that’s the only way to get better at virtually anything. Life got crazy, 52 Vis fell to the wayside and now there are more visible alternatives such as Makeover Monday and Workout Wednesday. ...
[Read more...]

Knit directly to jupyter notebooks from RStudio

January 10, 2017 | hrbrmstr

Did you know that you can completely replace the “knitting” engine in R Markdown documents? Well, you can! Why would you want to do this? Well, in the case of this post, to commit the unpardonable sin of creating a clunky jupyter notebook from a pristine Rmd file. I’m ... [Read more...]

2017-01 Authored Package Updates

January 8, 2017 | hrbrmstr

The rest of the month is going to be super-hectic and it’s unlikely I’ll be able to do any more to help the push to CRAN 10K, so here’s a breakdown of CRAN and GitHub new packages & package updates that I felt were worth raising awareness on: ...
[Read more...]

The Most Important Commodity in 2017 is Data

January 5, 2017 | hrbrmstr

Despite being in cybersecurity nigh forever (a career that quickly turns one into a determined skeptic if you’re doing your job correctly) I have often trusted various (not to be named) news sources, reports and data sources to provide honest and as-unbiased-as-possible information. The debacle in the U.S. ...
[Read more...]

Pipes (%>%) Everywhere

December 22, 2016 | hrbrmstr

An R user asked a question regarding whether it’s possible to have the RStudio pipe (%__%) shortcut (Cmd-Shift-M) available in other macOS applications. If you’re using Alfred then you can use this workflow for said task (IIRC this requires an Alfred license which is reasonably cheap). When you add ... [Read more...]

sergeant : An R Boot Camp for Apache Drill

December 20, 2016 | hrbrmstr

I recently mentioned that I’ve been working on a development version of an Apache Drill R package called sergeant. Here’s a lifted “TLDR” on Drill: Drill supports a variety of NoSQL databases and file systems, including HBase, MongoDB, MapR-DB, HDFS, MapR-FS, Amazon S3, Azure Blob Storage, Google Cloud ... [Read more...]

Package update: longurl 0.3.0 is hitting CRAN mirrors

December 18, 2016 | hrbrmstr

The longurl package has been updated to version 0.3.0 as a result of a bug report noting that the URL expansion API it was using went pay-for-use. Since this was the second time a short URL expansion service either went belly-up or had breaking changes the package is now completely client-side-based ...
[Read more...]

Minding the zoo[keeper] with R

December 16, 2016 | hrbrmstr

I’ve been drafting a new R package — sergeant — to work with Apache Drill and have found it much easier to manage having Drill operating in a single node cluster vs drill-embedded mode (esp when I need to add a couple more nodes for additional capacity). That means running Apache ... [Read more...]

Interacting With Amazon Athena from R

December 5, 2016 | hrbrmstr

This is a short post for those looking to test out Amazon Athena with R. Amazon makes Athena available via JDBC, so you can use RJDBC to query data. All you need is their JAR file and some setup information. Here’s how to get the JAR file to the ... [Read more...]

Survey on Data Science In Two Year Colleges

December 3, 2016 | hrbrmstr

The ASA (American Statistical Association) has been working in collaboration with the ACM (Association for Computing Machinery) on developing a data science curriculum for Two Year Colleges. Part of this development is the need to understand the private-sector demand for two-year college data science graduates and the prevalence of the ... [Read more...]
1 11 12 13 14 15 21

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)