Basic MCMC and Bayesian statistics in… BASIC!

August 9, 2015
By
Basic MCMC and Bayesian statistics in… BASIC!

The BASIC programming language was at one point the most widely spread programming language. Many home computers in the 80s came with BASIC (like the Commodore 64 and the Apple II), and in the 90s both DOS and Windows 95 included a copy of the QBasic IDE. QBasic was also the first programming language I encountered (I used...

Read more »

New R Package – domaintools (access the DomainTools.com WHOIS API)

August 9, 2015
By

We just did a github release for an R package that provides an interface to the DomainTools API. It provides access to the core API functions that aren’t restricted (i.e. the ones we have access to): domaintools_api_key: Get or set DOMAINTOOLS_API_KEY value domaintools_username: Get or set DOMAINTOOLS_API_USERNAME value domain_profile: Domain Profile hosting_history: Hosting History parsed_whois: Parsed Whois reverse_ip: Reverse IP reverse_ns: Reverse Nameserver shared_ips: Shared IPs whois: Whois Lookup whois_history: Whois History Each function has a full...

Read more »

cdcfluview – On The Way to “CRAN 7K”

August 9, 2015
By

I like to turn coincidence into convergence whenever possible. This weekend, a user of cdcfluview had a question that caused me to notice a difference in behaviour between the package was interacting with CDC FluView API, so I updated the package to accommodate the change and the user. Around the same time, @recology_ tweeted: we're

Read more »

Two Presentations about Joint Models

August 9, 2015
By

Packages JM and JMbayes @ JSM2015This year JSM features an interesting invited session about fitting joint models in different software packages -- if you're interested drop by... Here are my slides in which I give a short intro in packages JM and JMba...

Read more »

R packages for undergraduate stat ed

August 9, 2015
By

The other day on the isostat mailing list Doug Andrews asked the following question: Which R packages do you consider the most helpful and essential for undergrad stat ed? I ask in great part because it would help my local … Continue reading →

Read more »

Predicting Titanic deaths on Kaggle III: Bagging

August 9, 2015
By
Predicting Titanic deaths on Kaggle III: Bagging

This is the third post on prediction the deaths. The first one used randomforest, the second boosting (gbm). The aim of the third post was to use bagging. In contrast to the former posts I abandoned dplyr in this post. It gave some now you see now you ...

Read more »

Slides from my JSM 2015 talk on dendextend

August 8, 2015
By
heatmap_dendextend_Picture1

If you happen to be at the JSM 2015 conference this week, then this Monday, at 2pm, I will give a talk on the dendextend R package  (in the session “Advances in Graphical Frameworks and Methods Part 1“) – feel free to drop by and say hi. Here are my slides for the intended talk:   … Continue reading...

Read more »

Hey Fitbit, my data belong to me!

August 8, 2015
By
Hey Fitbit, my data belong to me!

When you go on Fitbit’s website and want to download your own Fitbit data, you will find your way to their Data Export utility on their settings page. Once you get there, you are greeted with what looks like a … Continue reading →

Read more »

delayed in Seattle

August 8, 2015
By
delayed in Seattle

Here are the slides of my talk on delayed acceptance I present this afternoon at JSM 2015, in Seattle, in the Bayesian Computation I (2pm, room CC-4C1) and II (4pm, room CC-3A) sessions Omiros Papaspiliopoulos and myself put together (sponsored by IMS and ISBA):Filed under: Books, R, Statistics, Travel, University life Tagged: American Statistical Association,

Read more »

drat 0.1.0: Even more repository support

August 8, 2015
By
drat 0.1.0: Even more repository support

A new version 0.1.0 of the drat package arrived on CRAN today. Its name stands for drat R Archive Template, and it helps with easy-to-create and easy-to-use repositories for R packages, and is finding increasing by other projects. This version 0.1.0...

Read more »

Soap analytics: Text mining “Goede tijden slechte tijden” plot summaries….

August 8, 2015
By
Soap analytics: Text mining “Goede tijden slechte tijden” plot summaries….

Sorry for the local nature of this blog post. I was watching Dutch television and zapping between channels the other day and I stumbled upon “Goede Tijden Slechte Tijden” (GTST). This is a Dutch soap series broadcast by RTL Nederland. … Continue reading →

Read more »

A Decade of Premier League Football

August 7, 2015
By
A Decade of Premier League Football

The English Premier League is BACK!Before we get over-excited, let's have a look at the past. Here is a shiny app to visualise some stats from last ten seasons. It is my first attempt at developing data-driven sports stories with R, Shiny and Shiny Das...

Read more »

Argentine general election, 2015

August 7, 2015
By
Argentine general election, 2015

The 2015 Argentine's presidential election to be held next October 25th is approaching and the dispute begun to appear more clearly since the major parties announced their potential candidates last June. This Sunday, the political parties are holding their primaries for the upcoming presidential election. As in US, in Argentine the primaries are important for

Read more »

New ‘R Talk’ podcast with news from R Consortium

August 7, 2015
By

There's a new podcast in town, and it's focused on R. Four members of the R community (Oliver Keyes, Wikimedia Foundation; Jasmine Dumas. DePaul University; Ted Hart, Silcon Valley "fruit vendor"; and Mikhail Popov, Wikimedia Foundation) have banded together to produce R Talk, a podcast about "the R Programming language, featuring news, interviews and dives into how R is...

Read more »

The New and Improved R Shodan Package

August 7, 2015
By

For those not involved with all things “cyber”, let me start with a description of what Shodan is (though visiting the site is probably the best introduction to what secrets it holds). Shodan is—at it’s core—a search engine. Unlike Google, Shodan indexes what I’ll call “cyber” metadata and content about everything accessible via a public IP address. This means things like routers,...

Read more »

Adding a CRAN Search Engine to Chrome

August 7, 2015
By

Riffing off of the previous post, here’s a way to quickly search CRAN (the @RStudio flavor) from the Chrome search bar. Paste chrome://settings/searchEngines into your location bar and hit return/enter Scroll down until the input boxes show, enabling you to add a search engine For “Add a new search engine” put “CRAN“ For “Keyword” put

Read more »

Call for Papers at the Budapest BI and Open Analytics Forum

August 7, 2015
By
Call for Papers at the Budapest BI and Open Analytics Forum

Last year, I was extremely happy to see the great success of the R community -- dominating the Budapest BI Forum 2014 with 8 talks and 2 workshops on introductory and some hard-core R topics:"My Journey to R" plenary talk and "Fast and flexible data wr...

Read more »

parsing BED coordinates

August 6, 2015
By
parsing BED coordinates

In supplemental file of ChIPseeker paper. I compare distances to TSS reported by several ChIP annotation software, including ChIPseeker, ChIPpeakAnno, HOMER and PeakAnalyzer. Although I found that the chromStart positions in HOMER output have a +1 shift compare to other software, I did not realize this issue since all other software are consistent. Read More: 1102 Words...

Read more »

Matrix Factorization Comes in Many Flavors: Components, Clusters, Building Blocks and Ideals

August 6, 2015
By
Matrix Factorization Comes in Many Flavors: Components, Clusters, Building Blocks and Ideals

Unsupervised learning is covered in Chapter 14 of The Elements of Statistical Learning. Here we learn about several data reduction techniques including principal component analysis (PCA), K-means clustering, nonnegative matrix factorization (NMF) ...

Read more »

Authenticated API Testing Using Travis CI

August 6, 2015
By
Authenticated API Testing Using Travis CI

As I’ve become more serious about contributing in the open-source community, having quality tests for my packages has been something I’ve spent much more time on than when I was just writing quick-and-dirty code for my own purposes. My most used open-sourced package is RSiteCatalyst, which accesses the Adobe Analytics (authenticated) API, which poses a problem: how do

Read more »

Just another way to make a R flavored blog

August 6, 2015
By
Just another way to make a R flavored blog

This post is in active development! image source This usually is "echo=FALSE" rm(list = ls()) library("printr") knitr::opts_knit$set(root.dir = normalizePath("..")) Considerations Use h2 ## in R files to spin, because h1 is reserved for title post. At the begin of R script write a chunk of R code loading the print package This post have some considerations to have when write a post via: r...

Read more »

Compiling RMarkdown from a Helper R Script

August 6, 2015
By

The problemI was looking for a way to compile an RMarkdown document and have the filename of the resulting PDF or HTML document contain the name of the input data that it processed. That is, if I compiled the analysis.Rmd file, where in that file it did some analysis and reporting on data001.txt, I’d want the resulting filename to...

Read more »

A simple statnet model of CRAN

August 6, 2015
By
A simple statnet model of CRAN

by Joseph Rickert In a recent post on creating JavaScript network graphs directly from R, my colleague and fellow blogger, Andrie de Vries, included a link to a saved graph of CRAN. Here, I will use that same graph (network) to build a simple exponential random graph model using functions from the igraph package, and the network and ergm...

Read more »

Tutorial Recap: Make a Census Explorer with Shiny

August 6, 2015
By
Tutorial Recap: Make a Census Explorer with Shiny

On July 28 I had the pleasure of leading a tutorial titled Make a Census Explorer with Shiny! at the San Francisco R-ladies Meetup. A big thank you to Gabriela de Queiroz for organizing the event, Sharethrough for hosting it, and all the participants for attending. If you are interested in seeing the slides, you The post

Read more »

Gathering German newspaper data with the rzeit package

Gathering German newspaper data with the rzeit package

The following is a guest post by Jana Blahak and Jan Dix (University of Konstanz), with support from Simon Munzert. We are happy to introduce our freshly created rzeit package. It connects to the Content API at ZEIT Online, a German newspaper website. In short, the package allows you to conduct an unfiltered search...

Read more »

Speeding Up Your Quest(s) For “R Stuff”

August 5, 2015
By
Speeding Up Your Quest(s) For “R Stuff”

I use Google quite a bit when conjuring up R projects, whether it be in a lazy pursuit of a PDF vignette or to find a package or function to fit a niche need. Inevitably, I’ll do something like this (yeah, I’m still on a mapping kick) and the first (and best) results will come

Read more »

partools: a Sensible R Package for Large Data Sets

August 5, 2015
By
partools: a Sensible R Package for Large Data Sets

As I mentioned recently, the new, greatly extended version of my partools package is now on CRAN. (The current version on CRAN is 1.1.3, whereas at the time of my previous announcement it was only 1.1.1. Note that Unix is NOT required.) It is my contention that for most R users who work with large … Continue reading...

Read more »

Contracting and simplifying a network graph

August 5, 2015
By
Contracting and simplifying a network graph

by Andrie de Vries In a previous post, I used page rank and community structure to create a plot of CRAN. This plot used vibrant colours to allow us to see some of the underlying structure of CRAN. However, much of this structure was still obfuscated by the amount of detail. Concretely, a large number of dots (packages) made...

Read more »

Moment conditions and Bayesian nonparametrics

August 5, 2015
By
Moment conditions and Bayesian nonparametrics

Luke Bornn, Neil Shephard, and Reza Solgi (all from Harvard) have arXived a pretty interesting paper on simulating targets on a zero measure set. Although it is not initially presented this way, but rather in non-parametric terms as moment conditions where θ is the parameter of the sampling distribution, constrained by the value of β. (Which

Read more »