Monthly Archives: August 2013

Reverse IP Address Lookups With R (From Simple To Bulk/Asynchronous)

August 12, 2013
By

R lacks some of the more “utilitarian” features found in other scripting languages that were/are more geared—at least initially—towards systems administration. One of the most frustrating missing pieces for security data scientists is the lack of ability to perform basic IP address manipulations, including reverse DNS resolution (even though it has nsl() which is just

Read more »

A Stata HTML syntax highlighter in R

August 12, 2013
By

So I have been having difficulty getting my Stata code to look the way I want it to look when I post it to my blog.  To alleviate this condition I have written a html encoder in R.  I don't know much about html so it is likely to be a little ...

Read more »

A beginner’s video introduction to R, from Google

August 12, 2013
By

If you're an absolute beginner to the R language, this Intro to R video series from Google Developers is a great place to get started. Just download R for your system, start the playlist below, and follow along with the on-screen examples. (The video uses the MacOS X version of R, but you should be able to follow along...

Read more »

Short tales of two NCAA basketball conferences (Big 12 and West Coast) using graphs

August 12, 2013
By
Short tales of two NCAA basketball conferences (Big 12 and West Coast) using graphs

Having been at the University of Kansas (Kansas Jayhawks) as a student and now working at Gonzaga University (Gonzaga Bulldogs), discussions about college basketball are inescapable. This post uses R, ggmap, ggplot2 and the shiny server to graphically ...

Read more »

Variable importance in neural networks

August 12, 2013
By
Variable importance in neural networks

If you’re a regular reader of my blog you’ll know that I’ve spent some time dabbling with neural networks. As I explained here, I’ve used neural networks in my own research to develop inference into causation. Neural networks fall under two general categories that describe their intended use. Supervised neural networks (e.g., multilayer feed-forward networks)

Read more »

Exploratory Data Analysis: The 5-Number Summary – Two Different Methods in R

Exploratory Data Analysis: The 5-Number Summary – Two Different Methods in R

Introduction Continuing my recent series on exploratory data analysis (EDA), today’s post focuses on 5-number summaries, which were previously mentioned in the post on descriptive statistics in this series.  I will define and calculate the 5-number summary in 2 different ways that are commonly used in R.  (It turns out that different methods arise from

Read more »

Identifying Potential Customers with Classification Techniques in R Language

August 12, 2013
By
Identifying Potential Customers with Classification Techniques in R Language

Data mining techniques and algorithms such as Decision Tree, Naïve Bayes, Support Vector Machine, Random Forest, and Logistic Regression are “most commonly used for predicting a specific outcome such as response / no-response, high / medium / low-value customer, likely to buy / not buy.”1 In this article, we will demonstrate how to use R

Read more »

Time Series Decomposition

August 12, 2013
By
Time Series Decomposition

In the last post on the changepoint package, I concluded with a brief example of time series decomposition with the "decompose" command.  After further reading, I discovered the "stl" command, which to me appears a superior method.  STL stand...

Read more »

analyze the national plan and provider enumeration system (nppes) with r and monetdb

August 12, 2013
By

the national plan and provider enumeration system (nppes) contains information about every provider, insurance plan, and clearinghouse actively operating in the united states healthcare industry.  did i just see the ears of all the health workforce researchers in the room perk up?  it's freely downloadable, courtesy of the department of health and human services' implementation of the...

Read more »

Some belated spring cleaning

August 11, 2013
By
Some belated spring cleaning

A very busy spring has transitioned into a very busy summer, so let me recap a few topics that probably deserve more time than I’ll give them here. Here are the things I’m overdue on, in no particular order: Publications In the March edition of the Journal of Risk, Kris Boudt, Brian Peterson and I

Read more »

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)