2280 search results for "map"

Two ways that correlation and stepwise regression can give different results

October 8, 2012
By

In general, a correlation test is used to test the association between two variables (y and z). However, if there is a third variable (x) that might be related to z or y, it makes...

Read more »

DIY ZeroAccess GeoIP Analysis : So What?

October 8, 2012
By
DIY ZeroAccess GeoIP Analysis : So What?

NOTE: A great deal of this post comes from @jayjacobs as he took a conversation we were having about thoughts on ways to look at the data and just ran like the Flash with it. Did you know that – if you’re a US citizen – you have approximately a 1 in 5 chance of getting the

Read more »

GBIF biodiversity data from R – more functions

October 8, 2012
By
GBIF biodiversity data from R – more functions

We have been working on an R package to get GBIF data from R, with the stable version available through CRAN here, and the development version available on GitHub here. We had a Google Summer of code stuent work on the package this summer - you can se...

Read more »

EDA Before CDA

October 6, 2012
By
EDA Before CDA

One Paragraph Summary Always explore your data visually. Whatever specific hypothesis you have when you go out to collect data is likely to be worse than any of the hypotheses you’ll form after looking at just a few simple visualizations of that data. The most effective hypothesis testing framework in existence is the test of

Read more »

A quick introduction to ggplot()

October 5, 2012
By
A quick introduction to ggplot()

I gave a short talk today to the about ggplot. This what I presented. Additional resources at the bottom of this post ggplot is an R package for data exploration and producing plots. It produces fantastic-looking graphics and allows one to slice and dice one’s data in many different ways. Comparing with base...

Read more »

DIY ZeroAccess GeoIP Plots

October 5, 2012
By
DIY ZeroAccess GeoIP Plots

Since F-Secure was #spiffy enough to provide us with GeoIP data for mapping the scope of the ZeroAccess botnet, I thought that some aspiring infosec data scientists might want to see how to use something besides Google Maps & Google Earth to view the data. If you look at the CSV file, it’s formatted as

Read more »

Improving the integration between R and Hadoop: rmr 2.0 released

October 4, 2012
By

The RHadoop project, the open-source project supported by Revolution Analytics to integrate R and Hadoop, continues to evolve. Now available is version 2 of the rmr package, which makes it possible for R programmers to write map-reduce tasks in the R language, and have them run within the Hadoop cluster. This update is the "simplest and fastest rmr yet",...

Read more »

Presidential Debates with qdap-beta

October 4, 2012
By
Presidential Debates with qdap-beta

qdap brief intro For the past year I’ve been working on a package (qdap) to assist my field in quantitative discourse analysis; basically looking at patterns in language. It’s still a ways from being finished and lacks documentation (roxygen2 is … Continue reading →

Read more »

Garmin data visualization

October 4, 2012
By
Garmin data visualization

People go on rage, when governments initiate surveillance projects like CleanIT, nevertheless share very private data without a doubt. I have to admit, that some data leaks are well buried in the process. Take for example Garmin which produces GPS training devices for runners. In order to see your workouts you are forced to upload

Read more »

City Size and SUHI

October 2, 2012
By
City Size and SUHI

In the course of putting together data for my kriging project with the CRN stations, I got another idea related to a small but potentially important corner of the concerns over UHI in the global temperature index. For clarity I suppose I should make it clear that my position is that the UHI bias is

Read more »