566 search results for "SQL"

open-source campaign finance analysis with R and MySQL

June 18, 2009
By
open-source campaign finance analysis with R and MySQL

Introduction In Part 1 of this tutorial we introduced the fechell library by extracting all itemized contributions from individuals made to the Obama For America campaign in 2007 and 2008. In Part 2 of the tutorial we will summarize that data set by importing it into a MySQL database and aggregating contributions by week and

Read more »

Choosing an SQL Engine for Analytics

March 9, 2009
By
Choosing an SQL Engine for Analytics

I’ve been struggling for a while on which database to use for my working data. I used to use MS Access quite a lot. The problems with MS Access include but are not limited to: 2 GB file size limit, at least historically Versions change with each edition of MS Office Sort of tough to write SQL scripts Very

Read more »

R tops KDNuggets data analysis software poll for 4th consecutive year

August 29, 2014
By
R tops KDNuggets data analysis software poll for 4th consecutive year

KDNuggests asked its readers the question "What programming/statistics languages you used for an analytics / data mining / data science work in 2014?" and one again, R was the #1 response. (R was also the #1 response in similar polls in 2013, 2012 and 2011.) The top 5 selections of the 719 respondents were: R (352 respondents) SAS (262)...

Read more »

Critical Data (free) Hackathon in London (Sept 6-7, 2014)

August 11, 2014
By

We would like to invite you to participate in our forthcoming Critical Data Marathon a FREE event taking place in London on 6-7 September 2014. The event is collaboratively organised between researchers at Massachusetts Institute of Technology (MIT), University College London, King’s College London, and Imperial College London.The data marathon will bring together frontline healthcare providers (nurses, pharmacists, doctors) with data scientists to...

Read more »

Social Media Mining and Bioinformatics (with R)

August 5, 2014
By
Social Media Mining and Bioinformatics (with R)

In June and July, I receive copies of two books, Social Media Mining with R, by Nathan Danneman and Richard Heimann Bioinformatics with R Cookbook, by Paurush Praveen Sinha For the first one, two recent interesting books deal with the same topic. Reza Zafarani, Mohammad Ali Abbasi and Huan Liu published last year Social Media Mining: An Introduction. Actually, the book can...

Read more »

jpmml and R (Free Webinar)

July 28, 2014
By
jpmml and R (Free Webinar)

This free, global webinar will provide an introduction to jpmml, the world’s leading open-source PMML scoring engine currently being utilized by companies such as Airbnb to rapidly deploy predictive models into production. Webinar Format: – What is PMML? – Building … Continue reading →

Read more »

Competitive balance and home court advantage in the NBA

July 6, 2014
By
Competitive balance and home court advantage in the NBA

Two years ago, the entire NBA season went into lockout because of mostly financial reasons. However, one central point was also about keeping a competitive balance within the NBA, so that large and small-market teams alike would have a chance to compete for a championship. THis brings us to the obvious question “Is there competitive

Read more »

How To: 20 Minute Guide to Get Started with PivotalR

July 1, 2014
By
How To: 20 Minute Guide to Get Started with PivotalR

In this article, Pivotal engineer and predictive analytics expert Hai Qian explains how someone new to R can get started performing statistical analysis on data stores in Greenplum Database, Pivotal HD and PostgreSQL in just 20 minutes using PivotalR. First, there is some background on R’s popularity, then the articles dives into important topics such as installation, data loading,...

Read more »

Maybe I Don’t Really Know R After All

June 26, 2014
By
Maybe I Don’t Really Know R After All

Lately, I’ve been feeling that I’m spreading myself too thin in terms of programming languages. At work, I spend most of my time in Hive/SQL, with the occasional Python for my smaller data. I really prefer Julia, but I’m alone at work on that one. And since I maintain a package on CRAN (RSiteCatalyst), I frequently spend Related posts:

Read more »

Tailoring univariate probability distributions

June 26, 2014
By
Tailoring univariate probability distributions

This post shows how to build a custom univariate distribution in R from scratch, so that you end up with the essential functions: a probability density function, cumulative distribution function, quantile function and random number generator. In the beginning all you need is an equation of the probability density function, … Continue reading →

Read more »