569 search results for "SQL"

Analyzing competitive nordic skiing with R

June 22, 2010

Here's another great example of R being used to analyze sports data. Statistician and skier Joran Elias has started a project to analyze and visualize international cross country ski racing results, and he publishes his analysis at the blog Statistical Skier. All of the analyses are done using R (and for data, SQLite via the RSQLite package). As much...

PostGIS in Action Book Review

June 8, 2010

I was recently asked to review a soon-to-be-published book on PostGIS, a spatial extension to the very popular PostgreSQL relational database. I was very excited about receiving an early copy of this book, as the authors have provided countless tips, ...

Data preparation for Social Network Analysis using R and Gephi

June 2, 2010

I want to share my experience in generating the data for social network analysis using R and analyzing it using Gephi... WHICH DATA STRUCTURE TO USE FOR LARGE GRAPHS? I quickly realized that using edge lists and adjacency matrix gets difficult as the g...

MLB Baseball Pitching Matchups ~ grabbing pitcher and/or batter codes by specify game date using R XML

June 1, 2010

MLB Gameday stores its game data in XML format, with the players denoted by ID numbers. To find out who is who, the codes are stored in pitchers.xml or batters.xml of each game. My DownloadPitchFX.R script can download the ID numbers, but it doesn’t look to see who the ID is because of the extra

Source Code Files in R

May 29, 2010

R's interactive programming style is similar to what I have seen in other environments (e.g. Ruby's irb and Oracle's SQL*Plus). There are a few commands that you need to be aware of to get up and running with developing R programs. To identify yo...

Voter targeting with R

May 26, 2010

Voter targeting for turnout is the process of scoring registered voters using demographic and electoral variables taken from voter lists and commercial databases. The score of all voters together is used to predict overall turnout, which determines the allocation of campaign resources and directs strategy for voter contact and communication. Targeting for turnout is a

Testing Out my Pitch F/X Data

May 25, 2010

I recently got all the Pitch F/X data downloaded from Gameday, and have been fiddling around. I certainly don't have the physics knowledge to really talk about the movement at this point, and I'm still acquainting myself with the data format and what e...

Prediction in the cloud: turbulent

May 19, 2010

While Microsoft rolled out its Technical Computing Initiative -- promising new tools for distributed parallel computing on large data sets in the cloud -- with much fanfare earlier this week, Google made a rather more understated response. In a post to the developer-focused Google Code Blog, they quietly announced two new, but potentially disruptive, products. Google BigQuery promises super-fast...

MLB Baseball Pitching Matchups ~ downloading pitch f/x data using the XML package in R [updatedx6]

May 18, 2010

Update x6 (Jul 27): so I guess people want pitch counts. The data @ MLB seems to only give the pitch count of the end result and the strikes/balls/outs of the particular pitch. Of course you can combine them to get the pitch count. Stupid WordPress comments strip out necessary HTML to properly display code,

Reflections on consulting part 5 – what languages and tools to learn?

May 12, 2010

What languages and tools should you learn as a math/stat consultant? To jump to the answer: Excel/VBA, SQL, R, Java, and Python. Spreadsheets have many problems with verifiability and scalability, so why Excel? Excel is: useful for prototyping ideas quickly, either for your own use or to show to other team members; well-known and understood
