650 search results for "SQL"

24 Days of R: Day 10

December 10, 2013
By
24 Days of R: Day 10

How often is someone nominated for an academy award? Who has been nominated most often? Is there a difference between leading and supporting roles? Important questions. To answer them, I'm making use of a list of academy award nominees and winners. I've obtained the data from aggdata.com which has a few sets of free data.

Read more »

Creating custom CDF for Affy chips in R / Bioconductor

Creating custom CDF for Affy chips in R / Bioconductor What? For those who don't know, CDF files are chip definition format files that define which probes belong to which probesets, and are necessary to use any of the standard summarization methods such as RMA, and others. Why? Because we can, and because custom definitions have been shown to be quite useful. See...

Read more »

Build your own Twitter Archive and Analyzing Infrastructure with MongoDB, Java and R [Part 2] [Update]

December 9, 2013
By
Build your own Twitter Archive and Analyzing Infrastructure with MongoDB, Java and R [Part 2] [Update]

Hello everybody, in my first tutorial I described how you can build your own MongoDB and use a JAVA program to mine Twitter either via the search function and a loop or via the Streaming API. But till now you just have your tweets stores in a Database and we couldn´t get any insight in our …

Read more »

analyze the program for international student assessment (pisa) with r and monetdb

December 9, 2013
By

the authoritative source for evaluating educational achievement across nations, the program(me) for international student assessment ranks the math, science, and reading skills of 15-year-olds in more than sixty countries.  coordinated by the orga...

Read more »

R – Analyze any data frame in Saiku

December 4, 2013
By
R – Analyze any data frame in Saiku

In my previous article I have shown how R can be used to analyze PostgreSQL tables in Saiku using dynamically generated OLAP cubes. Today I will show you how you can analyze any R data frame in Saiku. WIth Saiku you can easily create excel-like pivot t...

Read more »

Using R to replicate common SPSS multiple regression output

December 4, 2013
By
Using R to replicate common SPSS multiple regression output

(This article was first published on Jeromy Anglim's Blog: Psychology and Statistics, and kindly contributed to R-bloggers) The following post replicates some of the standard output you might get from a multiple regression analysis in SPSS. A copy of the code in RMarkdown format is available on github. The post was motivated by this previous post that discussed using...

Read more »

14 Analytics Predictions for 2014

December 3, 2013
By

In a live webinar today hosted by Alteryx, five industry experts shared 14 analytics predictions for 2014. The panel included Paul Ross (Alteryx), Charles Zedlewski (Cloudera), Rick Schultz (Alteryx), Ellie Fields (Tableau) and Michele Chambers (Revolution Analytics). Their predictions were: Analysts will matter more than data scientists R will replace legacy SAS solutions and go mainstream Big Data will...

Read more »

Analyzing baseball data with R

November 27, 2013
By
Analyzing baseball data with R

This week, the post is an interview with Max Marchi. Max is the author, with Jim Albert, of the book "Analyzing baseball data with R". Hi, Max. Welcome back to MilanoR. Last time you wrote for us a series of … Continue reading →

Read more »

Art of Statistical Inference

November 20, 2013
By
Art of Statistical Inference

(This article was first published on MATHEMATICS IN MEDICINE, and kindly contributed to R-bloggers) Art of Statistical Inference Art of Statistical Inference This post was written by me a few years ago, when I started learning the art and science of data analysis. It will be a good starter for the amateur data analysts. Introduction What is statistics? There...

Read more »

The homogenization of scientific computing, or why Python is steadily eating other languages’ lunch

November 18, 2013
By

Over the past two years, my scientific computing toolbox been steadily homogenizing. Around 2010 or 2011, my toolbox looked something like this: Ruby for text processing and miscellaneous scripting; Ruby on Rails/JavaScript for web development; Python/Numpy (mostly) and MATLAB (occasionally) for numerical computing; MATLAB for neuroimaging data analysis; R for statistical analysis; R for plotting

Read more »