533 search results for "SQL"

Querying a database from within R

August 18, 2012
By
Querying a database from within R

For a while now I have been contemplating pulling data from our postgreSQL db directly from R, but just never actually pulled the trigger until today.  What I found was that it was a lot easier than I ever could have imagined.  My laptop was already on the VPN, so I decided to try it

Read more »

Experience with Oracle R Enterprise in the Oracle micro-processor tools environment

August 17, 2012
By
Experience with Oracle R Enterprise in the Oracle micro-processor tools environment

Normal 0 false false false EN-US X-NONE X-NONE ...

Read more »

Quick SAP HANA and R usecase

Quick SAP HANA and R usecase

DISCLAIMER: I'm not an SAP HANA expert or an R expert, not even a Python expert. I'm just a guy with a lot of ideas who loves to write blogs.The other day I was thinking about making some nice with SAP HANA and R, because people doesn't seem to be enou...

Read more »

Minimum Expected Shortfall Portfolio, Part 1

August 8, 2012
By

A few days ago, I wrote a piece on finding the minimum expected shortfall portfolio.  A few astute commenters quickly picked up where I was going with this -- using this as an alternative to low/minimum volatility portfolios.  What follo...

Read more »

If you are into large data and work a lot with package ff

August 8, 2012
By
If you are into large data and work a lot with package ff

The ff package is a great and efficient way of working with large datasets. One of the main reasons why I prefer to use it above other packages that allow working with large datasets is that it is a complete set of tools.When comparing it to the other open source 'bigdata' packages in RIt is not...

Read more »

Data Parallelism Using Oracle R Enterprise

August 2, 2012
By

Modern computer processors are adequately optimized for many statistical calculations, but large data operations may require hours or days to return a result.  Oracle R Enterprise (ORE), a set of R packages designed to process large data computations in Oracle Database, can run many R operations in parallel, significantly reducing processing time. ORE supports parallelism through the transparency layer,...

Read more »

ScraperWiki in R

July 29, 2012
By

ScraperWiki describes itself as an online tool for gathering, cleaning and analysing data from the web. It is a programming oriented approach, users can implement ETL processes in Python, PHP or Ruby, share these processes among the community (or pay for privacy) and schedule automated runs. The software behind the service is open source, and there is...

Read more »

Success does not require understanding

July 23, 2012
By

I took part in the second Data Science London Hackathon last weekend (also my second hackathon) and it was a very different experience compared to the first hackathon. Once again Carlos and his team really looked after us. The data was released 24 hours before the competition started and even though I had spent less

Read more »

The R packages in a data scientist’s toolbox

July 17, 2012
By

John Myles White, self-described "statistics hacker" and co-author of "Machine Learning for Hackers" was interviewed recently by The Setup. In the interview, he describes his some of his go-to R packages for data science: Most of my work involves programming, so programming languages and their libraries are the bulk of the software I use. I primarily program in R,...

Read more »