586 search results for "sql"

Plotting model fits

August 29, 2012

We all know that it is important to plot your data and explore the data visually to make sure you understand it. The same is true for your model fits. First, you want to make sure that the model is fitting...

Open Research Data Processes: KMi Crunch – Hosted RStudio Analytics Environment

August 23, 2012

One of the possible barriers to widespread adoption of open notebook science is knowing where to start. Video reports of lab experiments hosted on YouTube can be easily embedded in a hosted WordPress blog; a MediaWiki wiki can be used to provide one page per experiment, with change tracking/history on each page and a shadow

The Kaggle Bug

August 22, 2012

If you have any interest in data mining and machine learning, you might have already caught the Kaggle bug. I myself fairly recently got caught up in following the various contests and forums after reading a copy of "Practical Time Series Forecasting," ...

Querying a database from within R

August 18, 2012

For a while now I have been contemplating pulling data from our PostgreSQL database directly from R, but I never actually pulled the trigger until today. What I found was that it was a lot easier than I could have imagined. My laptop was already on the VPN, so I decided to try it

Experience with Oracle R Enterprise in the Oracle micro-processor tools environment

August 17, 2012

Quick SAP HANA and R usecase

DISCLAIMER: I'm not an SAP HANA expert or an R expert, not even a Python expert. I'm just a guy with a lot of ideas who loves to write blogs. The other day I was thinking about making something nice with SAP HANA and R, because people don't seem to be enou...

Minimum Expected Shortfall Portfolio, Part 1

August 8, 2012

A few days ago, I wrote a piece on finding the minimum expected shortfall portfolio.  A few astute commenters quickly picked up where I was going with this -- using this as an alternative to low/minimum volatility portfolios.  What follo...

If you are into large data and work a lot with package ff

August 8, 2012

The ff package is a great and efficient way of working with large datasets. One of the main reasons why I prefer it over other packages that allow working with large datasets is that it is a complete set of tools. When comparing it to the other open source 'bigdata' packages in R, it is not...

Data Parallelism Using Oracle R Enterprise

August 2, 2012

Modern computer processors are adequately optimized for many statistical calculations, but large data operations may require hours or days to return a result.  Oracle R Enterprise (ORE), a set of R packages designed to process large data computations in Oracle Database, can run many R operations in parallel, significantly reducing processing time. ORE supports parallelism through the transparency layer,...

ScraperWiki in R

July 29, 2012

ScraperWiki describes itself as an online tool for gathering, cleaning and analysing data from the web. It takes a programming-oriented approach: users can implement ETL processes in Python, PHP or Ruby, share these processes with the community (or pay for privacy) and schedule automated runs. The software behind the service is open source, and there is...
