April 2016

The one machine learning concept you need to know

April 25, 2016 | 0 Comments

Machine learning is hard. Some people spend weeks, months, even years trying to learn machine learning without any success. They play around with datasets, buy books, compete on Kaggle, but ultimately make little progress. One of the big problems is that many people just want to “dive in and build ... [Read more...]

Fast csv writing for R

April 25, 2016 | 0 Comments

Guest post by Matt Dowle. This post was first published on the H2O blog, please go there to leave a comment. R has [Read more...]

Missing Value Treatment

April 25, 2016 | 0 Comments

Missing values in data are a common phenomenon in real-world problems. Knowing how to handle missing values effectively is a required step to reduce bias and to produce powerful models. Let's explore various options for dealing with missing values and how to implement them. Data prep and ...
[Read more...]

Candlestick charts using Plotly and Quantmod

April 24, 2016 | 0 Comments

This post is dedicated to creating candlestick charts using Plotly’s R-API. For more information on candlestick charts visit www.stockcharts.com. We’ll also showcase Plotly’s awesome new range selector feature! [Read more...]

Map of the Windows Fonts Registered with R

April 24, 2016 | 0 Comments

If you have already found the extrafont package, you have probably also found out how to load and use Windows fonts in R visualizations. But just in case, everything to get started with extrafont is found here and summarized for using fonts in Windows for on-screen or bitmap output below: One thing to add ...
[Read more...]

Fast csv writing for R

April 24, 2016 | 0 Comments

R has traditionally been very slow at reading and writing csv files of, say, 1 million rows or more. Getting data into R is often the first task a user needs to do and if they have a poor experience (either hard to use, or very slow) they are less likely ... [Read more...]

Sparkling Water for Spark(R)

April 22, 2016 | 0 Comments

Update: this blog post seems to be obsolete now, and that's a good thing. I've kept the rest of the blog post intact for historical reasons. SparkR lets R users do data wrangling on bigger chunks of data. The machine learning algorithms that are supported are a bit modest (only linear ... [Read more...]

R Courses at Newcastle

April 22, 2016 | 0 Comments

Over the next two months I’m running a number of R courses at Newcastle University.

May 2016
May 10th, 11th: Predictive Analytics
May 16th – 20th: Bioconductor
May 23rd, 24th: Advanced programming

June 2016
June 8th: R for Big Data
June 9th: Interactive graphics with Shiny

Since these courses are on advanced ...
[Read more...]

yorkr crashes the IPL party! – Part 4

April 22, 2016 | 0 Comments

Introduction

“I’ve missed more than 9000 shots in my career. I’ve lost almost 300 games. 26 times, I’ve been trusted to take the game winning shot and missed. I’ve failed over and over and over again in my life. And that is why I succeed.” (Michael Jordan)

Success is ...
[Read more...]

How to use SparkR in Cloudera Hadoop

April 22, 2016 | 0 Comments

Suppose you are an avid R user, and you would like to use SparkR in Cloudera Hadoop; unfortunately, as of the latest CDH version (5.7), SparkR is still not supported (and, according to a recent discussion in the Cloudera forums, we shouldn’t expect this to happen anytime soon). Is there ... [Read more...]

Microsoft R Open 3.2.4 now available

April 22, 2016 | 0 Comments

Microsoft R Open 3.2.4, Microsoft's enhanced distribution of R, is now available for download from mran.microsoft.com. This update is based on R 3.2.4-revised, and includes several improvements and some minor bug fixes from the R Core Group. Improvements include long-vector support for the smooth function, a new stringsAsFactors ... [Read more...]

Introducing fidlr: FInancial Data LoadeR

April 21, 2016 | 0 Comments

fidlr is an RStudio addin designed to simplify the financial data downloading process from various providers. This initial version is a wrapper around the getSymbols function in the quantmod package and only Yahoo, Google, FRED and Oanda are supported. I will probably add functionalities over time. As usual with those ... [Read more...]

Principal curves example (Elements of Statistical Learning)

April 21, 2016 | 0 Comments

The bit of R code below illustrates the principal curves methods as described in The Elements of Statistical Learning, by Hastie, Tibshirani, and Friedman (Ch. 14; the book is freely available from the authors' website). Specifically, the code generates some bivariate data that have a nonlinear association, initializes the principal curve ... [Read more...]

Get ready for R/Finance 2016

April 21, 2016 | 0 Comments

by Joseph Rickert

R/Finance 2016 is less than a month away and, as always, I am very much looking forward to it. In past years, I have elaborated on what puts it among my favorite conferences even though I am not a finance guy. R/Finance is small, single track ... [Read more...]

an integer programming riddle

April 20, 2016 | 0 Comments

A puzzle on The Riddler this week ends up as a standard integer programming problem. Removing the little story around the question, it boils down to optimising 200a+100b+50c+25d under the constraints 400a+400b+150c+50d≤1000, b≤a, a≤1, c≤8, d≤4, and (a,b,c,d) all ...
[Read more...]
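The teaser above states enough of the problem to try it by hand. What follows is a minimal brute-force sketch in Python, not the post's own solution. It rests on two assumptions the truncated teaser does not confirm: that "optimise" means maximise, and that a, b, c and d are non-negative integers. The function name `solve_riddle` is hypothetical.

```python
from itertools import product


def solve_riddle():
    """Enumerate every candidate (a, b, c, d) and keep the best feasible one.

    Assumptions (not confirmed by the teaser): the objective is maximised
    and all four variables are non-negative integers.
    """
    best_value, best_point = None, None
    # Upper bounds come from the stated constraints: a<=1, b<=a, c<=8, d<=4.
    for a, b, c, d in product(range(2), range(2), range(9), range(5)):
        if b > a:
            continue  # violates b <= a
        if 400 * a + 400 * b + 150 * c + 50 * d > 1000:
            continue  # violates the budget constraint
        value = 200 * a + 100 * b + 50 * c + 25 * d
        if best_value is None or value > best_value:
            best_value, best_point = value, (a, b, c, d)
    return best_value, best_point


if __name__ == "__main__":
    print(solve_riddle())
```

Under those assumptions the search space has only 180 candidates, so exhaustive enumeration is enough and no solver is needed. It returns a maximum of 425 at (a, b, c, d) = (1, 0, 3, 3), which uses the full budget of 1000.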

Pride and Prejudice and Z-scores

April 20, 2016 | 0 Comments

You might think literary criticism is no place for statistical analysis, but given digital versions of the text you can, for example, use sentiment analysis to infer the dramatic arc of an Oscar Wilde novel. Now you can apply similar techniques to the works of Jane Austen thanks to Julia ... [Read more...]