October 2016

What is wrong with lift curves

October 30, 2016 | 0 Comments

The first part of our Marketing Analytics Using R course covers campaign analysis with test- and control groups and campaign optimisation using lift curves and predicted responses. Among the many topics covered, we discuss what is wrong with lift curves. They are a standard tool in marketing to select a ... [Read more...]

The Bayesian approach to ridge regression

October 30, 2016 | 0 Comments

In a previous post, we demonstrated that ridge regression (a form of regularized linear regression that attempts to shrink the beta coefficients toward zero) can be super-effective at combating overfitting and lead to a greatly more generalizable model. This approach… Continue reading → [Read more...]

Regular Expressions Exercises – Part 1

October 30, 2016 | 0 Comments

A common task performed during data preparation or data analysis is the manipulation of strings. Regular expressions are meant to assist in such and similar tasks. A regular expression is a pattern that describes a set of strings. Regular expressions can range from simple patterns (such as finding a single ... [Read more...]

Nobel Laureates – A Closer Look At The Data

October 30, 2016 | 0 Comments

The ritual is now well established: since more than a century, every year on early October, the STEM small-world but also the media and many science-&-technology aficionados and enthusiasts wait for the Nobel Prizes announcement. As the 2016 round of Nobel laureates was announced a few days ago only, perhaps ... [Read more...]

Nobel Laureates – A Closer Look At The Data

October 30, 2016 | 0 Comments

Salvino A. Salvaggio [1] [2] ...The ritual is now well established: since more than a century every year on early October, the STEM small-world but also the media and many science-&-technology aficionados and enthusiasts wait for the Nobel Prizes announcement. As the 2016 round of Nobel laureates was announced a few days ... [Read more...]

RProtoBuf 0.4.7: Mostly harmless

October 29, 2016 | 0 Comments

CRAN requested a release updating any URLs for Omegahat to the (actually working) omegahat.net URL. The RProtoBuf package had this in one code comment (errr...) and on bibfile entry. Oh well -- so that caused this 0.4.7 release which arrived on CRAN today. It contains the requested change, and pretty ... [Read more...]

drat 0.1.2: Mostly harmless

October 29, 2016 | 0 Comments

CRAN requested a release updating any URLs for Omegahat to the (actually working) omegahat.net URL. So that caused this 0.1.2 release which arrived on CRAN yesterday. It contains the requested change along with one or two other mostly minor changes w... [Read more...]

How to add a background image to ggplot2 graphs

October 29, 2016 | 0 Comments

When producing so called infographics, it is rather common to use images rather than a mere grid as background. In this blog post, I will show how to use a background image with ggplot2. Packages required The following code will install load and / or install the R packages required for ... [Read more...]

A quick exploration of the ReporteRs package

October 28, 2016 | 0 Comments

The package ReporteRs has been getting some play on the interwebs this week, though it’s actually been around for a while. The nice thing about this package is that it allows writing Word and PowerPoint documents in an OS-independent fashion unlike some earlier packages. It also allows the editing ...
[Read more...]

Join Hadley Wickham’s Master R Workshop in Melbourne, Australia December 12 & 13

October 28, 2016 | 0 Comments

It’s nearly summeRtime in Australia! Join RStudio Chief Data Scientist Hadley Wickham for his popular Master R workshop in Melbourne. Register here:  https://www.eventbrite.com/e/master-r-developer-workshop-melbourne-tickets-22546200292 Melbourne will be Hadley’s first and only scheduled Master R workshop in Australia. Whether you live or work nearby or ...
[Read more...]

Free e-book: Data Science with SQL Server 2016

October 28, 2016 | 0 Comments

There's a new e-book available to download free from Microsoft Academy: Data Science with Microsoft SQL Server 2016. This 90-page e-book is aimed at data scientists who already have some experience in R, but want to learn how to use R wirth SQL Server. The book was written by some of ... [Read more...]

Comparing Symmetric Eigenvalue Performance

October 28, 2016 | 0 Comments

Lazy reader’s guide: skip to the pretty pictures, skim the conclusions section, ignore the rest. Background I think a lot about eigenvalue and singular value decompositions. I won’t get into it right now, but I have been quoted in the past as sa...
[Read more...]

Remote Processing Using Futures

October 28, 2016 | 0 Comments

A new version of the future package has been released and is available on CRAN. With futures, it is easy to write R code once, which later the user can choose to parallelize using whatever resources s/he has available, e.g. a local machine, a set of local notebooks, ...
[Read more...]

Assessing clustering tendency: A vital issue – Unsupervised Machine Learning

October 27, 2016 | 0 Comments

1 Required packages 2 Data preparation2.1 faithful dataset 2.2 Random uniformly distributed dataset 3 Why assessing clustering tendency? 4 Methods for assessing clustering tendency4.1 Hopkins statistic4.1.1 Algorithm 4.1.2 R function for computing Hopkins statistic 4.2 VAT: Visual Assessment of cluster Tendency4.2.1 VAT Algorithm 4.2.2 R functions for VAT 5 A single function for Hopkins statistic and VAT 6 Infos Clustering ... [Read more...]

A ‘tidyverse’ approach to simulation models

October 27, 2016 | 0 Comments

A ‘tidyverse’ approach to simulation models I am a big fan of Rstudio’s (mainly led by Hadley Wickham) series of R packages known as the ‘tidyverse’. The packages modernise much of R’s functionality to better deal with moderately large da... [Read more...]

Tidy Text Mining with R

October 27, 2016 | 0 Comments

I am so pleased to announce that tidytext 0.1.2 is now available on CRAN. This release of tidytext, a package for text mining using tidy data principles by Dave Robinson and me, includes some bug fixes and performance improvements, as well as some new ... [Read more...]
1 2 3 4 16

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)