Make your R plots interactive

February 11, 2015
By
Make your R plots interactive

As a part of my daily job, I draw scatterplots, lots of them. And because there are thousands of genes expressed in any mouse or human tissue, my typical plot looks something like this (code). (Actually, it is a comparison of variance that can be attri...

Read more »

Code as Magic, and the Vernacular of Data Wrangling Verbs

February 11, 2015
By
Code as Magic, and the Vernacular of Data Wrangling Verbs

It’s been some time now since I drafted most of my early unit contributions to the TM351 Data management and analysis course. Part of the point (for me) in drafting that material was to find out what sorts of thing we actually wanted to say and help identify the sorts of abstractions we wanted to

Read more »

EdinbR: A new R usergroup for Edinburgh

February 11, 2015
By
EdinbR: A new R usergroup for Edinburgh

Inspired by succesful RUGs like LondonR and CambR, I’m pleased to announce a new R usergroup for those in and around Edinburgh: EdinbR! Edinburgh has a large research community using R, spread across different campuses and even universities so a … Continue reading →

Read more »

Scheduling R Tasks via Windows Task Scheduler

February 10, 2015
By
Scheduling R Tasks via Windows Task Scheduler

This post will allow you to impress your boss with you’re strong work ethic by enabling Windows R users to schedule late night tasks.  Picture it, your boss gets an email at 1:30 in the morning with the latest company … Continue reading →

Read more »

More on Quadratic Progarmming in R

February 10, 2015
By
More on Quadratic Progarmming in R

This post is another tour of quadratic programming algorithms and applications in R. First, we look at the quadratic program that lies at the heart of support vector machine (SVM) classification. Then we'll look at a very different quadratic programming demo problem that models the energy of a circus tent. The key...

Read more »

Bash+R: howto pass parameters from bash script to R

February 10, 2015
By

From original post @ http://analyticsblog.mecglobal.it/

Read more »

Autocommit with ceODBC is slow

February 10, 2015
By
Autocommit with ceODBC is slow

You already know that a SQL INSERT is faster with bulk loading than inserting a record at a time, but what about the effect of autocommit on performance? While this is probably not specific to ceODBC, using autocommit is astonishingly slow. Here is how slow. First, the Python code to run the benchmark:import ceODBCimport datetimeimport osimport timeconnection_string="driver=sql...

Read more »

BayesFactorExtras: a sneak preview

February 10, 2015
By

Felix Schönbrodt and I have been working on an R package called BayesFactorExtras. This package is designed to work with the BayesFactor package, providing features beyond the core BayesFactor functionality. Currently in the package are:Sequential Bayes factor plots for visualization of how the Bayes factor changes as data come in: seqBFplot()Ability to embed R objects directly...

Read more »

Geomorph To Feature At SVP 2015

February 10, 2015
By
Geomorph To Feature At SVP 2015

Calling Palaeontology-based Morphometricians,I am pleased to announce that myself, along with Aki Watanabe and Marc Jones will be leading a Symposium and associated workshop on Geometric Morphometrics in Paleontology at the Annual Meeting for the Society of Vertebrate Paleontology in Dallas Texas October 2015.Podium Symposium: The Shape of Things to Come: Geometric Morphometrics...

Read more »

8 new R jobs (2015-02-10)

February 10, 2015
By
8 new R jobs (2015-02-10)

This is the bimonthly R Jobs post (for 2015-02-10), based on the R-bloggers’ sister website: R-users.com. If you are an employer who is looking to hire people from the R community, please visit this link to post a new R job (it’s free, and registration takes less than 10 seconds). If you are a job seekers, please follow the links below to learn more and apply for your job of interest (or visit previous...

Read more »

Enhancing R for Distributed Computing

February 10, 2015
By
Enhancing R for Distributed Computing

A summary of a recent workshop at HP Labs addressed “Distributed Computing in R”

Read more »

rcrunchbase – An API Interface to CrunchBase

February 10, 2015
By

James Peruvankal Sr. Program Manager, Revolution Analytics Information about the technology business ecosystems is valuable to both established companies as well as startups. Fortunately CrunchBase - the world’s most comprehensive dataset of startup activity, captures quite a bit of such information. Founded in 2007 by Mike Arrington, CrunchBase began as a simple crowd-sourced database to track startups covered on...

Read more »

Why you should start by learning data visualization and manipulation

February 10, 2015
By

One of the biggest issues that comes up when I talk to people who want to get started learning data science is the following: I don’t know where to get started! Recently, I argued that R is the best programming language to learn when you’re getting started with data science. While this helps you select The post

Read more »

Mixing Waves

February 10, 2015
By
Mixing Waves

Fill a cocktail shaker with ice; add vodka, triple sec, cranberry, and lime, and shake well; strain into a chilled cocktail glass and garnish with orange twist (Cosmopolitan Cocktail Recipe) This is a tribute to Blaise Pascal and Joseph Fourier, two of the greatest mathematicians in history. As Pascal did in his famous triangle, I generate a set of … Continue reading...

Read more »

[ggtree] updating a tree view using %<% operator

February 9, 2015
By
[ggtree] updating a tree view using %<% operator

I am very exciting that I have received very positive feedback from Ahmed Moustafa and Simon Frost. Read More: 401 Words Totally

Read more »

RPushbullet 0.2.0

February 9, 2015
By

A new releases of the RPushbullet package (interfacing the neat Pushbullet service) arrived on CRAN today. It brings several weeks of extensions, corrections and cleanups---with key contributions by Mike Birdgeneau and Henrik Bengtsson. RPushbullet n...

Read more »

What to expect from Strata Conference 2015? An empirical outlook.

February 9, 2015
By
What to expect from Strata Conference 2015? An empirical outlook.

In one week, the 2015 edition of Strata Conference (or rather: Strata + Hadoop World) will open its doors to data scientists and big data practitioners from all over the world. What will be the most important big data technology trends for this year? As last year, I ran an analysis on the

Read more »

In case you missed it: January 2015 roundup

February 9, 2015
By

In case you missed them, here are some articles from January of particular interest to R users. Slides on reproducible data analysis with Revolution R Open and the checkpoint package. A review of a recent Bay Area R User Group meetup, featuring Hadley Wickham, Ryan Hafen and Nick Elprin. In an article at opensource.com, I explain why now is...

Read more »

PELTing a Competing Changepoint Algorithm

February 9, 2015
By
PELTing a Competing Changepoint Algorithm

This post will demonstrate the PELT algorithm from the changepoint package–a competing algorithm to the twitter package’s breakout detection algorithm. … Continue reading →

Read more »

MissData 2015 in Rennes [June 18-19]

February 9, 2015
By
MissData 2015 in Rennes [June 18-19]

This (early) summer, a conference on missing data will be organised in Rennes, Brittany, with the support of the French Statistical Society . (Check the website if interested, Rennes is a mere two hours from Paris by fast train.)Filed under: R, ...

Read more »

CFP: 13th Australasian Data Mining Conference (AusDM 2015)

February 9, 2015
By
CFP: 13th Australasian Data Mining Conference (AusDM 2015)

The 13th Australasian Data Mining Conference (AusDM 2015) Sydney, Australia, 8-9 August 2015 co-located with SIGKDD’15 URL: http://ausdm15.ausdm.org/ Join us on LinkedIn: http://www.linkedin.com/groups/AusDM-4907891 The Australasian Data Mining Conference has established itself as the premier Australasian meeting for both practitioners and … Continue reading →

Read more »

Morse Code Converter

February 8, 2015
By
Morse Code Converter

A few months ago, I finally got a chance to see The Imitation Game (the new Alan Turing movie), which gave me an idea for a Sunday morning R hacking session. The movie features a bunch of scenes with bustling rooms full or workers intercepting (and documenting) encrypted radio transmissions, which are then passed along...

Read more »

Pan-sharpening Using R

February 8, 2015
By
Pan-sharpening Using R

In my previous post I described how to perform pan-sharpening using OrfeoToolbox and QGIS. This time I will show you how to do this in R. At the bottom you will find several functions I wrote on top of the 'raster' package that allow a convenient pan-sharpening in R.MotivationYou may wonder why I even...

Read more »

Chapter 3 of Modeling data with functional programming in R is out

February 8, 2015
By
Chapter 3 of Modeling data with functional programming in R is out

Chapter 3 of my book “Modeling data with functional programming in R” is available for download. This chapter describes map-vectorization …Continue reading →

Read more »

UIUC free online courses on data mining starting on 9 Feb, lectured by Prof. Jiawei Han et al.

February 8, 2015
By
UIUC free online courses on data mining starting on 9 Feb, lectured by Prof. Jiawei Han et al.

by Yanchang Zhao, RDataMining.com A series of free online data mining courses will start on 9 Feb 2015, lectured by Prof. Jiawei Han and several other staff at UIUC. Prof. Han is one of the top data mining researchers around … Continue reading →

Read more »

Hierarchical log odds model example

February 8, 2015
By
Hierarchical log odds model example

I am working through Bayesian Approaches to Clinical Trials and Health-Care Evaluation (David J. Spiegelhalter, Keith R. Abrams, Jonathan P. Myles) (referred to as BACTHCE from here on). In chapter three I saw an example (3.13) where I wanted to d...

Read more »

Contingency Tables with gmodels in R

February 7, 2015
By
Contingency Tables with gmodels in R

Contingency tables provide a way to display the frequencies and relative frequencies of observations, which are classified according to two categorical variables. The elements of one category are displayed across the columns; the

Read more »

rfoaas 0.1.3

February 7, 2015
By

A brand new version of rfoaas is now on CRAN. It shadows the 0.1.3 release of FOAAS just how an earlier 0.1.2 had done (but there was something not quite right at the server backend which we coded around with an interim release 0.1.2.1; neither one of these was ever released to CRAN). The rfoaas package provides...

Read more »

drat Tutorial: First Steps towards Lightweight R Repositories

February 7, 2015
By

Now that drat is on CRAN and I got a bit of feedback (or typo corrections) in three issue tickets, I thought I could show how to quickly post such an interim version in a drat repository. Now, I obviously already have a checkout of drat. If you, dear reader, wanted to play along and create your...

Read more »