Simple data mining and plotting data on a map with ggplot2

April 2, 2012

In this post I use OpenStreetMap and ggplot2 to map where psychologists live, using data from a Facebook document.

Simple data mining and plotting data on a map with ggplot2

April 2, 2012

Introduction: A Facebook group for psychologists in Sweden created a document where each member could type in where they live and what they work with. At the moment, 192 psychologists have added their information to the document. This made me think about how to plot this information using R, and especially how to read data in this format. I was...

Replacing market indices

April 2, 2012

If equity markets suddenly sprang into existence now, would we create market indices? I’m doubtful. Why an index? The Dow Jones Industrial Average was born in 1896. This was when computers were humans with adding machines (but they did do parallel processing). At that point boiling “the market” down to a single number had value. …

Sunday evening, stupid games…

April 1, 2012

This evening, as I was about to wash the dishes, I heard my elders (call them Him and Her) starting a game. Him: "I have picked - in my head - a number, lower than 50. Try to guess..." Her: "No way, too difficult..." Him: "You can try five differ...

Missing Data Club

April 1, 2012

Welcome to Missing Data Club. There are only three rules. Rule #1 is: There is no missing data. Rule #2 is: THERE IS NO MISSING DATA! Rule #3: If you’ve never built a model using missing data – you must do it...

Quantitative finance and computational systems

April 1, 2012

I’m writing a book proposal based on the lecture notes for the R for Quants workshop I conducted at the …

A better way of saving and loading objects in R

April 1, 2012

Hadley Wickham (@hadleywickham) this week mentioned on Twitter his preference for saveRDS() over the more familiar save(). Being a new function to me, I thought I’d take a look… save() and load() will be familiar to many R users. They …
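
For readers who have not met saveRDS() yet, the difference from save() can be sketched roughly as follows. This is only an illustration and not code from the linked post: the object `fit`, the variable names and the temporary file paths are assumptions chosen for the example.

```r
# Hypothetical example object: a small linear model on a built-in dataset.
fit <- lm(mpg ~ wt, data = mtcars)

# saveRDS() serializes ONE object, without its name ...
rds_path <- tempfile(fileext = ".rds")
saveRDS(fit, file = rds_path)

# ... so readRDS() returns the object and the caller chooses the name.
fit_copy <- readRDS(rds_path)
same_coefs <- identical(coef(fit), coef(fit_copy))

# save() stores the object together with its name ...
rdata_path <- tempfile(fileext = ".RData")
save(fit, file = rdata_path)
rm(fit)

# ... so load() recreates `fit` in the calling environment and
# (invisibly) returns only the names of the restored objects.
restored_names <- load(rdata_path)
```

The practical upshot is that readRDS() cannot silently overwrite an existing object of the same name, whereas load() can.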

Julia, I Love You

March 31, 2012

Julia is a new language for scientific computing that is winning praise from a slew of very smart people, including Harlan Harris, Chris Fonnesbeck, Douglas Bates, Vince Buffalo and Shane Conway. As a language, it has lofty design goals, which, if attained, will make it noticeably superior to Matlab, R and Python for scientific programming.

Back to Blogging

March 31, 2012

If you’re subscribed to this blog, you’ve surely noticed the very long hiatus I’ve taken from writing over the last six months. I wish I’d kept up with blogging more faithfully this year, but, in my defense, I’ve been busy doing a few big things: I wrote a book with Drew Conway called Machine Learning

More on Philadelphia Homicide

March 31, 2012

I've been doing more analysis of the Philadelphia Homicide data that the Philadelphia Inquirer has published, and presented some of it at the Philadelphia UseR group yesterday. My slides and source are on GitHub. I should be clear tha...

Draw Your Breast with CloudStat – A R Apps (for fun)

March 31, 2012

This is a simple app called “Draw Your Breast with R”, created with R to generate breast-like graphics. With this app, you can change four parameters (Theta, Phi, Expand and Color) to generate graphics like...

Ggplot2, PubMed citation frequency and DSM-IV Axis I disorders by year

March 31, 2012

I searched PubMed for several major DSM-IV disorders and downloaded the hits. Using ggplot2 I plotted the number of publications each year for each disorder.

Ggplot2, PubMed citation frequency and DSM-IV Axis I disorders by year

March 31, 2012

Introduction: In an earlier post I used ggplot2 to plot publication count by year for different psychotherapy orientations. I'm using an updated version of that R code in this post to plot publication count for several major DSM-IV disorders. It's basically like a very simple discourse analysis. The search string: I searched for the following disorders: Depression, Bipolar disorder, Post-Traumatic Stress Disorder (PTSD), Attention-Deficit/Hyperactivity Disorder (ADHD), Obsessive-Compulsive Disorder (OCD),...

Playing with XML-Package: Get No. of Google Search Hits with R

March 30, 2012

GoogleHits <- function(input) { require(XML) require(stringr) require(RCurl) url

GBLUP example in R

March 30, 2012

Shirin Amiri was asking about GBLUP (genomic BLUP), and based on her example I set up the following R script to show how GBLUP works. Note that this is the so-called marker model, where we estimate allele substitution effects of the markers, and not the individual-based model, where genomic breeding values are inferred directly. The code: library(package="MatrixModels") dat <- data.frame(...

VIDEO: "R" Checking the reference values ("Y" Matrix).

March 30, 2012

(This article was first published on NIR-Quimiometría, and kindly contributed to R-bloggers)

See the wind

March 30, 2012

The image below isn't a bearskin rug in the shape of the USA. In fact, it's a visualization of the wind flowing over the United States, as of 4PM EDT today, March 30. You can click through to see the current wind conditions, based on the latest data from the National Digital Forecast Database. But more importantly, as long as...

By: Super Nerdy Cool » Build 32 bit R on 64 bit Ubuntu by utilizing chroot

route, it will suffice for now. With the chroot method, first compile the 64 bit version of R the usual way. For the 32 bit version of R,

R 2.15.0 "Easter Beagle" is released

March 30, 2012

The R core group today announced the availability of R 2.15.0, codenamed "Easter Beagle". If you build R yourself, the new source distribution (including packages for Debian Linux) is available now, and binaries for Windows, MacOS and Linux will be available from your local CRAN mirror over the next couple of days. (As of this writing, Windows binaries are...

Build 32 bit R on 64 bit Ubuntu by utilizing chroot

March 30, 2012

In the past, I’ve described how one could build multiarch (64 bit and 32 bit) versions of R on a 64 bit Ubuntu machine. The method based on this thread no longer works as of R 2.13 or 2.14, I believe. I received advice from someone on #R over on freenode (forgot who) a few...

Comparing Spectra with different math treatments

March 30, 2012

The quantity of graphics you can develop with R is amazing, as is the way you can show and manage them. Here, in the same plot, we compare the raw demo spectra with the same spectra treated with MSC, SNV and the first derivative (differences between consecu...

Slow R graphics via SSH

March 30, 2012

I do most of my R work on a server via SSH. Viewing graphics (e.g. plot()) can be excruciatingly slow through the SSH pipe, but it's easy to make it fast. Simply run X11.options(type="Xlib") once in the R session.

Maths can be cool (to impress your kids)

March 30, 2012

Just imagine that your kids need some help, to prepare fishes for April 1st, like Her: "please, Daddy, help us to draw some fishes" Me: "Sure, Daddy is a champion, actually, I do that everyday at work: drawing fishes - and more generally nice stuf...

Mapped: British, Spanish and Dutch Shipping 1750-1800

March 30, 2012

I recently stumbled upon a fascinating dataset which co

R 2.15.0 is released

March 30, 2012

Below is the announcement made by Peter Dalgaard: The build system rolled up R-2.15.0.tar.gz (codename “Easter Beagle”) at 9:00 this morning. This is the first release of the 2.15 series and contains several new features and changes; see the list below for details. You can get the source code from http://cran.r-project.org/src/base/R-2/R-2.15.0.tar.gz or wait for it to be mirrored at...

R 2.15.0 is released!

March 30, 2012

The new R 2.15.0 is out! You know the drill! Get the source code from here.

BoRdeaux (France): the first French R useR conference

March 30, 2012

The first French R useR conference will be held in Bordeaux on July 2-3. The Rencontres R are conceived as a place to present and share ideas on using the R statistical software. This meeting is designed to be a nationwide event covering various topics, such as graphical tools, applied statistics, biostatistics, Bayesian statistics,
