2276 search results for "git"

PSID data set builder for R

April 14, 2013

Economists frequently use public datasets. One frequently used dataset is the Panel Study of Income Dynamics, PSID for short, maintained by the Institute for Social Research at the University of Michigan. I'm introducing psidR, which is a small helper packag...

Read more »

Using R — Working with Geospatial Data

April 14, 2013

This entry is part 12 of 12 in the series Using R. GIS, an acronym that brings joy to some and strikes fear in the heart of those not interested in buying expensive software. Luckily, fight or flight can be saved for another day because you …

Read more »

Tip: Julia vs. R – introduction videos and more

April 14, 2013

Last year's useR! 2012 conference in Nashville had an interesting discussion session titled “What other languages should R users know about?”. The general consensus was that multilingualism is inevitable in modern computing, and panel members presented various languages that complement R in different ways. Some of the usual suspects included SQL, Python, and of course C++, ...

Read more »

Continuing Sync

April 14, 2013

I am continuing with Sync: How Order Emerges from Chaos in the Universe, Nature, and Daily Life by Steven Strogatz. To get a feeling for it, I was building a group of things which have only a minute influence on each other yet are able to synchronize thei...

Read more »

Stan 1.3.0 and RStan 1.3.0 Ready for Action

April 12, 2013

The Stan Development Team is happy to announce that Stan 1.3.0 and RStan 1.3.0 are available for download. Follow the links on the Stan home page: http://mc-stan.org/

Please let us know if you have problems updating. Here’s the full set of release notes.

v1.3.0 (12 April 2013)
======================================================================

Enhancements
----------------------------------

Modeling Language
* forward sampling (random

Read more »

Extending RevoScaleR for Mining Big Data – Discretization

April 12, 2013

by Derek McCrae Norton, Senior Sales Engineer

In this second installment of Extending RevoScaleR for Mining Big Data we look at how to use the building blocks provided by RevoScaleR to transform continuous variables into discrete ones.

Motivation: Discretize continuous variables on big data. Discretization is a technique to convert continuous variables into discrete variables, and it is sometimes useful...

Read more »

Travis CI for R! (not yet)

April 12, 2013

A few days ago I wrote about Travis CI and wondered whether we could integrate the testing of R packages into this wonderful platform. A reader (Vincent Arel-Bundock) pointed out in the comments that Travis runs on Ubuntu, which allows you to install software packages at will.

I took a look at the documentation, and realized...

Read more »

Dropbox & R Data

April 11, 2013

I'm always looking for ways to download data from the internet into R. Though I prefer to host and access plain-text data sets (CSV is my personal favourite) from GitHub (see my short paper on the topic), sometimes it's convenient to get data stored on Dropbox.

There has been a change in the way Dropbox...

Read more »

Stepwise Regression for Big Data with RevoScaleR

April 11, 2013

by Joseph Rickert

In a recent blog post, Revolution's Thomas Dinsmore announced stepwise regression for big data as a new feature of Revolution R Enterprise 6.2 that is scheduled for general availability later this month. Today, I would like to provide a simple example of doing stepwise regression with rxLinMod() (the RevoScaleR analog of lm()), using a 100,000 row...

Read more »

A quick introduction to ggplot2

April 10, 2013

My friend Jonah asked me to guest lecture in his R seminar aimed at grad students and postdocs in Integrative Biology. I gave Jonah a bunch of topics ranging from reproducible research with R to data manipulation. The consensus was data visualization, so I put together a 2-hour talk and hands-on presentation for ggplot2 beginners.

Read more »