The R Foundation announces new mailing list ‘R-package-devel’

May 22, 2015
By

At last week’s monthly meeting, the R foundation has decided to create a new mailing list in order to help R package authors in their package development and testing. The idea is that some experienced R programmers (often those currently helping on R-devel or also R-help) will help package authors and thus unload some of the burden of the...

Read more »

Exact computation of sums and means

May 21, 2015
By
Exact computation of sums and means

A while ago, I came across a mention of the Python math.fsum function, which sums a set of floating-point values exactly, then rounds to the closest floating point value. This seemed useful. In particular, I thought that if it’s fast enough it could be used instead of R’s rather primitive two-pass approach to trying to compute

Read more »

BH release 1.58.0-1

A new released of BH is now on CRAN. BH provides a large part of the Boost C++ libraries as a set of template headers for use by R and Rcpp. This release both upgrades the version of Boost to the current release, and adds a new library: Boost MultiPr...

Read more »

Vega.jl Rebooted – Now with 100% More Pie and Donut Charts!

May 21, 2015
By
Vega.jl Rebooted – Now with 100% More Pie and Donut Charts!

          Mmmmm, chartjunk! Rebooting Vega.jl Recently, I’ve found myself without a project to hack on, and I’ve always been interested in learning more about browser-based visualization. So I decided to revive the work that John Myles White had done in building Vega.jl nearly two years ago. And since I’ll be giving

Read more »

First Day Highlights from the Extremely Large Databases Conference

May 21, 2015
By
First Day Highlights from the Extremely Large Databases Conference

by Joseph Rickert The 8th XLDB (Extremely Large Databases) Conference open at Stanford on Tuesday with an outstanding program. This conference has been providing leadership in the "Big Data" world since its first workshop which was held in 2007. For example, the summary report for that year notes: "Both communities (industry and science) are moving towards parallel ... architectures...

Read more »

Introductory Point Pattern Analysis of Open Crime Data in London

May 21, 2015
By
Introductory Point Pattern Analysis of Open Crime Data in London

IntroductionPolice in Britain (http://data.police.uk/) not only register every single crime they encounter, and include coordinates, but also distribute their data free on the web.They have two ways of distributing data: the first is through an API, which is extremely easy to use but returns only a limited number of crimes for each request, the second is...

Read more »

Course on using Oracle R Enterprise

Course on using Oracle R Enterprise

BNOSAC will be giving from June 08 up to June 12 a 5-day crash course on the use of R using Oracle R Enterprise. The course is given together with our Oracle Partner in Leuven, Belgium. If you are interested in attending, contact us for further details. For R users who aren't aware of this yet....

Read more »

How To Analyze Data: 21 Graphs that Explain the Same-Sex Marriage Case, Public Opinion, & Supreme Court

May 21, 2015
By
How To Analyze Data: 21 Graphs that Explain the Same-Sex Marriage Case, Public Opinion, & Supreme Court

The nine Justices on the United States Supreme Court recently took up a case about same-sex marriage. The question in Obergefell v. Hodges is whether states are required to license and recognize marriages between two people of the same sex. This post e...

Read more »

RInside 0.2.13

A new release 0.2.13 of RInside is now on CRAN. RInside provides a set of convenience classes which facilitate embedding of R inside of C++ applications and programs, using the classes and functions provided by Rcpp. This release works around a bug i...

Read more »

Open soure software has changed the way we do business

May 20, 2015
By

Earlier this month TechCrunch published an article of mine, "The Business Economics And Opportunity Of Open-Source Data Science". With this article I wanted to share how open-source software has disrupted the economics of doing business, now that data is a fundamental component of every businesses' operations. Open source projects like Hadoop and R, coupled with commodity hardware, have fundamentally...

Read more »

Teaching R course? Use analogsea to run your customized RStudio in Digital Ocean!

May 20, 2015
By
Teaching R course? Use analogsea to run your customized RStudio in Digital Ocean!

Two years ago I taught an introductory R/Shiny course here at The Jackson Lab. We all learnt a lot. Unfortunately not about Shiny itself, but rather about incompatibilities between its versions and trouble with its installation to some machines.And it ...

Read more »

Put Google Scholar citations on your personal website with R, scholar, ggplot2 and cron

May 20, 2015
By
Put Google Scholar citations on your personal website with R, scholar, ggplot2 and cron

I have been looking for a solution to put my Google Scholar citations on my personal website for quite some time now. Some apps/gadgets seem to have existed to do so (e.g. citations-gadget), but none seem to be functioning today because of chan...

Read more »

Are Canadian newspapers painting false pictures with data?

May 20, 2015
By
Are Canadian newspapers painting false pictures with data?

The Canadian newspaper, Globe and Mail, is a leader in diction and style, but it may need improvement in the ‘grammar of graphics’.Globe’s recent depiction of metropolitan economic growth in the series Off the Charts was way off the mark. The chart plotted the current and forecasted GDP growth rates for select cities in Canada. The...

Read more »

First Steps with Structural Equation Modeling

May 20, 2015
By
2015-05-21 23_11_54-Clipboard

Last Friday at the Davis R Users’ Group, Grace Charles gave a presentation on structural equation modeling in R using the Lavaan package. Here’s the video and her slides. We’ve also posted Grace’s script from the presentation as a gist here. More resources that Grace mentioned in her talk below. Resources Jim Grace’s website...

Read more »

Databases – an Ideal Application for R6?

May 20, 2015
By

Over the years I have tried to simplify and streamline my access to financial historic data. All different solutions I tried (see here, for example) so far have been unsatisfactory, at least to some degree. That however changed after I started using R6. Here is an example of using the R6 class for the same The post

Read more »

More postdoctoral opportunities at IARC

May 20, 2015
By
More postdoctoral opportunities at IARC

If you want to want to work with R and JAGS on a hard multivariate measurement error problem in nutritional epidemiology, then take a look at this advert. http://www.iarc.fr/en/vacancies/postdoc/postdoc_NEP_June2015.pdf IARC is based in Lyon, France. For informal queries approach me … Continue reading →

Read more »

Clusters Powerful Enough to Generate Their Own Subspaces

May 20, 2015
By
Clusters Powerful Enough to Generate Their Own Subspaces

Cluster are groupings that have no external label. We start with entities described by a set of measurements but no rule for sorting them by type. Mixture modeling makes this point explicit with its equation showing how each measurement is an independe...

Read more »

Benchmarking Random Forest Implementations

May 19, 2015
By
Benchmarking Random Forest Implementations

I currently have the need for machine learning tools that can deal with observations of...

Read more »

Fast parallel computing with Intel Phi coprocessors

May 19, 2015
By
Fast parallel computing with Intel Phi coprocessors

by Andrew Ekstrom Recovering physicist, applied mathematician and graduate student in applied Stats and systems engineering We know that R is a great system for performing statistical analysis. The price is quite nice too ;-) . As a graduate student, I need a cheap replacement for Matlab and/or Maple. Well, R can do that too. I’m running a large...

Read more »

TidyR Challenge: Data.Table Solution

May 19, 2015
By

Arun Srinivasan is the man! Once he saw that his data.table solution to the TidyR Challenge had an issue, he fixed it! His solution is below along with a quick equivalence test to my original solution, and check out this stackOverflow question for a...

Read more »

[R] Kenntnis-Tage 2015: The first holistic R Event in the German-speaking community

May 19, 2015
By
[R] Kenntnis-Tage 2015: The first holistic R Event in the German-speaking community

The event aims at connecting R users from all industrial and academic sectors and facilitating the interconnectedness of the German-speaking R community as well as the exchange of practical and theoretical knowledge about R. From September 29 to October 02 the Kenntnis-Tage 2015 with topics ranging from big data over programming and data mining

Read more »

Bio7 2.1 for Linux 64-bit and Windows 32-bit released

May 19, 2015
By
Bio7 2.1 for Linux 64-bit and Windows 32-bit released

19.05.2015 After the 64-bit release of Windows the Linux 64-bit and Windows 32-bit release can be downloaded at: http://bio7.org Linux installation: The installation of Bio7 is similar to the installation of the Eclipse environment. Simply decompress the downloaded *.zip file in a preferred location on your file system. After decompressing with a standard zip-tool (like

Read more »

Unsupervised Google Maps image classification

May 19, 2015
By
Unsupervised Google Maps image classification

This is a guest post by Florian Detsch Prerequisites Required packages First, we need to (install and) load some packages required for data processing and visualization. The below code is mainly based on the Rsenal package, which is a steadily … Continue reading →

Read more »

Interactive charts in R

May 19, 2015
By
Interactive charts in R

I’m giving a talk tomorrow at the Edinburgh R usergroup (EdinbR) on how to get started building interactive charts in R. I’ll talk about rCharts as a great general entry point to quickly generating interactive charts, and also the newer htmlwidgets … Continue reading →

Read more »

How Predictable is the English Premier League?

May 19, 2015
By
How Predictable is the English Premier League?

The reason why football is so exciting is uncertainty. The outcome of any match or league is unknown, and you get to watch the action unfold without knowing what’s going to happen. Watching matches where you know the score is never exciting. This weekend the English Premier League season will conclude with little fanfare. Bar

Read more »

Posterior predictive output with Stan

May 18, 2015
By
Posterior predictive output with Stan

I continue my Stan experiments with another insurance example. Here I am particular interested in the posterior predictive distribution from only three data points. Or, to put it differently I have a customer of three years and I'd like to predict the expected claims cost for the next year to set or adjust the premium.The example is taken...

Read more »

Query Multiple Google Analytics View IDs with R

May 18, 2015
By
Query Multiple Google Analytics View IDs with R

Extracting Google Analytics data from one website is pretty easy, and there are several options to do it quickly. But what if you need to extract data from multiple websites or, to be more precise, from multiple Views? And perhaps you also need to summ...

Read more »

Basic text string functions in R

May 18, 2015
By

To get the length of a text string (i.e. the number of characters in the string): Using length() would just give you the length of the vector containing the string, which will be 1 if the string is just a single string. To get the position of a regular expression match(es) in a text string

Read more »

Bio7 2.1 for Windows 64 bit released

May 18, 2015
By
Bio7 2.1 for Windows 64 bit released

18.05.2015 I released a new version of the Integrated Development Environment Bio7 with new functions and many visual layout improvements of the Bio7 Graphical User Interface. Bio7 2.1 can be downloaded here: http://bio7.org Release notes Bio7 2.1: R: Updated R to version 3.2.0. Improved the R perspective layout (see below). Improved the syntax coloring and

Read more »