Calculate your nutrients with my new package: NutrientData

June 15, 2016

I have created a new package: NutrientData. This package contains data sets with the composition of foods: raw, processed, prepared. The source of the data is the USDA National Nutrient Database for Standard Reference, Release 28 (2015). The package also includes two functions to search and calculate nutrients. You can download it from GitHub: devtools::install_github("56north/NutrientData") Let's first...

Read more »

Iraq-Wikileaks Analysis with R

June 14, 2016

“In a place of extreme violence and devoid of order, the practical subsumes the principle. I drifted down the path of bribery and corruption endemic to the streets of Baghdad.” Jason Whiteley, Father of Money: Buying Peace in Baghdad. As I mentioned in a previous post, I wanted to explore the Wikileaks data of the US...

Read more »

jsonlite 0.9.22: distinguish between double and integer

June 14, 2016

Today a new version of the jsonlite package was released to CRAN. This update includes a few internal enhancements and one new feature. Doubles vs integers: The new always_decimal parameter forces formatting of doubles in decimal notation....

Read more »

R, Yelp and the Search for Good Indian Food – An Open Course

June 14, 2016

New Free Course by Springboard and DataCamp: Are all Yelp restaurant reviews created equal? Should we place greater trust in reviews made by people who know the cuisine well? How about reviews of ethnic restaurants by people of that ethnicity or reviews by seasoned Yelpers? We may not be able to find the perfect restaurant all the time,...

Read more »

Intro to The data.table Package

June 14, 2016

Data Frames: R provides a helpful data structure called the “data frame” that gives the user an intuitive way to organize, view, and access data. Many of the functions that you would use to read in external files (e.g. read.csv) or connect to databases (RMySQL) will return a data frame structure by default. While there

Read more »

Using Microsoft R Server on a single machine for experiments with 600 million taxi rides.

June 14, 2016

by Dmitry Pechyoni, Microsoft Data Scientist The New York City taxi dataset is one of the largest publicly available datasets. It has about 1.1 billion taxi rides in New York City. Previously this dataset was explored and visualized in a number of blog posts, where the authors used various technologies (e.g., PostgreSQL and Apache Elastic Search). Moreover, in a...

Read more »

githubinstall: New R Package for Easy to Install R Packages on GitHub

June 14, 2016

1. Overview: A growing number of R packages are created by various people around the world. Part of the reason is the devtools package, which makes it easy to develop R packages. The devtools package not only facilitates the process of developing R packages but also provides another way to distribute R packages. When developers...

Read more »

My knitr LaTeX template: manuscript and supplement interleaved in one source file

June 14, 2016

Most of the time between starting a manuscript and having it accepted after peer review is spent writing, re-writing and re-arranging content. In Word, keeping track of figure numbers is a big pain, even more so when figures are moved between the main ma...

Read more »

R Hero saves Backup City with archivist and GitHub

June 14, 2016

Have you ever suffered because of the impossibility of reproducing graphs, tables or analysis’ results in R? Have you ever bothered yourself for not being able to share R objects (i.e., plots or final analysis models) within your reports, posters or articles? Or maybe simply you have too many objects you can’t manage to store in a convenient...

Read more »

Le Monde puzzle [#965]

June 13, 2016

A game-related Le Monde mathematical puzzle: Starting with a pile of 10⁴ tokens, Bob plays the following game: at each round, he picks one of the existing piles with at least 3 tokens, takes away one of the tokens in this pile, and separates the remaining ones into two non-empty piles of arbitrary size. Bob

Read more »

8 new R jobs from all over the world (2016-06-13)

June 13, 2016

Here are the new R Jobs for 2016-06-13. To post your R job on the next post, just visit this link and post a new R job to the R community. You can either post a job for free, or pay $50 to have your job featured. Current R jobs: Job seekers, please follow the links below to learn more and apply for your R job of interest:...

Read more »

R holds top ranking in KDnuggets software poll

June 13, 2016

The open-source R language is the most frequently used analytics / data science software, selected by 49% of the 2895 voters of the 2016 KDnuggets Software Poll. (R was also the top selection in last year's poll.) Python was a close second at 45.8%, and SQL was third at 35.5%. (Respondents could select multiple tools in the poll, and...

Read more »

tidyr 0.5.0

June 13, 2016

I’m pleased to announce tidyr 0.5.0. tidyr makes it easy to “tidy” your data, storing it in a consistent form so that it’s easy to manipulate, visualise and model. Tidy data has a simple convention: put variables in the columns and observations in the rows. You can learn more about it in the tidy data

Read more »

Open to Non-Conference Attendees – R Workshops at EARL 2016

June 13, 2016

EARL is a conference for users and developers of the open source R programming language. The primary focus of the conference is the commercial usage of R across a range of industry sectors with the aim of sharing knowledge and …

Read more »

Presentation slides on using graphics

June 13, 2016

Last week I gave a seminar for around 40 analysts from another government agency on using graphics to represent data. In doing such presentations, I usually focus on different purposes of graphics: exploratory; as part of the analysis workflow (e.g. as diagnosis for statistical models); for presenting results. Exactly what the purpose is makes quite a difference to...

Read more »

R for Publication by Page Piccinini: Lesson 4 – Multiple Regression

June 13, 2016

Introduction Today we’ll see what happens when you have not one, but two variables in your model. We will also continue to use some old and new dplyr calls, as well as another parameter for our ggplot2 figure. I’ll be taking for granted some of the set-up steps from Lesson 1, so if you haven’t done Lesson 4: Multiple...

Read more »

Germany most likely to win Euro 2016

June 13, 2016

After World Cup 2014 we finally are facing the next spectacular football event now: Euro 2016. With billions of football fans spread all over the world, football still seems to be the single most popular sport. Might have something to do with the fact that football is a game of underdogs: David could beat Goliath any day. Just take

Read more »

Manhattanly: R package for Interactive Manhattan Plots

June 13, 2016

Introduction The new R package, manhattanly, creates interactive manhattan plots using the plotly.js engine. The plots are usable from the R console, the RStudio viewer pane, R Markdown documents, in Shiny apps, embeddable in websites and can be exported as .png files. By hovering the mouse over a point, you can see annotation information such

Read more »

Risk Models with Generalized PLS

June 12, 2016

While developing risk models with hundreds of potential variables, we often run into the situation that risk characteristics or macro-economic indicators are highly correlated, namely multicollinearity. In such cases, we might have to drop variables with high VIFs or employ “variable shrinkage” methods, e.g. lasso or ridge, to suppress variables with collinearity. Feature extraction approaches

Read more »

How to Cut Your Planks with R

June 12, 2016

Today I’m extraordinarily pleased because today I solved an actual real-world problem using R. Sure, I’ve solved many esoteric statistical problems with R, but I’m not sure if any of those solutions have escaped the digital world and made some impact ex silico. It is now summer and in Sweden that means that many people tend to overhaul...

Read more »

On Whether Y-axis Labels Are Always Necessary

June 12, 2016

The infamous @albertocairo blogged about a nice interactive piece on German company tax avoidance by @ProPublica. Here’s a snapshot of their interactive chart: Dr. Cairo (his PhD is in the bag as far as I’m concerned :-) posited: Isn’t it weird that the chart doesn’t have a scale on the Y-axis? It’s not the first...

Read more »

Exploring the Half-Life Property of K

June 11, 2016

A colleague recently questioned whether log_e(2)/K, where K is the Brody growth coefficient in the typical parameterization of the von Bertalanffy growth function, represents the “amount of time it takes for a fish to grow from any length to a length halfway between the initial length and the asymptotic mean length (Linf)”. This phenomenon is briefly illustrated...

Read more »
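
The half-life claim in the teaser above can be checked with a short derivation. This is only a sketch, assuming the common form of the von Bertalanffy growth function with a hypothetical time offset t_0 (the teaser names K and Linf but does not spell out the equation), and it is not taken from the linked post:

```latex
% Assumed parameterization: L(t) = L_\infty (1 - e^{-K (t - t_0)})
\begin{align*}
L_\infty - L(t) &= L_\infty \, e^{-K (t - t_0)} \\
L_\infty - L(t + \Delta) &= \bigl(L_\infty - L(t)\bigr) \, e^{-K \Delta} \\
\Delta = \frac{\log_e 2}{K}
  \;&\Rightarrow\;
  L_\infty - L(t + \Delta) = \tfrac{1}{2} \bigl(L_\infty - L(t)\bigr) \\
  \;&\Rightarrow\;
  L(t + \Delta) = \frac{L(t) + L_\infty}{2}
\end{align*}
```

Under that assumption the remaining distance to Linf halves every log_e(2)/K time units, regardless of the starting length.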

Is Online AD Space a Commodity?

June 11, 2016

I just started reading Alvin Roth’s book “Who gets What - And why?” and it already got me thinking. The book discusses the principles of markets and market design using various examples. One starting point is the transition of markets into being commodity markets. Simplified; in a commodity market all products sold are equal (think: stock markets), hence only the...

Read more »

Exploring Spatial Patterns and Coexistence

June 11, 2016

Today is a rainy day and I had to drop my plans for going out hiking; instead I continued reading “Self-Organization in Complex Ecosystems” by Richard Solé and Jordi Bascompte. As I will be busy in the coming weeks with spatial models at the iDiv summer school, I was closely reading chapter three on spatial

Read more »

Balancing a centrifuge

June 11, 2016

I saw this cute little paper on arXiv about balancing a centrifuge: Peil & Hauryliuk (2010) A new spin on spinning your samples: balancing rotors in a non-trivial manner. Let us have a look at the maths of balancing a centrifuge. The way I think most people (including myself) balance their samples is to put

Read more »

useResearch – Usage Analytics for R Functions, Pt.1

June 11, 2016

useResearch This is the main part of the proposal Tyler Rinker and I submitted to the first ISC call for proposals by the R-Consortium. Our next post will describe useResearch: the solution we ended up building, despite not getting funded. Some details of the how part of our proposal have evolved in our development of...

Read more »

Visualize completeness of biodiversity data

June 10, 2016

Package bdvis: Biodiversity data visualizations using R is helpful for understanding the completeness of a biodiversity inventory, the extent of geographical, taxonomic and temporal coverage, and gaps and biases in data. Package bdvis version 0.2.6 is on CRAN now. This version has several features added since version 0.1.0. I plan to post a set of blog entries here to describe

Read more »

Interactive maps and charts in R

June 10, 2016

Randy George, an expert in web map applications, has been fascinated with computer graphics (especially maps) since the early '80s. For much of that time, he says, the technology for mapping has been pretty static: The mainstay of web mapping applications for the last couple of decades has been three tier: Model – SQL, View – web UI,...

Read more »

Radial bar charts in R using Plotly

June 10, 2016

Creating a radial bar chart is fairly easy using plotly. In this post we’ll focus on modifying a radial line chart to make it look like a bar chart, to come up with a nice visualization for CO2 emissions. The visualization is inspired by this awesome chart.

Read more »
