Analyzing the US elections with Facebook and R

July 11, 2016
By
Analyzing the US elections with Facebook and R

Hillary Clinton and Donald Trump seem to be the nominees for the upcoming US presidential election in November 2016. The US presidential election in five months provokes already harsh attacks between the nominees – the election campaigns of Trump und Clinton evolve to one of the toughest competitions in the recent history of US presidential elections The post

Read more »

Big Data Insights: Tale of IT Investments and Returns

July 11, 2016
By
Big Data Insights: Tale of IT Investments and Returns

Once again, this post brings forth to the audience, a predictive analytical insight from huge volumes of information technology security data belonging to two fortune 500 companies (more or less having similar characteristics). Going to a quick backgro...

Read more »

Time series charts by the Economist in R using Plotly

July 11, 2016
By

In this post we’ll recreate two info graphics created by The Economist. The code uses the new Plotly 4.0 syntax. Note: Plotly 4.0 has not been officially released yet. You can download the dev version using Volume of google searches related to immigrating to Canada AIDS related Visualization

Read more »

Introducing the free Microsoft R Client

July 11, 2016
By

Over the years, we've shared several posts on using the ScaleR package to import, process, visualize and analyze large data sets with R. Until now, you needed to have access to a Microsoft R Server license to take advantage of the package. Now, you can use all of the capabilities of ScaleR free of charge with Microsoft R Client...

Read more »

Rcpp now used by over 700 CRAN packages

July 11, 2016
By
Rcpp now used by over 700 CRAN packages

Earlier this morning, Rcpp reached another milestone: 701 packages on CRAN now depend on it (as measured by Depends, Imports and LinkingTo declarations). The graph is on the left depicts the growth of Rcpp usage over time. Rcpp cleared 300 packages ...

Read more »

The Power of ggplot2 in ArcGIS – The Plotting Toolbox

July 11, 2016
By
The Power of ggplot2 in ArcGIS – The Plotting Toolbox

In this post I present my third experiment with R-Bridge. The plotting toolbox is a plug-in for ArcGIS 10.3.x that allows the creation of beautiful and informative plot, with ggplot2, directly from the ESRI ArcGIS console.As always I not only provide t...

Read more »

CRAN Packages on GitHub (and some CRAN DESCRIPTION observations)

July 10, 2016
By

Just about a week ago @thosjleeper posited something on twitter w/r/t how many CRAN packages had associations with GitHub (i.e. how many used GitHub for development). The DESCRIPTION file (that comes with all R packages) has some fields that can house this information and most folks who do use GitHub for development of R seem... Continue reading →

Read more »

Creating color palettes in R

July 10, 2016
By
Creating color palettes in R

In the R post, we will present how to create your own color palettes and how to work with other palettes such as RColorBrewer, wesanderson and hex codes from www.colorcombos.com for exciting color palettes.

Read more »

Submitting your first package to CRAN, my experience

July 8, 2016
By
Submitting your first package to CRAN, my experience

I recently published my first R package to The Comprehensive R Archive Network (CRAN). It was very exciting and also quite easy. Let me walk you through my process. First a description of my brand new package: badgecreatr, then a description of steps to take for submission. Package description When you go around github looking at projects you often see...

Read more »

dplyr do: Some Tips for Using and Programming

July 8, 2016
By

If you want to compute arbitrary operations on a data frame returning more than one number back, use dplyr do()! Tips and suggestions, in SE and NSE version. The post dplyr do: Some Tips for Using and Programming appeared first on MilanoR.

Read more »

The Mathematics of Machine Learning

July 8, 2016
By
The Mathematics of Machine Learning

This post was first published on my Linkedin page and posted here as a contributed post. In the last few months, I have had several people contact me about their enthusiasm for venturing into the world of data science and using Machine Learning (ML) techniques to probe statistical regularities and build impeccable data-driven products. However, I’ve observed that some actually lack...

Read more »

Best way to draw heatmap for publication

July 8, 2016
By

Here are two tips I can share if you were also working on a big dataset towards a high quality heatmap:1. Don't generate PDF using pheatmap() or heatmap.2() as (i) the file is unnecessarily SUPER large if you have a lot of data points in the heatmap, s...

Read more »

In case you missed it: June 2016 roundup

July 8, 2016
By

In case you missed them, here are some articles from June of particular interest to R users. A preview of the tutorials presented at the useR! 2016 conference. A "advanced beginner's" guide to R published by ComputerWorld includes guides on data wrangling, visualization, and data APIs. Microsoft R Server now runs on Apache Spark, bringing high performance to big-data...

Read more »

Time Averages of NetCDF files from ECMWF in ArcGIS with R-Bridge

July 8, 2016
By
Time Averages of NetCDF files from ECMWF in ArcGIS with R-Bridge

With this post I would like to talk again about R-Bridge, which allows a direct communication between ArcGIS and R.In the previous post, I presented a very simple application of R-Bridge where I built a toolbox to perform k-means clustering on point sh...

Read more »

Out now: Agenda of the [R] Kenntnis-Tage 2016

July 8, 2016
By
Out now: Agenda of the [R] Kenntnis-Tage 2016

Data science goes professional: On November 2 and 3, the Kenntnis-Tage 2016 offer valuable insights into diverse application examples of the programming language R to users, developers, administrators and IT decision makers. Now the program has been released. In the age of Internet of Things, data has long become the new currency for the … „Out now:...

Read more »

Obtaining package performance data using Rperform

July 8, 2016
By
Obtaining package performance data using Rperform

“In God we trust. All others must bring data.” – W. Edwards Deming In a previous post, I had discussed how Rperform uses the grammar of graphics approach to visualize an R package’s performance in terms of runtime and memory usage. The visualizations contribute significantly towards Rperform’s mission to allow package developers to quantify, analyze and visualize performance. However, at...

Read more »

Introducing Badgecreatr, a package that places badges in your readme

July 7, 2016
By
Introducing Badgecreatr, a package that places badges in your readme

Introducing Badgecreatr, a package to create and place badges in your readme.Rmd file on Github. Badgecreatr will create the following badges (aka shields): minimal R...</p><p><a href=Read more »

Bad Coder, Bad Coder!

July 7, 2016
By
Bad Coder, Bad Coder!

My title here is in the sense of “Bad dog, bad dog!”, a scolding I sometimes see dog owners use to tame their pets, and is also an allusion to Bad Reporter, a sometimes hilarious and always irreverent political comic strip in the San Francisco Chronicle. And my title is intended to convey the point … Continue reading...

Read more »

Incase you missed it: My Talk at the United Nations Global Pulse Workshop

July 7, 2016
By
Incase you missed it: My Talk at the United Nations Global Pulse Workshop

In case you missed my talk at the 2016 Data Science Africa Workshop organized by the United Nations Global Pulse Lab, here is the recording. My talk was titled “Sustainable Urban Transport Planning using Big Data from Mobile Phones“. You can download slides for my talk from here. There were also talks from my colleagues at IBM

Read more »

Express dplyr Part II

July 7, 2016
By
Express dplyr Part II

This is Part II of the “Express dplyr” posting. If you haven’t already you might want to review Part I of this topic before proceeding although if you have some knowledge of dplyr then by all means proceed. Don’t worry – there are no tests ! In this second part we will work with some

Read more »

Bridging The Political [Polygons] Gap with ggplot2

July 7, 2016
By
Bridging The Political [Polygons] Gap with ggplot2

The @pewresearch folks have been collecting political survey data for quite a while, and I noticed the visualization below referenced in a Tableau vis contest entry: Those are filled frequency polygons, which are super-easy to replicate in ggplot2, especially since Pew even kind of made the data available via their interactive visualization (it’s available in... Continue reading →

Read more »

Best practices for logging computational systems in R and Python

July 7, 2016
By
Best practices for logging computational systems in R and Python

As is the case with most quant software, it’s a bit different from run-of-the-mill software. The somewhat prosaic world of …Continue reading →

Read more »

R Competition on education in South Africa (July and August 2016)

July 7, 2016
By

(Guest post by Bartosz Sękiewicz) We invite you to participate in our Kaggle-style R competition, an online team competition (1-3 people) which is based in Poland though we would welcome international teams. It will take place during July and August 2016. The organisers are Do-IT Solutions Ltd and eRka (Cracow R User Group), who have decided to join forces again...

Read more »

useR! 2016 Tutorials: Part 2

July 7, 2016
By
useR! 2016 Tutorials: Part 2

by Joseph Rickert Last week, I mentioned a few of the useR tutorials that I had the opportunity to attend. Here are the links to the slides and code for all but two of the tutorials: Regression Modeling Strategies and the rms Package - Frank Harrell Using Git and GitHub with R, RStudio, and R Markdown - Jennifer Bryan...

Read more »

Bad ways to run a user group

July 7, 2016
By
Bad ways to run a user group

I love user groups and I always want there to be more. I’m not a perfect organiser but I run reasonable groups. When I see organisers doing it badly, it makes me sad. There’s lots of great ways to run a user group, but I thought I’d cover some of the bad ways to run The post

Read more »

Implementing Apriori Algorithm in R

July 7, 2016
By
Implementing Apriori Algorithm in R

There are a bunch of blogs out there posted that show how to implement apriori algorithm in R. However, when I was working on the same, I hit a roadblock since the data was neither in single format, nor in basket(Step 2 explains what a basket format is). I spent quite some time converting the Related Post

Read more »

Latest on the Julia Language (vs. R)

July 6, 2016
By
Latest on the Julia Language (vs. R)

I’ve written before about the Julia language. As someone who is very active in the R community, I am biased of course, and have been (and remain) a skeptic about Julia. But I would like to report on a wonderful talk I attended today at Stanford. To my surprise and delight, the speaker, Viral Shah of Julia … Continue reading...

Read more »

Fatal Police Shootings Across the U.S.

July 6, 2016
By
Fatal Police Shootings Across the U.S.

I have been full of grief and sadness and some anger in the wake of yet more videos going viral in the past couple days showing black men being killed by police officers. I am not an expert on what it means to be a person of color in the United States or what is or isn’t wrong with...

Read more »

Fast and Big Linear Model Fitting with bigmemory and RcppEigen

July 6, 2016
By

In a previous post, I went over the basics of linking up bigmemory and the eigen C++ library via RcppEigen. In this post I’ll take this a bit further by creating a version of the fastLm() function of RcppEigen that can accept bigmemory objects. By doing so, we will create a fast way to fit...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Dommino data lab

Quantide: statistical consulting and training



http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.