Some New R Packages

July 14, 2016

by Joseph Rickert. New R packages keep rolling into CRAN at a prodigious rate: 184 in May, 195 in June, and July looks like it will continue the trend. I spent some time sorting through them and have picked out a few that are interesting from a data science point of view. ANLP provides functions for building text...

That’s like so random! Monte Carlo for Data Science

July 14, 2016

Another great turnout at the DataPhilly meetup last night. Was great to see all you random data nerds! Code snippets to generate animated examples here.

nzelect 0.2.0 on CRAN

July 13, 2016

Introduction: The nzelect R package, which I first introduced in a blog post in April, is now available on CRAN. The version number is 0.2.0. The difference from version 0.1.0 is sizeable: all the 2013 census data has been removed and is now in a companion package, nzcensus. This is for ease of development and maintenance,...

Notes from the Kölner R meeting, 9 July 2016

July 13, 2016

Last Thursday the Cologne R user group came together again. This time, our two speakers arrived from Bavaria to talk about Spark and R Server. Introduction to Apache Spark: Dubravko Dulic gave an introduction to Apache Spark and why Spark might be of interest to data scientists using...

Working on Data-Warehouse (SQL) with R

July 13, 2016

SQL is the most widely used query language for deep dives into data, from small data to so-called big data. However, other languages such as R are gaining users for statistical modeling and for plotting data the way a data scientist thinks about it. R as an Analytic Application

Extending R

July 12, 2016

As I was previously unaware of this book coming up, my surprise and excitement were both extreme when I received it from CRC Press a few weeks ago! John Chambers, one of the fathers of S, precursor of R, had just published a book about extending R. It covers some reflections of the author on

Slaying CIDR Orcs with Triebeard (a.k.a. fast trie-based ‘IPv4-in-CIDR’ lookups in R)

July 12, 2016

The insanely productive elf-lord, @quominus, put together a small package (triebeard) that exposes an API for radix/prefix tries at both the R and Rcpp levels. I know he had some personal needs for this and we both kinda need these to augment some functions in our iptools package. Despite triebeard having both a vignette and...

“Writing Functions in R” from Hadley & Charlotte Wickham!

July 12, 2016

Learn from the creator of ggplot2, dplyr, and more! Whether you're a seasoned veteran or R novice, DataCamp’s latest course will help you make your code more readable, avoid coding errors, and automate repetitive tasks. Like all DataCamp courses, Writing Functions in R is self-paced and combines high-quality video, in-browser coding, and gamification...

vtreat version 0.5.26 released on CRAN

July 12, 2016

Win-Vector LLC, Nina Zumel and I are pleased to announce that ‘vtreat’ version 0.5.26 has been released on CRAN. ‘vtreat’ is a data.frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. (from the package documentation) ‘vtreat’ is an R package that incorporates a number of transforms and simulated out of …

covr: A Victory for Open Source

Aimee Gott, Mango Solutions. As an academic statistician, my life before I joined Mango was one in which the concept of “proper programming” didn’t exist. I was one of those people who wrote ad-hoc tests when I changed code and …

SQL Server, Power BI, and R

July 12, 2016

by Sheri Gilley, Microsoft Senior Software Engineer. SQL Server 2016 recently reached general availability, and one of the top new capabilities it features is SQL Server R Services: advanced analytics in-database with the R language. R has also been integrated into Power BI, allowing you to create fully integrated visualizations with the power of the R language....

An Introduction to Portfolio Component Conditional Value At Risk

July 12, 2016

This post will introduce component conditional value at risk mechanics found in PerformanceAnalytics from a paper written by Brian Peterson, …

Announcing the R Shapefile Contest

July 12, 2016

Today I am happy to announce the R Shapefile Contest. The goal of the contest is to encourage and promote high quality work at the intersection... The post Announcing the R Shapefile Contest appeared first on AriLamstein.com.

New R Editor Features in Bio7

July 12, 2016

12.07.2016 Bio7 2.4 has been released for Windows, MacOSX and Linux, and this release comes with a plethora of new R editor features. The editor now supports dynamic code analysis of R scripts, refactoring methods, improved code completions, more keyboard shortcuts for faster editing and improved quick fixes which now can be opened

Analyzing Mexican votes in the U.N.

July 11, 2016

Last week I discovered the unvotes package on GitHub, so I thought I’d do some number-crunching to see if I could find anything interesting. The package provides the voting history of countries in the United Nations General Assembly, along with information such as date, description, and topics for each vote. The author makes it clear to reference the original publication of...

RProtoBuf 0.4.4, and new JSS paper

July 11, 2016

A new release 0.4.4 of RProtoBuf is now on CRAN, and corresponds to the source archive for the Journal of Statistical Software paper about RProtoBuf as JSS vol71 issue 02. The paper is also included as a pre-print in the updated package. RProtoBuf provides R bindings for the Google Protocol Buffers...

Analyzing the US elections with Facebook and R

July 11, 2016

Hillary Clinton and Donald Trump seem to be the nominees for the upcoming US presidential election in November 2016. With the election five months away, it is already provoking harsh attacks between the nominees; the campaigns of Trump and Clinton are evolving into one of the toughest contests in the recent history of US presidential elections.

Big Data Insights: Tale of IT Investments and Returns

July 11, 2016

Once again, this post presents a predictive analytical insight drawn from huge volumes of information technology security data belonging to two Fortune 500 companies (with more or less similar characteristics). Going to a quick backgro...

Time series charts by the Economist in R using Plotly

July 11, 2016

In this post we’ll recreate two infographics created by The Economist. The code uses the new Plotly 4.0 syntax. Note: Plotly 4.0 has not been officially released yet. You can download the dev version using ... Volume of Google searches related to immigrating to Canada. AIDS-related visualization.

Introducing the free Microsoft R Client

July 11, 2016

Over the years, we've shared several posts on using the ScaleR package to import, process, visualize and analyze large data sets with R. Until now, you needed to have access to a Microsoft R Server license to take advantage of the package. Now, you can use all of the capabilities of ScaleR free of charge with Microsoft R Client...

Rcpp now used by over 700 CRAN packages

July 11, 2016

Earlier this morning, Rcpp reached another milestone: 701 packages on CRAN now depend on it (as measured by Depends, Imports and LinkingTo declarations). The graph on the left depicts the growth of Rcpp usage over time. Rcpp cleared 300 packages ...

The Power of ggplot2 in ArcGIS – The Plotting Toolbox

July 11, 2016

In this post I present my third experiment with R-Bridge. The plotting toolbox is a plug-in for ArcGIS 10.3.x that allows the creation of beautiful and informative plots, with ggplot2, directly from the ESRI ArcGIS console. As always I not only provide t...

CRAN Packages on GitHub (and some CRAN DESCRIPTION observations)

July 10, 2016

Just about a week ago @thosjleeper posited something on Twitter w/r/t how many CRAN packages had associations with GitHub (i.e. how many used GitHub for development). The DESCRIPTION file (that comes with all R packages) has some fields that can house this information and most folks who do use GitHub for development of R seem...

Creating color palettes in R

July 10, 2016

In this R post, we will present how to create your own color palettes and how to work with other palettes such as RColorBrewer, wesanderson and hex codes from www.colorcombos.com for exciting color palettes.

Submitting your first package to CRAN, my experience

July 8, 2016

I recently published my first R package to The Comprehensive R Archive Network (CRAN). It was very exciting and also quite easy. Let me walk you through my process. First a description of my brand new package, badgecreatr, then a description of steps to take for submission. Package description: When you go around GitHub looking at projects you often see...

dplyr do: Some Tips for Using and Programming

July 8, 2016

If you want to compute arbitrary operations on a data frame that return more than one number, use dplyr do()! Tips and suggestions, in SE and NSE versions. The post dplyr do: Some Tips for Using and Programming appeared first on MilanoR.

The Mathematics of Machine Learning

July 8, 2016

This post was first published on my LinkedIn page and is posted here as a contributed post. In the last few months, I have had several people contact me about their enthusiasm for venturing into the world of data science and using Machine Learning (ML) techniques to probe statistical regularities and build impeccable data-driven products. However, I’ve observed that some actually lack...

Best way to draw heatmap for publication

July 8, 2016

Here are two tips I can share if you are also working with a big dataset and want a high-quality heatmap: 1. Don't generate a PDF using pheatmap() or heatmap.2(), as (i) the file is unnecessarily SUPER large if you have a lot of data points in the heatmap, s...

In case you missed it: June 2016 roundup

July 8, 2016

In case you missed them, here are some articles from June of particular interest to R users. A preview of the tutorials presented at the useR! 2016 conference. An "advanced beginner's" guide to R published by ComputerWorld includes guides on data wrangling, visualization, and data APIs. Microsoft R Server now runs on Apache Spark, bringing high performance to big-data...
