Does sentiment analysis work? A tidy analysis of Yelp reviews

July 21, 2016

This year Julia Silge and I released the tidytext package for text mining using tidy tools such as dplyr, tidyr, ggplot2 and broom. One of the canonical examples of tidy text mining this package makes possible is sentiment analysis. Sentiment analysis is often used by companies to quantify general social media opinion (for...


Performing Principal Components Regression (PCR) in R

July 20, 2016

Principal components regression (PCR) is a regression method based on Principal Component Analysis: discover how to perform this data mining technique in R. The post Performing Principal Components Regression (PCR) in R appeared first on MilanoR.
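
As a rough illustration of the idea (not the MilanoR post's own code), PCR can be sketched in base R by running PCA on the predictors and then regressing the response on the first few component scores. The built-in mtcars data, the choice of mpg as the response, and keeping k = 2 components are all assumptions made for this example.

```r
# Minimal PCR sketch in base R (illustrative assumptions: mtcars data, k = 2).
predictors <- mtcars[, c("disp", "hp", "wt", "drat", "qsec")]
response   <- mtcars$mpg

# Step 1: PCA on centered, scaled predictors.
pca <- prcomp(predictors, center = TRUE, scale. = TRUE)

# Step 2: keep the scores of the first k principal components.
k <- 2
scores <- as.data.frame(pca$x[, 1:k])

# Step 3: ordinary least squares of the response on those scores.
pcr_fit <- lm(response ~ ., data = scores)
summary(pcr_fit)$r.squared
```

In practice the number of components would usually be chosen by cross-validation rather than fixed in advance.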


Monte Carlo Analysis of Manning’s Equation: A Shiny App

July 20, 2016

Monte Carlo analysis is a great way to explore the impact of input variable uncertainty on the results of engineering equations, and with vector variables and distribution and sampling functions at its core, R is a natural platform for this analysis. During a recent rainy vacation, I built a Shiny app that applies...
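
To make the point about vectors and sampling functions concrete, here is a minimal sketch of a Monte Carlo run over Manning's equation in SI units, V = (1/n) R^(2/3) S^(1/2). It is not the code behind the Shiny app; the input distributions, their parameters, and the variable names are hypothetical and chosen only for illustration.

```r
# Monte Carlo sketch for Manning's equation (SI units): V = (1/n) * R^(2/3) * sqrt(S)
set.seed(42)                       # reproducible draws
n_sims <- 10000

# Hypothetical input distributions, chosen only for illustration:
n_rough <- runif(n_sims, min = 0.025, max = 0.035)   # Manning roughness coefficient
radius  <- rnorm(n_sims, mean = 1.2, sd = 0.05)      # hydraulic radius (m)
slope   <- runif(n_sims, min = 0.0008, max = 0.0012) # channel slope (m/m)

# Vectorized evaluation: one velocity per simulated input set.
velocity <- (1 / n_rough) * radius^(2/3) * sqrt(slope)

# Summarize the resulting distribution of velocities (m/s).
quantile(velocity, probs = c(0.05, 0.5, 0.95))
```

Because every input is a vector of draws, the equation is evaluated for all simulations in a single expression, with no explicit loop.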


Bayesian Bootstrap: The Movie + Some Highlights from UseR! 2016

July 20, 2016

Not surprisingly, this year’s UseR! conference was a great event with heaps of talented researchers and R-developers showing off the latest and greatest R packages. (A surprise visit from Donald Knuth didn’t hurt either.) What was extra great thi...


Networks, mazes, and R (Rmaze)

July 20, 2016

Like many people who work with networks (aka graphs), I like to emphasize their usability and applicability to different problems. If some phenomenon can be described by its objects and the relationships among those objects – well, hello networks. One of …


An analysis of Pokémon Go types, created with R

July 20, 2016

As anyone who has tried Pokémon Go recently is probably aware, Pokémon come in different types. A Pokémon's type affects where and when it appears, and the types of attacks it is vulnerable to. Some types, like Normal, Water and Grass, are common; others, like Fairy and Dragon, are rare. Many Pokémon have two or more types. To get...


Life Expectancy by Country

July 20, 2016

I was rather inspired by this plot on Wikipedia’s List of Countries by Life Expectancy. Shouldn’t be too hard to reproduce with a bit of scraping. Here are the results (click on the static image to view the interactive plot): The bubble plot above compares female and male life expectancies for a number of countries.


Giving back with code

July 20, 2016

From code in answers on Stack Overflow to R packages or full programs, there’s a lot of code being written and given away. This post examines some of the reasons why the people writing all that code do it, why you should consider giving back with code, and how you can get started. Finally, I...


leading edge analysis

July 20, 2016

Leading edge and core enrichment: Leading edge analysis reports Tags to indicate the percentage of genes contributing to the enrichment score, List to indicate where in the list the enrichment score is attained, and Signal for enrichment signal strength. It would also be very interesting to get the core enriched genes that contribute to the enrichment. Now DOSE, clusterProfiler and...


Calculating Memory Requirements

July 19, 2016

I had a conversation with people at the office about size and memory requirements that a computer must have to deal with a data frame. It started like this: suppose you have a data frame with 2,000,000 rows and 250 columns, all of which are numeric dat...
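
A back-of-the-envelope version of that calculation can be sketched as follows. It assumes every column is stored as a double-precision numeric at 8 bytes per value, and it ignores object overhead and any copies R makes while working with the data, so the figure is a lower bound rather than a sizing recommendation.

```r
# Back-of-the-envelope memory estimate for an all-numeric data frame.
n_rows <- 2e6          # rows, from the example in the text
n_cols <- 250          # columns, from the example in the text
bytes_per_value <- 8   # assumption: every column is a double-precision numeric

total_bytes <- n_rows * n_cols * bytes_per_value
total_gib   <- total_bytes / 2^30

total_bytes          # 4e+09 bytes
round(total_gib, 2)  # about 3.73 GiB
```

On a real object, object.size() reports the memory actually used, which makes a useful check against an estimate like this.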


Discover R and RStudio at JSM 2016 Chicago!

July 19, 2016

The JSM conference in Chicago, July 31 through August 4, 2016, is one of the largest statistics conferences, with many terrific talks for R users. We’ve listed some of the sessions that we’re particularly excited about below. These include talks from RStudio employees, like Hadley Wickham, Yihui Xie, Mine Cetinkaya-Rundel, Garrett Grolemund,


Updated OpenBLAS instructions for R-3.3+ and Rtools34

July 19, 2016

I’ve just updated the instructions for building a 64-bit OpenBLAS-based Rblas.dll for Windows to reflect changes to R 3.3+ and Rtools34. Enjoy!


9 new R jobs from around the world (2016-07-19)

July 19, 2016

Here are the new R Jobs for 2016-07-19. To post your R job on the next post, just visit this link and post a new R job to the R community. You can either post a job for free (which works great), or pay $50 to have your job featured (and get extra exposure). Current R jobs: Job seekers, please follow the links below to learn more and...


Candlestick charts using Quandl and Plotly

July 19, 2016

In this post we’ll show how to create candlestick charts using the new plotly 4.0 syntax. You can refer to this older post as well. This time we’ll use the Quandl package to retrieve stock data. See here for more details.


dplyr and Zika – Epilogue

July 19, 2016

I really thought I was done with the Express dplyr series, but on completing the second part I received many messages requesting more examples of using dplyr with ggplot, along with other types of information such as the Zika virus data, which can be downloaded from GitHub. These examples are not drastically different


Principal Component Analysis Cluster Plots with Plotly

July 19, 2016

The Problem: When clustering data using principal component analysis, it is often of interest to visually inspect how well the data points separate in 2-D space based on principal component scores. While this is fairly straightforward to visualize with a scatterplot, the plot can become cluttered quickly with annotations as shown in the following figure:


Sustainability through Energy Load Shaping for Buildings using R

July 19, 2016

by Mike Wise, Data Scientist / Solution Architect – MCS Incubation Services. Background: Building energy consumption is a huge component of total energy consumption. In the USA, for example, around 40 percent of the energy consumed is in buildings, and as the USA consumes something like 25 percent of the world's energy, it is clear we are talking about a...


Installing the additional R packages in Oracle Big Data Lite VM 4.5.0

Oracle has just released version 4.5.0 of the Big Data Lite VM which, when it comes to R, still suffers from the issues we had pinpointed for the previous version 4.4.0 (and then some). The first attempt to install the additional packages fails with a ‘cannot open URL’ error: Fortunately, the warning about the proxy helps to locate the...


Rcpp 0.12.6: Rolling on

July 19, 2016

The sixth update in the 0.12.* series of Rcpp arrived on the CRAN network for GNU R a few hours ago, and was just pushed to Debian. This 0.12.6 release follows the 0.12.0 release from late July, the 0.12.1 release in September, the 0.12.2 release in November, the 0.12.3 release...


Spatio-Temporal Point Pattern Analysis in ArcGIS with R

July 19, 2016

This post will probably be the last in my series about merging R and ArcGIS. In August, unfortunately, I will have to work for real and I will not have time to play with R-Bridge any more. In this post I would like to present a toolbox to perform some introductory point pattern analysis in R through ArcGIS....


Easy access to data on US politics: New version of pvsR now on BitBucket

July 19, 2016

I am happy to announce a new release (version 0.4) of the R-package pvsR on Bitbucket. pvsR facilitates data retrieval from Project Vote Smart's rich online database on US politics via the Project Vote Smart application programming interface (PVS API). The functions in this package cover most PVS API classes...


stacksurveyr: An R package with the 2016 Developer Survey Results

July 18, 2016

This year, more than fifty thousand programmers answered the Stack Overflow 2016 Developer Survey, in the largest survey of professional developers in history. Last week Stack Overflow released the full (anonymized) results of the survey at stackoverf...


Releasing the StackLite dataset of Stack Overflow questions and tags

July 18, 2016

At Stack Overflow we’ve always been committed to sharing data: all content contributed to the site is CC-BY-SA licensed, and we release regular “data dumps” of our entire history of questions and answers. I’m excited to announce a new resource...


Populating data frame cells with more than one value

July 18, 2016

Data frames are lists: Most R users will know that data frames are lists. You can easily verify that a data frame is a list by typing … However, data frames are lists with some special properties. For example, all entries in the list must have the same length (here 2), etc. You can find a
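
The post's own snippet is not reproduced in this excerpt, so the following is only an illustrative sketch with made-up data. It checks that a two-row data frame is a list whose columns all have length 2, then uses a list column so that a single cell can hold more than one value.

```r
# A small two-row data frame (illustrative data, not from the post).
df <- data.frame(id = 1:2, name = c("a", "b"))

is.list(df)   # TRUE: a data frame is a list of columns
lengths(df)   # every column has the same length, here 2

# A list column lets a single cell hold more than one value.
df$values <- list(1:3, c(10, 20))
lengths(df$values)   # 3 and 2
```

The list column still has one entry per row, so the equal-length rule is respected even though the individual cells differ in size.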


The trick to understanding NAs (missing values) in R

July 18, 2016

Here's a little puzzle that might shed some light on some apparently confusing behaviour by missing values (NAs) in R: What is NA^0 in R? You can get the answer easily by typing at the R command line:

> NA^0
[1] 1

But the interesting question that arises is: why is it 1? Most people might expect that the...
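
One common way to reason about this, offered here as a sketch rather than as the post's own explanation (which is cut off above), is to treat NA as a placeholder for an unknown value. If every possible value would produce the same result, R can return that result; otherwise the result stays NA.

```r
# NA stands for an unknown value. When every possible value would give the
# same answer, R can return that answer instead of NA.
NA^0          # 1: any number raised to the power 0 is 1
NA || TRUE    # TRUE: "anything OR TRUE" is TRUE
NA && FALSE   # FALSE: "anything AND FALSE" is FALSE

# When the answer depends on the unknown value, the result stays NA.
NA + 1        # NA
NA > 0        # NA
```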


Escalating Life Expectancy

July 18, 2016

I’ve added mortality data to the lifespan package. A result that immediately emerges from these data is that average life expectancy is steadily climbing. The effect is more pronounced for men, rising from around 66.5 in 1994 to 70.0 in 2014. The corresponding values for women are 74.6 and 76.5 respectively. Good news for everyone.


EARL 2016 London Workshops Selling Fast

July 18, 2016

A Great Choice of Workshops: With less than two months to go until EARL2016 in London, the workshops are filling up fast, and this year, for the first time, we are offering workshops to non-conference attendees. The UK’s only R Conference this year …


Exploring the diversity of Life using Rvest and the Catalog of Life

July 18, 2016

I am writing the general introduction for my thesis and wanted to have a nice illustration of the diversity of Arthropods compared to other phyla (my work focuses on Arthropods, so this is a nice motivation). As the literature I have had access to so far uses pie charts to graphically represent these diversities and knowing


Dance of the Bayes factors

July 18, 2016

You might have seen the ‘Dance of the p-values’ video by Geoff Cumming (if not, watch it here). Because p-values and the default Bayes factors (Rouder, Speckman, Sun, Morey, & Iverson, 2009) are both calculated directly from t-values and sample sizes, we might expect there is also a Dance...

