Brief tutorial to perform descriptive statistics using R with two examples.

January 3, 2017
By
Brief tutorial to perform descriptive statistics using R with two examples.

Using a pair of databases we will do a brief and basic analysis of descriptive statistics using R.At the end of the article you can find the corresponding links to get both the script and the databases so that you can perform the exercise. Install and load the packages we are going to useinstall.packages("sm")install.packages("plotrix")library(sm)library(plotrix)library(grDevices)...

Read more »

RStudio on RaspberryPi 3

January 3, 2017
By

I have been able to build RStudio server on my RaspberryPi 3. One change was crucial: I added a 16GB swap file on an external disk to my system (mounting the external drive and using dphys-swapfile – available in Raspbian) I used the current version of Raspbian (Jessie), updated all packages, and did sudo apt-get

Read more »

The biggest R stories from 2016

January 3, 2017
By
The biggest R stories from 2016

It's been another great year for the R project and the R community. Let's look at some of the highlights from 2016. The R 3.3 major release brought some significant performance improvements to R, along with a spiffy new logo. There were also two updates in 2016: R 3.3.1 and R 3.3.2. (The R 3.2 series also received an...

Read more »

Descriptive Analytics-Part 6: Interactive dashboard ( 1/2)

January 3, 2017
By
Descriptive Analytics-Part 6: Interactive dashboard ( 1/2)

Descriptive Analytics is the examination of data or content, usually manually performed, to answer the question “What happened?”.As this series of exercises comes to an end, the last part is going to be the development of a data product. Not everybody is able to code in R, so it is useful to be able to

Read more »

Working with Shapefiles in R Exercises

January 3, 2017
By
Working with Shapefiles in R Exercises

R has many powerful libraries to handle spatial data, and the things that R can do with maps can only grow. This exercise tries to demonstrate a few basic functionalities of R while dealing with shapefiles. A shapefile is a simple, nontopological format for storing the geometric location and attribute information of geographic features. Geographic

Read more »

Commercial Applications using R

January 3, 2017
By
Commercial Applications using R

Introduction R language can also be used for commercial applications. In this article, I will describe how R language can be used for a commercial application like Payroll. In this exercise let us assume that in a typical Indian Company, every employee gets two types of allowances called Dearness Allowance(DA) and House Rent Allowance(HRA) besides … Continue...

Read more »

Why R is the best data science language to learn today

January 3, 2017
By
Why R is the best data science language to learn today

In last week’s blog, I explained why you should Master R (even if it may eventually become obsolete). I wrote that article to address people who claim mastering R is a bit of a waste of time (because it will eventually become obsolete). But when I suggested that R may eventually become obsolete, this seemed The post

Read more »

R Jobs for R users – 8 jobs from around the world (2017-01-03)

January 3, 2017
By
R Jobs for R users – 8 jobs from around the world (2017-01-03)

  To post your R job on the next post Just visit this link and post a new R job to the R community. You can post a job for free (and there is also a “featured job” option for extra exposure). Current R jobs Job seekers: please follow the links below to learn more and apply for your R job of interest: Featured Jobs Full-Time Analyst...

Read more »

Using R to download high frequency trade data directly from Bovespa

January 3, 2017
By
Using R to download high frequency trade data directly from Bovespa

Using package GetHFData - Recently, Bovespa, the Brazilian financial exchange company, allowed external access to its ftp site. In this address one can find several information regarding the Brazilian financial system, including d...

Read more »

Use CSS to format markdown or HTML files

January 3, 2017
By
Use CSS to format markdown or HTML files

Markdown (and Rmarkdown) are great ways to quickly develop material without worrying about the formatting. The documents can then be compiled using the knitr or rmarkdown packages to output formats such as HTML, latex, or even word. The main drawback o...

Read more »

Introducing the VisualResume (v0.1.0) R Package

January 2, 2017
By
Introducing the VisualResume (v0.1.0) R Package

Visual Resumes are cool Some years ago, during the course of one of my regular Google image searches for inspiring designs, I discovered Visual Resumes (aka. Infographic Resumes) like this one from Michael Anderson. I immediately fell in love. And of course, I quickly set out to create my own in R. The result is a new package called VisualResume....

Read more »

Understanding mixture models and expectation-maximization (using baseball statistics)

January 2, 2017
By
Understanding mixture models and expectation-maximization (using baseball statistics)

Previously in this series: Understanding the beta distribution Understanding empirical Bayes estimation Understanding credible intervals Understanding the Bayesian approach to false discovery rates Understanding Bayesian A/B testing Understanding beta binomial regression Understanding empirical Bayesian hierarchical modeling In this series on empirical Bayesian methods on baseball data, we’ve been...

Read more »

Django and R on Heroku

January 2, 2017
By

I’ve been playing with Django for a while now and I’m loving it. Don’t get me wrong, I still love Shiny, but Django is pretty tough to beat for data-heavy projects and managing user sessions. For deployment, I’ve been using Heroku and am very h...

Read more »

Handling Class Imbalance with R and Caret – Caveats when using the AUC

January 2, 2017
By
Handling Class Imbalance with R and Caret – Caveats when using the AUC

In my last post, I went over how weighting and sampling methods can help to improve predictive performance in the case of imbalanced classes. I also included an applied example with a simulated dataset that used the area under the ROC curve (AUC) as th...

Read more »

(lazy)Loading Cached Chunks into an Interactive R Session

January 2, 2017
By

If you cache code chunks when using knitr to generate reproducible documents then you’ve likely had the issue arrise of needing to load the results of cached chunks into an active interactive R session. The functions lazyload_cache_dir and lazyload_...

Read more »

Removing Personal Bias From Flu Severity Estimation (a.k.a. Misery Loves Data)

January 2, 2017
By
Removing Personal Bias From Flu Severity Estimation (a.k.a. Misery Loves Data)

The family got hit pretty hard with the flu right as the Christmas festivities started and we were all pretty much bed-ridden zombies up until today (2017-01-02). When in the throes of a very bad ILI it’s easy to imagine that you’re a victim of a severe outbreak, especially with ancillary data from others that... Continue reading...

Read more »

May the Force of R be With You, Always!

January 2, 2017
By
May the Force of R be With You, Always!

With my Telegram account connected to @TeleR, the force of R can always be with me, where I have data. The following is a screenshot of my mobile: If you want to have R where you are too, you will only need a Telegram account, then search for the pu...

Read more »

Shiny Chart Builder – Explore your database with a point-and-click interface

January 2, 2017
By
Shiny Chart Builder – Explore your database with a point-and-click interface

You can use R's Shiny to create an interactive dashboard with this application, make your database explorable without knowing SQL or R!

Read more »

eoda celebrates R – don’t miss the date and win!

January 2, 2017
By
eoda celebrates R – don’t miss the date and win!

#Rstatsgoes10k: Join eoda in the celebration of R and take the chance to win with your prediction! The easiest way to stay up-to-date regarding the current number of R packages is to follow eoda’s automated Twitter bot @Rstatsgoes10k. This bot regularly informs its followers of the status quo so that participants of the contest will …

Read more »

Animations in R using Plotly

January 2, 2017
By
Animations in R using Plotly

Like last year, lets have some fun with the Plotly package. We’ll try out Plotly’s new animation capabilities. You should now have something like this: For mode details visit: Plotly for R by Carson Seivert.

Read more »

Distributional Semantics in R: Part 2 Entity Recognition w. {openNLP}

January 2, 2017
By
Distributional Semantics in R: Part 2 Entity Recognition w. {openNLP}

The R code for this tutorial on Methods of Distributional Semantics in R is found in the respective GitHub repository. You will find .R, .Rmd, and .html files corresponding to each part of this tutorial (e.g. DistSemanticsBelgradeR-Part2.R, DistSemant...

Read more »

Forcasting Natural Catastrophes (is rather difficult)

January 2, 2017
By
Forcasting Natural Catastrophes (is rather difficult)

Following my previous post, I wanted to spend more time, on the time series with “global weather-related disaster losses as a proportion of global GDP” over the time period 1990-2016 that Roger Pilke sent me last night. db=data.frame(year=1990:2016, ratio=c(.23,.27,.32,.37,.22,.26,.29,.15,.40,.28,.14,.09,.24,.18,.29,.51,.13,.17,.25,.13,.21,.29,.25,.2,.15,.12,.12)) In my previous post, I spend some time explaining that we should provide some sort of ‘confidence interval’ when we...

Read more »

How to download and organize financial data from yahoo finance for several tickers

January 2, 2017
By
How to download and organize financial data from yahoo finance for several tickers

Using package BatchGetSymbols - One of the great things of working in finance is that financial datasets are freely available from sources such as Google and Yahoo Finance. This is an excelent feature for building up to date conte...

Read more »

Script to convert numeric integer data of data frame column into a digit matrix.

January 1, 2017
By
Script to convert numeric integer data of data frame column into a digit matrix.

At some point I found the need to manipulate and analyze each digit of a series of integer values, perform statistics with each of them and in some occasions add zeros at the beginning of each number. So I gave...

Read more »

What is a Linear Trend, by the way?

January 1, 2017
By
What is a Linear Trend, by the way?

I had a very stranger discussion on twitter (yes, another one), about regression curves. I think it started with a tweet based on some xkcd picture (just for fun, because it was New Year’s Day) “don’t trust linear regressions” https://t.co/exUCvyRd1G pic.twitter.com/O6rBJfkULa — Arthur Charpentier (@freakonometrics) 1 janvier 2017 There were comments on that picture, by econometricians, mainly about ‘significant’...

Read more »

Welcome to Data R Value

January 1, 2017
By
Welcome to Data R Value

HelloThank you very much for reading this blog that will be dedicated to everything related to the R programming language and to data science field. I will be publishing scripts, hints, algorithms and many more things. I will also be publishing data...

Read more »

3PL models viewed through the lens of total probability theorem

January 1, 2017
By
3PL models viewed through the lens of total probability theorem

As I currently am the NPM for PISA in Colombia, I must assist to several meetings dealing with the proper implementation of this assessment in my country. Few of them are devoted to the analysis of this kind of data (coming from IRT models). As usual, ...

Read more »

Dan Thompson 2017-01-29 11:29:11

January 1, 2017
By

Summary In this blog post I will show you how to make your own personal assistant (think Siri, Cortana, Alexa) in R, very quickly. This will be done in three steps: Get R to recognise your voice and convert it to text Set up a system which us...

Read more »

Building Shiny App exercises part 4

January 1, 2017
By
Building Shiny App exercises part 4

APPLICATION LAYOUT & REACTIVITY The fourth part of our series is “separated” into two “sub-parts”. In the first one we will start building the skeleton of our application by using tabsetPanel. This is how we will separate the sections of our app and also organize its structure better. In the second part you will learn

Read more »

Sponsors

Mango solutions









Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

ODSC1

ODSC2

datasociety

http://www.eoda.de







CRC R books series







Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.