#16: Complaining Works.

February 6, 2018
By

Welcome to the sixteenth post in the relatively random R related series of posts, or R4 for short. This one will likely be brief. But it is one post I have been meaning to get out for a little while---yet did not get around to. The meta point I am trying to make today is that despite overwhelming odds that...

Read more »

Continuous integration for your private R projects with CircleCI

February 6, 2018
By
Continuous integration for your private R projects with CircleCI

If you have ever developed or used an open-source R package, you’re likely familiar with continuous integration. By automating the process of testing each proposed change in the source code, you can reduce the risk of errors, avoid unnecessary...

Read more »

R & Python Machine Learning Courses

February 6, 2018
By
R & Python Machine Learning Courses

Leeds (Predictive Analytics in R) London (Tensorflow) Birmingham (Python & Machine Learning) Hi there! We’re running some courses on R, Python and Tensorflow around the UK that you might be interested in! All courses are spearheaded with lec...

Read more »

The AI Show: Data Science Virtual Machine

February 6, 2018
By

The Data Science Virtual Machine was featured on a recent episode of the AI Show with Seth Juarez and Gopi Kumar. If you want a quick and easy way to spin up a virtual machine with all of the data science tools you'll ever need — including R and RStudio — already installed and ready to go, this video...

Read more »

Quick and Clean DMARC Record Processing with “Inline” Rcpp

February 6, 2018
By

Much of what I need to do for work-work involves using tools that are (for the moment) not in R. Today, I needed to test the validity of (and other processing on) DMARC records and I’m loathe to either reinvent the wheel or reticulate bits from a fragmented programming language ecosystem unless absolutely necessary. Thankfully,... Continue reading →

Read more »

EARL Seattle Keynote Speaker announcement: Julia Silge

February 6, 2018
By
EARL Seattle Keynote Speaker announcement: Julia Silge

...

Read more »

How to Import a CSV to an R Notebook

February 6, 2018
By
How to Import a CSV to an R Notebook

Adding a file to your R notebook is a simple 2 step process.1. “Import” file to notebook session.This makes sure your dataset is isolated in one state so that it doesn’t change as you analyze it. Remember, you can always load the latest content...

Read more »

Data transformation in #tidyverse style: package sjmisc updated #rstats

February 6, 2018
By
Data transformation in #tidyverse style: package sjmisc updated #rstats

I’m pleased to announce an update for the sjmisc-package, which was just released on CRAN. Here I want to point out two important changes in the package. New default option for recoding and transformation functions First, a small change in the code with major impact on the workflow, as it affects argument defaults and is … Weiterlesen Data transformation...

Read more »

Introducing the Kernelheaping Package

February 6, 2018
By
Introducing the Kernelheaping Package

In this blog article I'd like to introduce the univariate kernel density estimation for heaped (i.e. rounded or interval censored) data with the Kernelheaping package. It is not unusual to have interval censored data such as in income surveys due to anonymisation or simplification issues. However,...

Read more »

SPE 2018

February 6, 2018
By
SPE 2018

The 2018 edition of the long-running Statistical Practice in Epidemiology using R (SPE) course will take place in Lyon, Fance on 20-24 June 2018 at the International Agency for Research on Cancer. The deadline for applications is 1 April.

Read more »

PVA: Publication Viability Analysis, round 3

February 6, 2018
By
PVA: Publication Viability Analysis, round 3

A friend and colleague of mine, Péter Batáry has circulated news from Nature magazine about the EU freezing innovation funds to Bulgaria. The article had a figure about publication trends for Bulgaria, compared with Romania and Hungary. As I have blogged about such trends in ecology before (here and here), I felt the need to update my PVA models with two years worth of data from...

Read more »

TensorFlow for R

February 5, 2018
By
TensorFlow for R

Over the past year we’ve been hard at work on creating R interfaces to TensorFlow, an open-source machine learning framework from Google. We are excited about TensorFlow for many reasons, not the least of which is its state-of-the-art infrastructure for deep learning applications. In the 2 years since it was initially open-sourced by Google, TensorFlow has rapidly become the framework...

Read more »

The prequel to the drake R package

The prequel to the drake R package

The drake R package is a pipeline toolkit. It manages data science workflows, saves time, and adds more confidence to reproducibility. I hope it will impact the landscapes of reproducible research and high-performance computing, but I originally created it for different reasons. This post is the prequel to drake’s inception. There was struggle, and drake was the answer. Dissertation frustration Sisyphus....

Read more »

Setting up a version controlled shiny-server

February 5, 2018
By

Last week I set up a shiny server, it was relatively easy! But I wanted something more, a way to make changes on my local computer and push it to the server. Shiny server (I used the open source version) has multiple installers provided by RStudi...

Read more »

Reproducible Research: Write your Clinical Chemistry paper using R Markdown

February 4, 2018
By
Reproducible Research: Write your Clinical Chemistry paper using R Markdown

Abstract Background: This blog post is going to show you how to write a reproducible article in the field of clinical chemistry using R Mardkown. The only thing that will change for journal to journal will be the reference fomating and perhaps section numbering. The source code itself will be provided so that you can … Continue reading Reproducible...

Read more »

Visualizing Foursquare Check-Ins: Insights about New Yorkers through the lens of Foursquare data

February 4, 2018
By
Visualizing Foursquare Check-Ins: Insights about New Yorkers through the lens of Foursquare data

Introduction "Are you listening?", I asked my best friend as she tapped the screen of her space grey iPhone with her right index finger as she smiled at the screen. She wasn't. I looked around the restaurant, and I noticed that she was not the only one engrossed in her phone. This exchange inspired the

Read more »

Exploratory & sentiment analysis of beer tweets from Untappd on Twitter

February 4, 2018
By
Exploratory & sentiment analysis of beer tweets from Untappd on Twitter

Project Objective Untappd has some usage restrictions for their API namely not allowing any exploratory of analytics uses, so I’m going to explore tweets of beer and brewery check-ins from the Untappd app to find some implicit trends in how users share their activity. Exploratory Analysis library(tidyverse) library(rtweet) library(stringr) library(wesanderson) library(maps) library(tidytext) library(dumas) # http://jasdumas.github.io/dumas/ library(wordcloud) All social media shares from the Untappd app include their own short URL ‘untp.beer’,...

Read more »

Hottest Opportunities and Trends in Solar Business

February 4, 2018
By
Hottest Opportunities and Trends in Solar Business

Background Do you like earning money? And, at the same time contribute to saving the planet? What if I prove to you Solar Business is the next booming field having plenty of opportunities in coming decades, with plus point of reducing our carbon footprint. You'd argue...Really? But solar installations are so expensive! They are not

Read more »

R in Latin America

February 4, 2018
By
R in Latin America

Following on from our post on R in Africa this next post in the series looks at the R community across Latin America. Conferences As the use of R has grown in Latin America, there has been an increasing demand for local R conferences. The community has responded and a number of conferences are planned for the coming months. International Seminar on Statistics with R One...

Read more »

April 12th & 13th: workshop on Deep Learning with Keras and TensorFlow in R

February 4, 2018
By
April 12th & 13th: workshop on Deep Learning with Keras and TensorFlow in R

Registration is now open for my 1.5-day workshop on deep learning with Keras and TensorFlow using R. It will take place on April 12th and 13th in Hamburg, Germany. In my workshop, you will learn the basics of deep learning what cross-entropy and loss ...

Read more »

Concatenate Embeddings for Categorical Variables with Keras

February 4, 2018
By
Concatenate Embeddings for Categorical Variables with Keras

In my last post, I explored how to use embeddings to represent categorical variables. Furthermore, I showed how to extract the embeddings weights to use them in another model. While the concept of embedding representation has been used in NLP for quit...

Read more »

Department of Consumer Affairs Charges

February 4, 2018
By
Department of Consumer Affairs Charges

The Charges Data Set The NYC Open Data project provides the Department of Consumer Affairs data from 2016-2017.  The Department of Consumer Affairs was started in 1969 with the goal to "ensure compliance with local consumer protection and licensing laws, and State and federal regulations."  They oversee almost every industry in NYC.  This data set describes the charges

Read more »

What digits should you bet on in Super Bowl squares?

February 4, 2018
By
What digits should you bet on in Super Bowl squares?

My new office introduced me to a betting game I wasn’t previously familiar with: Super Bowl squares. It’s played with a ten-by-ten grid, like this one from printyourbrackets.com: Each row and column gets an assortment of digits from 0-9 represen...

Read more »

S&P500 Dashboard

February 3, 2018
By
S&P500 Dashboard

This is my Shiny Dashboard for stocks listed on S&P500. Using R shiny to give a visualized general insight of market and using ARIMA model to analyze risk for a single stock. Background When I worked as a trader. I found that it was really inconvenient to do the daily market research because the information on

Read more »

How to raise money on Kickstarter – tutorial with EDA and predictions

February 3, 2018
By
How to raise money on Kickstarter – tutorial with EDA and predictions

 In this tutorial, we will explore the characterisitcs of projects on Kickstarter and try to understand what separates the winners from the projects that failed to reach their funding goals. ( This post is mirrored from our main blog site blog.journeyofanalytics.com . The code can be downloaded or run “LIVE” from Kaggle using this link.) Qs … Continue reading How...

Read more »

Documenting R packages: roxygen2 vs. direct Rd input

February 3, 2018
By
Documenting R packages: roxygen2 vs. direct Rd input

As the reader may know, R objects are documented in files written in “R documentation” (Rd) format, a simple markup language much of which closely resembles (La)TeX, which can be processed into a variety of formats, including LaTeX, HTML and plain text. This LaTeX-like syntax, combined with the fact that the actual R objects live… Continuar leyendo Documenting R...

Read more »

Enter SadMan

February 2, 2018
By
Enter SadMan

Yet more Metalliplots I’m just going to tidy up some issues from my last post where I did some tidytext analysis on the legendary rockers - Metallica. It was a fun way to pass some time and keep my hand in with text mining, something I st...

Read more »

RVowpalWabbit 0.0.12

February 2, 2018
By

And yet another boring little RVowpalWabbit package update, now to version 0.0.12, and still in response to the CRAN request of not writing files where we should not (as caught by new tests added by Kurt). I had misinterpreted one flag and actually i...

Read more »

Average spend, activities and length of visit in the NZ International Visitor Survey by @ellis2013nz

February 2, 2018
By
Average spend, activities and length of visit in the NZ International Visitor Survey by @ellis2013nz

A puzzle In my last blog post, I gave a hypothetical test exercise for candidates in a recruitment process, involving the International Visitor Survey (IVS) from the Ministry of Business, Innovation and Employment (MBIE). I once managed the team resp...

Read more »

Search R-bloggers


Sponsors

Mango solutions





Zero Inflated Models and Generalized Linear Mixed Models with R



Quantide: statistical consulting and training

ODSC2 west

ODSC1_jobs

datasociety

http://www.eoda.de



CRC R books series







Six Sigma Online Training



mljar.com



Contact us if you wish to help support R-bloggers, and place your banner here.