The prequel to the drake R package

The prequel to the drake R package

The drake R package is a pipeline toolkit. It manages data science workflows, saves time, and adds more confidence to reproducibility. I hope it will impact the landscapes of reproducible research and high-performance computing, but I originally created it for different reasons. This post is the prequel to drake’s inception. There was struggle, and drake was the answer. Dissertation frustration Sisyphus....

Read more »

Setting up a version controlled shiny-server

February 5, 2018
By

Last week I set up a shiny server, it was relatively easy! But I wanted something more, a way to make changes on my local computer and push it to the server. Shiny server (I used the open source version) has multiple installers provided by RStudi...

Read more »

Comparing Quality of Life and Demographics of Major Cities

February 5, 2018
By
Comparing Quality of Life and Demographics of Major Cities

Introduction: As an avid traveler, I have always been interested in discovering what makes a city unique. Tools informing travelers of unique landmarks and activities in the places to which they venture have been ubiquitous for ages. While I appreciate the different elements that make a city unique, I also have grown to understand that

Read more »

Shiny_Project1_Nan_Liu

February 5, 2018
By

View the code on Gist.

Read more »

Citi Bike Riders in Different Ages – the Potential of Target Advertising

February 5, 2018
By
Citi Bike Riders in Different Ages – the Potential of Target Advertising

Introduction As more and more people enjoy healthy and efficient lifestyles, riding bikes to work is recognized as a better commuting choice than driving or walking. The Citi Bike program launched in 2013 with 332 stations and 6000 bikes. It now owns 706 stations and 12,000 bikes, making it be the largest sharing bike program in

Read more »

Installing rJava on Ubuntu

February 5, 2018
By

Installing the rJava package on Ubuntu is not quite as simple as most other R packages. Some quick notes on how to do it. Install the Java Runtime Environment (JRE). sudo apt-get install -y default-jre Install the Java Development Kit (JDK). sudo apt-get install -y default-jdk Update where R expects to find various Java files. sudo R CMD javareconf Install the package. __ install.packages("rJava") If you have a...

Read more »

Reproducible Research: Write your Clinical Chemistry paper using R Markdown

February 4, 2018
By
Reproducible Research: Write your Clinical Chemistry paper using R Markdown

Abstract Background: This blog post is going to show you how to write a reproducible article in the field of clinical chemistry using R Mardkown. The only thing that will change for journal to journal will be the reference fomating and perhaps section numbering. The source code itself will be provided so that you can … Continue reading Reproducible...

Read more »

Visualizing Foursquare Check-Ins: Insights about New Yorkers through the lens of Foursquare data

February 4, 2018
By
Visualizing Foursquare Check-Ins: Insights about New Yorkers through the lens of Foursquare data

Introduction "Are you listening?", I asked my best friend as she tapped the screen of her space grey iPhone with her right index finger as she smiled at the screen. She wasn't. I looked around the restaurant, and I noticed that she was not the only one engrossed in her phone. This exchange inspired the

Read more »

Exploratory & sentiment analysis of beer tweets from Untappd on Twitter

February 4, 2018
By
Exploratory & sentiment analysis of beer tweets from Untappd on Twitter

Project Objective Untappd has some usage restrictions for their API namely not allowing any exploratory of analytics uses, so I’m going to explore tweets of beer and brewery check-ins from the Untappd app to find some implicit trends in how users share their activity. Exploratory Analysis library(tidyverse) library(rtweet) library(stringr) library(wesanderson) library(maps) library(tidytext) library(dumas) # http://jasdumas.github.io/dumas/ library(wordcloud) All social media shares from the Untappd app include their own short URL ‘untp.beer’,...

Read more »

A Crime Guide to New York City

February 4, 2018
By
A Crime Guide to New York City

    Background & Purpose Just 20 years ago, the streets of New York were racked with all kinds of crime, from murders, drug deals, to grand larceny, petite thefts. Since the late 90s, the city has seen an encouraging trend in steadily-declining crime rates. However, it goes without saying that even today, public safety

Read more »

New York City Motor Vehicle Collision Data Visualization

February 4, 2018
By
New York City Motor Vehicle Collision Data Visualization

Everybody loves New York City. Nobody likes car accidents. Why bother look at the motor vehicle collision data? Well, reality is reality. Road safety is by any means a critical issue, and is relevant to everybody's daily life. It's inevitable, and more often than not, a life-or-death situation indeed. Therefore, it is very important to

Read more »

Hottest Opportunities and Trends in Solar Business

February 4, 2018
By
Hottest Opportunities and Trends in Solar Business

Background Do you like earning money? And, at the same time contribute to saving the planet? What if I prove to you Solar Business is the next booming field having plenty of opportunities in coming decades, with plus point of reducing our carbon footprint. You'd argue...Really? But solar installations are so expensive! They are not

Read more »

R in Latin America

February 4, 2018
By
R in Latin America

Following on from our post on R in Africa this next post in the series looks at the R community across Latin America. Conferences As the use of R has grown in Latin America, there has been an increasing demand for local R conferences. The community has responded and a number of conferences are planned for the coming months. International Seminar on Statistics with R One...

Read more »

April 12th & 13th: workshop on Deep Learning with Keras and TensorFlow in R

February 4, 2018
By
April 12th & 13th: workshop on Deep Learning with Keras and TensorFlow in R

Registration is now open for my 1.5-day workshop on deep learning with Keras and TensorFlow using R. It will take place on April 12th and 13th in Hamburg, Germany. In my workshop, you will learn the basics of deep learning what cross-entropy and loss ...

Read more »

Concatenate Embeddings for Categorical Variables with Keras

February 4, 2018
By
Concatenate Embeddings for Categorical Variables with Keras

In my last post, I explored how to use embeddings to represent categorical variables. Furthermore, I showed how to extract the embeddings weights to use them in another model. While the concept of embedding representation has been used in NLP for quit...

Read more »

Department of Consumer Affairs Charges

February 4, 2018
By
Department of Consumer Affairs Charges

The Charges Data Set The NYC Open Data project provides the Department of Consumer Affairs data from 2016-2017.  The Department of Consumer Affairs was started in 1969 with the goal to "ensure compliance with local consumer protection and licensing laws, and State and federal regulations."  They oversee almost every industry in NYC.  This data set describes the charges

Read more »

What digits should you bet on in Super Bowl squares?

February 4, 2018
By
What digits should you bet on in Super Bowl squares?

My new office introduced me to a betting game I wasn’t previously familiar with: Super Bowl squares. It’s played with a ten-by-ten grid, like this one from printyourbrackets.com: Each row and column gets an assortment of digits from 0-9 represen...

Read more »

S&P500 Dashboard

February 3, 2018
By
S&P500 Dashboard

This is my Shiny Dashboard for stocks listed on S&P500. Using R shiny to give a visualized general insight of market and using ARIMA model to analyze risk for a single stock. Background When I worked as a trader. I found that it was really inconvenient to do the daily market research because the information on

Read more »

goodreads: what books #estoniareads?

February 3, 2018
By
goodreads: what books #estoniareads?

Estonian National Broadcasting Agency run a wonderful campaign called “Eesti loeb” (Estonia reads), where people were asked to post a picture of themselves with a book they are reading (or have read) and find inspiring. Photos of readers with their favourite books were published afterwards. But which books are the most popular ones among Estonians? We are going to answer...

Read more »

How to raise money on Kickstarter – tutorial with EDA and predictions

February 3, 2018
By
How to raise money on Kickstarter – tutorial with EDA and predictions

 In this tutorial, we will explore the characterisitcs of projects on Kickstarter and try to understand what separates the winners from the projects that failed to reach their funding goals. ( This post is mirrored from our main blog site blog.journeyofanalytics.com . The code can be downloaded or run “LIVE” from Kaggle using this link.) Qs … Continue reading How...

Read more »

Documenting R packages: roxygen2 vs. direct Rd input

February 3, 2018
By
Documenting R packages: roxygen2 vs. direct Rd input

As the reader may know, R objects are documented in files written in “R documentation” (Rd) format, a simple markup language much of which closely resembles (La)TeX, which can be processed into a variety of formats, including LaTeX, HTML and plain text. This LaTeX-like syntax, combined with the fact that the actual R objects live… Continuar leyendo Documenting R...

Read more »

Enter SadMan

February 2, 2018
By
Enter SadMan

Yet more Metalliplots I’m just going to tidy up some issues from my last post where I did some tidytext analysis on the legendary rockers - Metallica. It was a fun way to pass some time and keep my hand in with text mining, something I st...

Read more »

RVowpalWabbit 0.0.12

February 2, 2018
By

And yet another boring little RVowpalWabbit package update, now to version 0.0.12, and still in response to the CRAN request of not writing files where we should not (as caught by new tests added by Kurt). I had misinterpreted one flag and actually i...

Read more »

Average spend, activities and length of visit in the NZ International Visitor Survey by @ellis2013nz

February 2, 2018
By
Average spend, activities and length of visit in the NZ International Visitor Survey by @ellis2013nz

A puzzle In my last blog post, I gave a hypothetical test exercise for candidates in a recruitment process, involving the International Visitor Survey (IVS) from the Ministry of Business, Innovation and Employment (MBIE). I once managed the team responsible for that survey and other sector data, so I was familiar with it, knew the exercise made sense, and...

Read more »

Average spend, activities and length of visit in the NZ International Visitor Survey by @ellis2013nz

February 2, 2018
By
Average spend, activities and length of visit in the NZ International Visitor Survey by @ellis2013nz

A puzzle In my last blog post, I gave a hypothetical test exercise for candidates in a recruitment process, involving the International Visitor Survey (IVS) from the Ministry of Business, Innovation and Employment (MBIE). I once managed the team resp...

Read more »

Average spend, activities and length of visit in the NZ International Visitor Survey by @ellis2013nz

February 2, 2018
By
Average spend, activities and length of visit in the NZ International Visitor Survey by @ellis2013nz

A puzzle In my last blog post, I gave a hypothetical test exercise for candidates in a recruitment process, involving the International Visitor Survey (IVS) from the Ministry of Business, Innovation and Employment (MBIE). I once managed the team resp...

Read more »

Write Elegant Loops Using Loop Invariants

February 1, 2018
By
Write Elegant Loops Using Loop Invariants

Writing loops is an essential part of programming, but writing elegant loops is not easy. For example, in an interview Linus showed a commonly used piece of code which delete an entry in a link list (we will discuss this example). The task was simple but the loop in the solution was ugly.

Read more »

Free ticket to eRum

February 1, 2018
By
Free ticket to eRum

The Main Competition The Secondary Competition So… big news. Jumping Rivers is sponsoring eRum 2018 and in light of this news we are giving away a free place at the conference! (Not to mention our very own lead consultant, Colin Gillespie, i...

Read more »

Cost-Effective BigQuery with R

February 1, 2018
By
Cost-Effective BigQuery with R

Introduction Companies using Google BigQuery for production analytics often run into the following problem: the company has a large user hit table that spans many years. Since queries are billed based on the fields accessed, and not on the date-ranges queried, queries on the table are billed for all available days and are increasingly wasteful. A solution is to partition the...

Read more »

Search R-bloggers


Sponsors

Mango solutions





mckinsey.com global hackathon

Zero Inflated Models and Generalized Linear Mixed Models with R



datasciencego.com

Quantide: statistical consulting and training

ODSC2 west

ODSC1_london

datasociety

http://www.eoda.de

max kuhn









Six Sigma Online Training



mljar.com

computationalanalytics.com

Our ads respect your privacy. Read our Privacy Policy page to learn more.

Contact us if you wish to help support R-bloggers, and place your banner here.