Play with the cyphr package

March 24, 2019
By

The cyphr package seems to be a good choice for a small research group that shares sensitive data over the internet (e.g., Dropbox). I ran a simple experiment myself to make sure it can actually serve my purpose. I ran the experiment on two computers (using openssl): I created the test data on my Linux workstation running Manjaro, then I tried to...

Read more »

nice student project

March 24, 2019
By

In all of my undergraduate classes, I require a term project, done in groups of 3-4 students. Though the topic is specified, it is largely open-ended, a level of...

Read more »

ShinyProxy 2.2.0

March 24, 2019
By

ShinyProxy is a novel, open source platform to deploy Shiny apps for the enterprise or larger organizations. Secured Embedding of Shiny Apps Since version 2.0.1 ShinyProxy provides a REST API to manage...

Read more »

Using R and H2O to identify product anomalies during the manufacturing process.

March 23, 2019
By

Introduction: We will identify anomalous products on the production line by using measurements from testing stations and deep learning models. Anomalous products are not failures; these anomalies are products close...

Read more »

How cdata Control Table Data Transforms Work

March 23, 2019
By

With all of the excitement surrounding cdata style control table based data transforms (the cdata ideas being named as the “replacements” for tidyr's current methodology, by the tidyr authors...

Read more »

Strength of a Lennon song exposed with R function glue::glue

March 23, 2019
By

love_verse

Read more »

Can you turn 1,500 R$ into 1,000,430 R$ by investing in the stock market?

March 22, 2019
By

In the last few weeks we’ve seen a great deal of controversy in Brazil regarding financial investments. To keep it short, Empiricus, an ad-based company that massively sells online...

Read more »

RcppArmadillo 0.9.300.2.0

March 22, 2019
By

A new RcppArmadillo release based on a new Armadillo upstream release arrived on CRAN and Debian today. Armadillo is a powerful and expressive C++ template library for linear algebra aiming...

Read more »

Data Fun – Inspired by Datasaurus

March 22, 2019
By

After my recent post on Anscombe's Quartet in which I demonstrated how to efficiently adjust any data set to match mean, variance, correlation (x,y), as well as regression coefficients....

Read more »

Why we Did Not Name the cdata Transforms wide/tall/long/short

March 22, 2019
By

We recently saw this UX (user experience) question from the tidyr author as he adapts tidyr to cdata techniques. The terminology that he is not adopting from cdata is...

Read more »

Decode Lyrics in Pop Music: Visualise Prose with the Songsim algorithm

March 22, 2019
By

The post Decode Lyrics in Pop Music: Visualise Prose with the Songsim algorithm appeared first on The Lucid Manager. The lyrics of songs are more and more repetitive. Within this...

Read more »

A Quick and Tidy Look at the 2018 GSS

March 22, 2019
By

The data from the 2018 wave of the General Social Survey was released during the week, leading to a flurry of graphs showing various trends. The GSS is one...

Read more »

AFL teams Elo ratings and footy-tipping by @ellis2013nz

March 22, 2019
By

So now that I live in Melbourne, to blend in with the locals I need to at least vaguely follow the AFL (Australian Football League). For instance, my work...

Read more »

Human Face Detection with R

March 22, 2019
By

Doing human face detection with computer vision is probably something you do once unless you work for police departments, you work in the surveillance industry or for the Chinese...

Read more »

How to Speed Up Gradient Boosting by a Factor of Two

March 22, 2019
By

Our latest tool development at STATWORX: random boost, an algorithm twice as fast as gradient boosting, with comparable prediction performance. The post How to Speed Up Gradient Boosting by...

Read more »

How long since your team scored 100+ points? This blog’s first foray into the fitzRoy R package

March 21, 2019
By

When this blog moved from bioinformatics to data science I ran a Twitter poll to ask whether I should start afresh at a new site or continue here. “Continue...

Read more »

RStudio Connect 1.7.2

March 21, 2019
By

RStudio Connect 1.7.2 is ready to download, and this release contains some long-awaited functionality that we are excited to share. Several authentication and user-management tooling improvements have been added, including the...

Read more »

Upcoming talks in spring 2019

March 21, 2019
By

This spring, I’ll be giving talks at a couple of Meetups and conferences. March 26th: At the data lounge Bremen, I’ll be talking about Explainable Machine Learning. April 11th: At the...

Read more »

How to Avoid Publishing Credentials in Your Code

March 20, 2019
By

Roland Stevenson is a data scientist and consultant who may be reached on Linkedin. When accessing an API or database in R, it is often necessary to provide credentials such...

Read more »

All Around The World: Maps and Flags in R

March 20, 2019
By

Our lab is international. People born all over the world have come to work in my group. I’m proud of this fact, especially in the current political climate. I’ve...

Read more »

Pivoting data frames just got easier thanks to `pivot_wide()` and `pivot_long()`

There’s a lot going on in the development version of {tidyr}. New functions for pivoting data frames, pivot_wide() and pivot_long() are coming, and will replace the current functions, spread() and gather()....

Read more »

Data Science Software Reviews: Forrester vs. Gartner

March 19, 2019
By

In my previous post, I discussed Gartner's reviews of data science software companies. In this post, I show Forrester's coverage and discuss how radically different it is. As usual,...

Read more »

The importance of Graphing Your Data – Anscombe’s Clever Quartet!

March 19, 2019
By

Francis Anscombe's seminal paper "Graphs in Statistical Analysis" (American Statistician, 1973) effectively makes the case that looking at summary statistics of data is insufficient to identify the...

Read more »

R and labelled data: Using quasiquotation to add variable and value labels #rstats

March 19, 2019
By

Labelling data is typically a task for end-users and is applied in their own scripts or functions rather than in packages. However, sometimes it can be useful for both end-users...

Read more »

Tidyverse users: gather/spread are on the way out

March 19, 2019
By

From https://twitter.com/sharon000/status/1107771331012108288: From https://tidyr.tidyverse.org/dev/articles/pivot.html: There are two important new features inspired by other R packages that have been advancing reshaping in R: The reshaping operation can be specified...

Read more »

Learning Data Science: Predicting Income Brackets

March 19, 2019
By

As promised in the post Learning Data Science: Modelling Basics we will now go a step further and try to predict income brackets with real world data and different...

Read more »

Assumptions Matter More Than Dependencies

March 18, 2019
By

There’s been a lot of talk about “dependencies” in the R universe of late. This is not really a post about that but more of a “really, don’t do this”...

Read more »

The Credibility Crisis in Data Science

March 18, 2019
By

Hugo Bowne-Anderson, the host of DataFramed, the DataCamp podcast, recently interviewed Skipper Seabold, a Director of Data Science at Civis Analytics. Introducing Skipper Seabold Hugo: ...

Read more »

RStudio Connect Quickstart

March 18, 2019
By

RStudio have recently announced ‘RStudio Connect QuickStart’, which is a VM containing a full suite of RStudio’s pro tools, available to be trialled for a 45-day period. RStudio...

Read more »
