MySQL Data Type Mapping in R

October 2, 2018
By

There was a recent question in the /r/Rlanguage subreddit which piqued my interest. They asked how to find the right mapping, and with the large…Continue ReadingMySQL Data Type Mapping in R

Read more »

How to build a logistic regression model from scratch in R

October 2, 2018
By
How to build a logistic regression model from scratch in R

In a previous post, we showed how using vectorization in R can vastly speed up fuzzy matching. Here, we will show you how to use R’s vectorization functionality to efficiently build a logistic regression model. Now we could just use the caret or stats packages to create a model, but building algorithms from scratch is The post How to...

Read more »

Manifold Visualization: Second Example

October 1, 2018
By
Manifold Visualization: Second Example

In last night’s post, I introduced prVis(), a new visualization tool which we have invented, available in our polyreg package. Recall that prVis() is intended as a simpler alternative to recent visualization tools like t-SNE and UMAP. Here I will post another example. The dataset is prgeng, included in the package. It consists of wage … Continue reading Manifold...

Read more »

RStudio 1.2 Preview: SQL Integration

October 1, 2018
By
RStudio 1.2 Preview: SQL Integration

The RStudio 1.2 Preview Release, available today, dramatically improves support and interoperability with many new programming languages and platforms, including SQL, D3, Python, Stan, and C++. Over the next few weeks on the blog, we’re going to be taking a look at improvements for each of these in turn. Today, we’re looking at SQL, and as a motivating example, we’re...

Read more »

outcomerate: Transparent Communication of Quality in Social Surveys

outcomerate: Transparent Communication of Quality in Social Surveys

Background Surveys are ubiquitous in the social sciences, and the best of them are meticulously planned out. Statisticians often decide on a sample size based on a theoretical design, and then proceed to inflate this number to account for “sample losses”. This ensures that the desired sample size is achieved, even in the presence of non-response. Factors that reduce the...

Read more »

Rcpp 0.12.19: And more updates

October 1, 2018
By

The next bi-monthly update in the 0.12.* series of Rcpp releases landed on CRAN early this morning. It was actually released on September 20, but I made a first cut (see #887 below) at a deprecation which CRAN and I decided to reverted for now, then ...

Read more »

BlueSky Statistics 6.04 GUI for R Update

October 1, 2018
By
BlueSky Statistics 6.04 GUI for R Update

It has been just a few months since I reviewed five free and open-source point-and-click graphical user interfaces (GUIs) to the R language. I plan to keep those reviews up to date as new features are added. BlueSky’s interface would … Continue reading →

Read more »

Using R in 3 Different Ways With Datazar Desktop

October 1, 2018
By
Using R in 3 Different Ways With Datazar Desktop

With the increase in R usage over the past few years and the sheer diversity of industries employing data-science and analytics, the backgrounds of the people using R day-to-day has also changed. Different background, from hard-sciences to software-eng...

Read more »

Modeling muti-category Outcomes With vtreat

October 1, 2018
By

vtreat is a powerful R package for preparing messy real-world data for machine learning. We have further extended the package with a number of features including rquery/rqdatatable integration (allowing vtreat application at scale on Apache Spark or data.table!). In addition vtreat and can now effectively prepare data for multi-class classification or multinomial modeling. The two … Continue reading Modeling...

Read more »

Full Stack Data Science (Transcript)

October 1, 2018
By
Full Stack Data Science (Transcript)

Here is a link to the podcast. Introducing Vicki Boykis Hugo: Hi there, Vicki, and welcome to DataFramed. Vicki: Thank you so much for having me. What are you known for in the data...

Read more »

BooST series II: Pricing Optimization

October 1, 2018
By
BooST series II: Pricing Optimization

By Gabriel Vasconcelos & Yuri Fonseca Introduction This post is the second of a series of examples of the BooST (Boosting Smooth Trees) model. You can see an introduction to the model here and the first example here. Our objective … Continue reading →

Read more »

Machine Learning in R with H2O and LIME: slides, photos and video

October 1, 2018
By
Machine Learning in R with H2O and LIME: slides, photos and video

Hello, everyone! Did you have fun at June 25th’s workshop with H2O and Lime? If you were there, we’re sure you did. If you weren’t… here’s a recap, with the explanation, the link to the live event and all the materials too! The workshop started at 19 in Mikamai, and it began with a bang! The post Machine Learning...

Read more »

Pull the Right Strings with stringr: Exercises

October 1, 2018
By
Pull the Right Strings with stringr: Exercises

By providing a set of wrappers to existing functions, the stringr package allows for simple, consistent and efficient manipulations of strings in R. Even though there are some more basic packages that offer strings-related functions, you might find yourself in need for a more complete and straightforward solution for handling strings in R. With a Related exercise sets: Bayesian Inference...

Read more »

Manifold Visualization: Polynomials to the Rescue

October 1, 2018
By
Manifold Visualization: Polynomials to the Rescue

Our arXiv paper and the associated R package polyreg caused a bit of a stir, both pro and con, when we first announced them here in June. The discussion even spread as far as Twitter, Reddit and Hacker News. We’ll be announcing a revised paper, and various new features to the package, very soon. But … Continue reading Manifold...

Read more »

tinkr: editing Markdown documents using XML tools

Remember our recent post showing that one can wrangle Markdown files programmatically without regex? That tech note showed how to convert Markdown bodies to XML in order to extract information from them. Now, this post goes one step further and presents tinkr, a package for converting .md and .Rmd files to XML, editing them, and… writing them back as Markdown! General tinkr workflow The goal of tinkr...

Read more »

sparklyr 0.9

September 30, 2018
By
sparklyr 0.9

Today we are excited to share that a new release of sparklyr is available on CRAN! This 0.9 release enables you to: Create Spark structured streams to process real time data from many data sources using dplyr, SQL, pipelines, and arbitrary R code. Monitor connection progress with upcoming RStudio Preview 1.2 features and support for properly interrupting Spark jobs from R. Use...

Read more »

Scraping twitter data to visualize trending tweets in Kuala Lumpur

September 30, 2018
By
Scraping twitter data to visualize trending tweets in Kuala Lumpur

(Disclaimer: I’ve no grudge against python programming language per se. I think its equally great. In the following post, I’m merely recounting my experience.) It’s been quite a while since I last posted. The reasons are numerous, notable being,...

Read more »

nanotime 0.2.3

September 30, 2018
By

A minor maintenance release of the nanotime package for working with nanosecond timestamps just arrived on CRAN. nanotime uses the RcppCCTZ package for (efficient) high(er) resolution time parsing and formatting up to nanosecond resolution, and the b...

Read more »

RcppAPT 0.0.5

September 29, 2018
By

A new version of RcppAPT – our interface from R to the C++ library behind the awesome apt, apt-get, apt-cache, … commands and their cache powering Debian, Ubuntu and the like – is now on CRAN. This version is a bit of experiment. I had asked on...

Read more »

Using R’s set.seed() to set seeds for use in C/C++ (including Rcpp)

September 29, 2018
By

In native R, the user sets the seed for random number generation (RNG) with set.seed(). Random number generators exist in C and C++ too; these need their own seeds, which are not obviously settable by set.seed(). Good news! It can be done. pacman::p_load(inline, purrr) rbernoulli Base R (or technically the stats package) provides no rbernoulli(). It’s a pretty gaping hole in the...

Read more »

4 ways to be more productive, using RStudio’s terminal

September 29, 2018
By
4 ways to be more productive, using RStudio’s terminal

Introduction RStudio version 1.1 introduced the Terminal functionality, which does not seem to be getting enough deserved attention and love even though it is very well integrated with the rest of the IDE and can be extremely useful for several daily use-cases. In this post we will try to cover 4 very common scenarios where the Terminal can be very useful...

Read more »

TSstudio 0.1.2

September 28, 2018
By

Happy to announce the release of TSstudio 0.1.2 to CRAN. The TSstudio package provides tools for descriptive and predictive analysis of time series data, utilizing the visualization enegin of the plotly package and forecasting models from the forecast, forecastHybrid and bsts packages. Installation Install the stable version from CRAN: install.packages("TSstudio") or install the development version from Github: # install.packages("devtools") devtools::install_github("RamiKrispin/TSstudio") New features The new release includes new...

Read more »

Beginner Intermediate and Advanced R courses February 2019

September 28, 2018
By

Beginner, Intermediate and Advanced R courses February 2019 This is an announcement of our next set of R courses. There are 4 separate courses taught over 5th-8th February 2019. You can register with the Centre for Applications in Resource Management...

Read more »

Sexual Assault in the 80s and Christine Blasey Ford’s Testimony

September 28, 2018
By
Sexual Assault in the 80s and Christine Blasey Ford’s Testimony

Dr. Ford’s testimony alleging that Judge Kavanaugh attempted to rape her when she was 15 was extremely difficult to watch as she was and still is deeply traumatized by the event. Kavanaugh’s abrasive and highly rehearsed obstructionist response was even more difficult to watch. In response I decided to look at public data to see if the type...

Read more »

XKCD "Curve Fitting", in R

September 28, 2018
By
XKCD "Curve Fitting", in R

You probably saw this XKCD last week, which brought a grimace of recognition to statisticians everywhere: It's so realistic, that Barry Rowlingson was able to reproduce all but two of the "charts" above with a simple R function (and a little help from the xkcd ggplot2 theme): And now for @revodavid et al, with the xkcd package and font!...

Read more »

simmer 4.0.1

September 28, 2018
By

The 4.0.1 release of simmer, the Discrete-Event Simulator for R, is on CRAN since a couple of weeks ago. There are few changes, notably new getters (get_sources(), get_resources(), get_trajectory()) for simmer environments and some improvements in resource selection policies (see details in help(select)). A new convenience function, when_activated, makes it easier to generate arrivals on demand, triggered from trajectories. Let us consider,… Continuar leyendo simmer 4.0.1...

Read more »

Quick Hit: Using the New Equal Earth Projection in R

September 28, 2018
By
Quick Hit: Using the New Equal Earth Projection in R

In my semi-daily run of brew update I noticed that proj4 had been updated to 5.2. I kinda “squeee“‘d since (as the release notes show) the Equal Earth projection was added to it (+proj=eqearth). As the team who created the projection describes it: “The Equal Earth map projection is a new equal-area pseudocylindrical projection for... Continue reading →

Read more »

Visualize your Portfolio’s Performance and Generate a Nice Report with R

September 27, 2018
By
Visualize your Portfolio’s Performance and Generate a Nice Report with R

CategoriesVisualizing Data Tags Data Visualisation dplyr ggplot2 lares R Programming There are few things more exciting than seeing your stocks values going up! I started investing last year in stocks and, like visualization and R lover, I couldn’t help but create some nice plots and functions to automate the process of watching it happen. Some of you might already know the lares package; I’ve included Related...

Read more »

Downloading 24-Hour Weather Updates in R

September 27, 2018
By
Downloading 24-Hour Weather Updates in R

This post serves as a follow-up on a previous post about scheduled collection of Weather.gov’s XML feed in R, which itself was a follow-up to retrieving real-time data from Weather.gov in Excel. Reflecting on the best way to accomplishing this automation, I noticed something back on Weather.gov’s update page: an option for a 2-day weather history! Duh!

Read more »

Search R-bloggers


Sponsors

Mango solutions





mckinsey.com

Zero Inflated Models and Generalized Linear Mixed Models with R



datasciencego.com

Quantide: statistical consulting and training

ODSC west

ODSC2 west

datasociety

http://www.eoda.de









Six Sigma Online Training

mljar.com

Our ads respect your privacy. Read our Privacy Policy page to learn more.

Contact us if you wish to help support R-bloggers, and place your banner here.