Le Monde puzzle [#1063]

August 8, 2018

A simple (summertime?!) arithmetic Le Monde mathematical puzzle: a “powerful integer” is one whose prime divisors all have multiplicity at least 2. Are there two powerful integers in a row, i.e., such that both n and n+1 are powerful? Are there odd integers n such that n² – 1 is a powerful integer?
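As a rough illustration of the two questions in the puzzle, here is a brute-force sketch in Python (the linked post works in R; the function names `is_powerful`, `consecutive_powerful`, and `odd_with_powerful_square_minus_one` are hypothetical and not taken from it). It simply trial-divides each candidate and checks that every prime factor appears at least twice.

```python
def is_powerful(n: int) -> bool:
    """Return True if every prime factor of n has multiplicity >= 2."""
    if n < 1:
        return False
    p = 2
    while p * p <= n:
        if n % p == 0:
            count = 0
            while n % p == 0:
                n //= p
                count += 1
            if count < 2:
                return False
        p += 1
    # A leftover factor > 1 is a prime that divides n exactly once.
    return n == 1


def consecutive_powerful(limit: int) -> list:
    """Pairs (n, n + 1), 2 <= n < limit, where both members are powerful."""
    return [(n, n + 1) for n in range(2, limit)
            if is_powerful(n) and is_powerful(n + 1)]


def odd_with_powerful_square_minus_one(limit: int) -> list:
    """Odd n with 3 <= n < limit such that n*n - 1 is powerful."""
    return [n for n in range(3, limit, 2) if is_powerful(n * n - 1)]
```

For instance, `consecutive_powerful(10)` finds the pair (8, 9), since 8 = 2³ and 9 = 3², and `odd_with_powerful_square_minus_one(10)` finds n = 3, since 3² – 1 = 8. Trial division is only practical for small limits; it is meant as a check of the definitions, not an efficient search.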

Read more »

In case you missed it: July 2018 roundup

August 8, 2018

In case you missed them, here are some articles from July of particular interest to R users. A program to validate quality and security for R packages: the Linux Foundation's CII Best Practices Badge Program. R scripts to generate images in the style of famous artworks, like Mondrian's. A 6-minute video tour of the AI and Machine Learning services...

Read more »

Hotels vs Airbnb – Barcelona case study (proof of concept)

August 8, 2018

1 - Background and motivation. During the last few years, the sharing economy has become more and more ubiquitous, from taxi riding applications to DIY and so on. The real estate market is no exception, and in the hotels and hospitality sector we have seen more of these examples.

Read more »

Probability of Selecting Matched Pairs and the Hypergeometric Distribution

August 8, 2018

The problem: consider a case where we have a bag of marbles of a given size. The bag consists of black… The post Probability of Selecting Matched Pairs and the Hypergeometric Distribution appeared first on Daniel Oehm | Gradient Descending.

Read more »

IEEE Language Rankings 2018

August 8, 2018

Python retains its top spot in the fifth annual IEEE Spectrum top programming language rankings, and also gains a designation as an "embedded language". Data science language R remains the only domain-specific language in the top 10 (where it is listed as an "enterprise language") and drops one place compared to its 2017 ranking to take the #7 spot....

Read more »

Leave-one-out subset for your ggplot smoother

August 8, 2018

I have a dashboard at work that plots the number of contracts my office handles over time, and I wanted to add a trendline to show growth. However, a trendline fitted to the entire dataset skews low because our fiscal year just restarted. It took a little trial and error to figure out how to exclude the most recent year’s count...

Read more »

Bug discovered in MODIStsp!

We are sorry to report that we recently discovered a nasty bug (or rather, a stupid mistake…) in the MODIStsp package. The bug led to improper computation of custom spectral indices in the case that their formula included addition or subtraction oper...

Read more »

MODIStsp approved on rOpenSci!

We are happy to report that our MODIStsp package for automatic preprocessing of MODIS time series was recently approved for inclusion in the rOpenSci ecosystem of R packages for reproducible science! We wish to thank reviewers Leah Wasser and...

Read more »

phylotaR: Retrieve Orthologous Sequences from GenBank

In this technote I will outline what phylotaR was developed for, how to install it and how to run it with some simple examples. What is phylotaR? In any phylogenetic analysis it is important to identify sequences that share the same orthology – homologous sequences separated by speciation events. This is often performed by simply searching an online sequence repository using...

Read more »

Extending the OpenImageR package with Gabor feature extraction

August 7, 2018

This blog post illustrates the new functionality of the OpenImageR package (Gabor Feature Extraction). The Gabor features have been used extensively in image analysis and processing (Character and Face recognition). Gabor (Nobel prize winner,...

Read more »

Statistics Sunday: Highlighting a Subset of Data in ggplot2

August 7, 2018

Highlighting Specific Cases in ggplot2 Here's my belated Statistics Sunday post, using a cool technique I just learned about: gghighlight. This R package works with ggplot2 to highlight a subset of data. To demonstrate, I'll use a dataset I analyzed for a previous post about my 2017 reading habits. [Side note: My reading goal for this year is...

Read more »

Meta-packages, nails in CRAN’s coffin

August 7, 2018

Derek Jones recently discussed a possible future for the R ecosystem in “StatsModels: the first nail in R’s coffin”. This got me thinking about the future of CRAN (which I consider vital to R, and vital in distributing our work) in the era of super-popular meta-packages. Meta-packages are convenient, but they have a profoundly negative…

Read more »

Notes from the 2018 APDU Conference

August 7, 2018

I recently attended the 2018 Association of Public Data Users (APDU) conference. This was my second time attending the conference, and I enjoyed learning more... The post Notes from the 2018 APDU Conference appeared first on AriLamstein.com.

Read more »

greybox 0.3.0 – what’s new

August 7, 2018

Three months have passed since the initial release of greybox on CRAN. I would not say that the package develops like crazy, but there have been some changes since May. Let’s have a look. We start by loading both and : Rolling Origin First of all, function now has its own class

Read more »

New from RStudio: Package Manager

August 7, 2018

One of the few remaining hurdles when working with R in the enterprise is consistent access to CRAN. Often desktop class systems will have unrestricted access while server systems might not have any access at all. This inconsistency often stems from security concerns about allowing servers access to the internet. There have been many different approaches to solving this...

Read more »

Use R to write multiple tables to a single Excel file

August 7, 2018

The possibility of saving several tables in a single file is a nice feature of Excel. When sharing results with colleagues, it might be useful to compact everything in a single file. As a bioinformatician, I am too lazy to do that manually, and I searched the web for tools that allow doing that. I

Read more »

ShinyProxy 2.0.1 is out!

August 7, 2018

ShinyProxy is a novel, open source platform to deploy Shiny apps for the enterprise or larger organizations. Embedding Shiny Apps Although Shiny apps are very popular for interactive data analysis purposes, many organizations communicated a need to more closely integrate these apps within larger applications and portals. In previous releases we broke down the walls to make this happen: hiding the navbar,...

Read more »

A Tour of RStudio

August 7, 2018

In a previous post I explained how to install RStudio, a popular integrated development environment for the R programming language. Open up RStudio for the first time and it might look like some mad scientist’s Mission Control. In this post I will walk through each pane and what it does. From this you will start

Read more »

Kaggle Competition In 30 Minutes: Predict Home Credit Default Risk With R

We were very excited when Home Credit teamed up with Kaggle to host the Home Credit Default Risk Challenge. Default risk is a topic that impacts all financial institutions, one that machine learning can help solve. We decided to flip the goal of this c...

Read more »

The Growth of Soccer (1872 – Present)

August 6, 2018

Soccer (football) is one of the world’s most popular sports, played and watched by millions of people all around the world. But while today it is played in virtually every country, it hasn’t always been that way. As you can see in the animated map below, soccer started off as a small sport, segregated to…

Read more »

Animating the Goals of the World Cup: Comparing the old vs. new gganimate and tweenr API

August 6, 2018

Welcome to Part 3 of my series on “Visualizing the World Cup with R”! This is the culmination of this mini project that I've been working on throughout the World Cup (you can check out the GitHub repo here). In addition, from having listened to Thomas Pedersen's excellent keynote at UseR! 2018 in Brisbane on...

Read more »

What they forgot to teach you about R

August 6, 2018

Join Jenny Bryan and Jim Hester of RStudio for this two-day hands-on workshop designed for experienced R and RStudio users who want to (re)design their R lifestyle! If you missed this sold-out course at rstudio::conf 2018, now is your chance. Register here: https://www.rstudio.com/workshops/what-they-forgot-to-teach-you-about-r/ In this workshop you’ll learn holistic workflows that address the most common sources of friction in data...

Read more »

Two Big Ideas from JSM 2018

August 6, 2018

The Joint Statistical Meetings offer an astounding number of talks. It is impossible for an individual to see more than a small portion of what is going on. Even so, a diligent attendee ought to come away with more than a few good ideas. The following are two big ideas that I got from the conference. Session 149, an invited...

Read more »

Extracting and Processing eBird Data

eBird is an online tool for recording bird observations. The eBird database currently contains over 500 million records of bird sightings, spanning every country and nearly every bird species, making it an extremely valuable resource for bird research and conservation. These data can be used to map the distribution and abundance of species, and assess how species’ ranges are changing over time. This dataset is...

Read more »

optimParallel: speed up your optim() calls

August 6, 2018

The R package optimParallel provides parallel versions of the gradient-based optimization methods of optim(). Its main function optimParallel() has the same usage and output as optim() while speeding up optimization significantly.

Read more »

Mapping the stock market using self-organizing maps

August 6, 2018

Self-organizing maps are an unsupervised learning approach for visualizing multi-dimensional data in a two-dimensional plane. They are great for clustering and for finding correlations in the data. In this post we apply self-organizing maps to historical US stock market data to find interesting correlations and clusters. We'll use data from Shiller, Goyal and BLS to calculate the historical valuation levels, interest rates, inflation...

Read more »

SQL Server 2017 Machine Learning services with R book

August 6, 2018

Yes, I am finally blogging this. This blog post is slightly different, since it brings you the title of the book that my dear friend Julie Koesmarno (blog | twitter) and I have written in and it was published…

Read more »

Data Journalism & Interactive Visualization (Transcript)

August 6, 2018

Here is a link to the podcast. Introducing Amber Thomas Hugo: Hi there, Amber, and welcome to DataFramed. Amber: Hey, hi. Thanks for having me. Hugo: ...

Read more »

Temporal aggregations on time series data – Writing R functions to tidy meteorological data and getting some insights from it

August 6, 2018

Hi everyone! In this post we’re going to work with time series data, and write R functions to aggregate hourly and daily time series into monthly time series to catch a glimpse of their underlying patterns. For this analysis we’re going to use public meteorological data recorded by the government of the Argentinian province of

Read more »
