How to share R visualizations in Microsoft PowerPoint

April 3, 2019

Hadrien Dykiel is an RStudio Customer Success Engineer. Microsoft PowerPoint is often the de facto choice for creating presentation slides, especially at larger companies. In many organizations, it comes pre-installed on workstations and pretty much everybody knows how to use it. This can make it an effective medium for sharing information, since most folks are comfortable with it. Unfortunately, valuable...

(Trying) to get to the top of R-bloggers emails

(TL;DR: Author analyses R-Bloggers emails using the Gmail API. Decides on when to post to get to the top of the email list. This could either work well or fail spectacularly if he’s missed something. Either way, he learnt a lot. Let’s take a chance!)

What is a Permutation Test?

April 3, 2019

Permutation tests, which I'll be discussing in this post, aren't that widely used by econometricians. However, they shouldn't be overlooked. Let's begin with some background discussion to set the scene. This might seem a bit redundant, but it will help us to see how permutation tests differ from the sort of tests that we usually use in econometrics. Background Motivation: When you...

Simulating metapopulation occupation in a landscape

April 3, 2019

The objective of this post is to go into the inner workings of the package MetaLandSim, which I developed a few years ago. MetaLandSim’s main objectives are to i) simulate the occupation of a habitat network suffering some sort of change (but static landscapes work too); ii) simulate range expansion by a species with a …

C++ is Often Used in R Packages

April 3, 2019

The recent r-project article “Use of C++ in Packages” stated as its own summary of recommendation: don’t use C++ to interface with R. A careful reading of the article exposes at least two possible meanings of this: Don’t use C++ to directly call R or directly manipulate R structures. A technical point directly argued (for …

A text mining function for websites

April 3, 2019

For one of my projects I needed to download text from multiple websites. In this case, I used rvest and dplyr. Accessing the information you want can be relatively easy if the sources come from the same websites, but pretty tedious when the websites are heterogeneous. The reason is how the content is kept in the HTML of …

Bayes vs. the Invaders! Part One: The 37th Parallel

April 3, 2019

Introduction: From our earlier studies of UFO sightings, a recurring question has been the extent to which the frequency of sightings of inexplicable otherworldly phenomena depends on the population of an area. Intuitively: where there are more people to catch a glimpse of the unknown, there will be more reports...

Don’t forget the “utils” package in R

April 3, 2019

With thousands of powerful packages, it’s easy to gloss over the libraries that come preinstalled with R. Thus, this post will talk about some of the cool functions in the utils package, which comes with a standard installation of R. While utils comes with several familiar functions, like read.csv, write.csv, and help, it also contains ...

Wicked Fast, Accurate Quantiles Using ‘t-Digests’ in R with the {tdigest} Package

April 3, 2019

@ted_dunning recently updated the t-Digest algorithm he created back in 2013. What is this “t-digest”? Fundamentally, it is a probabilistic data structure for estimating any percentile of distributed/streaming data. Ted explains it quite elegantly in a short video, which has a full transcript as well. T-digests have been baked into many “big data” analytics...

Mapping the Vikings using R

April 3, 2019

The commute to my workplace is 90 minutes each way. Podcasts are my friend. I’m a long-time listener of In Our Time and enjoyed the recent episode about The Danelaw. Melvyn and I hail from the same part of the world, and I learned as a child that many of the local place names there …

Cost-effectiveness analysis with multi-state and partitioned survival models: hesim 0.2.0

April 2, 2019

Overview; An example CTSTM; Decision problem and model setup; Parameterization; Simulation; Constructing the economic model ...

R Expands to Machine Learning and Deep Learning at ODSC East

For many, R is the go-to language when it comes to data analysis and predictive analytics. However, many data scientists are also expanding their use of R to include machine learning and deep learning. These are exciting new topics, and ODSC East, where thousands of data scientists will gather this year in Boston, …

Standard Evaluation Versus Non-Standard Evaluation in R

April 2, 2019

There is a lot of unnecessary worry over “Non-Standard Evaluation” (NSE) in R versus “Standard Evaluation” (SE, or standard “variable names refer to values” evaluation). This very author is guilty of over-discussing the issue. But let’s give this yet another try. The entire difference between NSE and regular evaluation can be summed up in …

Data: a cultural transformation and not a quick fix

April 2, 2019

Amid stronger business competition than ever before, companies need to do more than simply embrace buzzwords or trends. It’s something we see all the time when out in the field talking to customers, or speaking at events. When it comes to the role of data, the emphasis should instead be on instilling transformation into the very DNA of an...

Randomisation tests comparing dependent correlations

April 2, 2019

This is about some academic work I did that never got published. But I think it should be out there …

Random sampling of files

April 2, 2019

A great part of my job as a bat ecologist is to classify bat species from their echolocation calls. I regularly use automatic recording devices that generate thousands of recordings per location. Dealing with this huge amount of information is not an easy task as you can imagine. In the old days each recording was …

Symbolic Regression, Genetic Programming… or if Kepler had R

April 2, 2019

A few weeks ago we published a post about using the power of the evolutionary method for optimization (see Evolution works!). In this post we will go a step further, so read on… A problem researchers often face is that they have an amount of data and need to find some functional form, e.g. some …

So you want to deploy multiple containers running different R models?

April 2, 2019

This tutorial is the second part of a series on professional R deployment. Please find the previous part here (How to make a dockerized plumber API secure with SSL and Basic Authentication). If you followed the first part in this tutorial series, you have achieved the following things: running your R code with a plumber...

tint 0.1.1: New Styles

April 1, 2019

With almost a year passed since the previous 0.1.0 release, a nice new release of the tint package arrived on CRAN today. Its name expands from tint is not tufte as the package offers a fresher take on the Tufte-style for HTML and PDF presentations. Th...

Data Science Software Used in Journals: Stat Packages Declining (including R), AI/ML Software Growing

April 1, 2019

In my neverending quest to track The Popularity of Data Science Software, it’s time to update the section on Scholarly Articles. The rapid growth of R could not go on forever and, as you’ll see below, its use actually declined …

Getting your toes wet in R: Hydrology, meteorology, and more

Importance of Hydrology: Given that liquid water is essential to life on Earth, water research cuts across numerous disciplines including hydrology, meteorology, geography, climate science, engineering, ecology, and more. Numerous R packages have emerged from this diversity of approaches, and we recently gathered many of them into a new rOpenSci task view which we broadly titled ‘Hydrology’ and published to...

A unified syntax for accessing models’ information

April 1, 2019

The richness and variety of packages for building and fitting statistical models in R is absolutely astonishing and contributes to the language’s popularity. However, this diversity makes it hard for developers who want to create tools that work with different types of models. Indeed, the way to access models’ internal information (such as parameter names, formulae, data, etc.) is...

Musings on missing data

April 1, 2019

I’ve been meaning to share an analysis I recently did to estimate the strength of the relationship between a young child’s ability to recognize emotions in others (e.g. teachers and fellow students) and her longer term academic success. The study itself is quite interesting (hopefully it will be published sometime soon), but I really wanted to write about it here...

Data Science R&D at TD Ameritrade

April 1, 2019

Hugo Bowne-Anderson, the host of DataFramed, the DataCamp podcast, recently interviewed Sean Law, a Senior Applied Researcher and Data Scientist at TD Ameritrade. Introducing Sean Law Hugo: Hi there, Sean, and welcome to DataFramed. Sean: ...

One week EARL London abstract extension – 8 April

April 1, 2019

Due to quite a few panicked email requests over the weekend, we have decided to extend the EARL London abstract deadline until 8 April. Thank you so much to everyone who has already submitted; we can’t wait to look through all of the brilliant submissions. Submit your abstract here.

Regression 101: Understanding business flows with OLS regression in R

April 1, 2019

OLS regression is one of the most commonly used methods for analysis and prediction. Furthermore, it provides the basis for many more advanced techniques. All the more reason to know how to conduct OLS regression in R and to understand the logic of this approach! This in-depth article provides an easy...

Bioconductor S4 classes for high-throughput omics data

April 1, 2019

Motivation: Multi-omics data integration and analysis. What a beast! It is one of the major challenges in the era of personalized/precision medicine (or whatever you want to call it). De...

Discrete Event Simulation Metamodeling – Splines with R and Arena

Simulation Metamodeling, that is, building and using surrogate models that can approximate results from more complicated simulation models, is an interesting approach to analyzing results from complicated, computationally expensive simulation models. Metamodels are useful because they can yield good approximations of the original simulation model response variables using fewer computational resources. For an introduction to Metamodeling, refer to (Barton...

Easyalluvial 0.2.0 released

March 31, 2019

easyalluvial allows you to build exploratory alluvial plots (sankey diagrams) with a single line of code while automatically binning numerical variables. In version 0.2.0, marginal histograms improve the visibility of those numerical variables. Further, a method has been added that creates model-agnostic 4-dimensional partial dependence alluvial plots to visualise the response of statistical models. Introduction: I am happy to...
