July 2017

Building a website with `pkgdown`: a short guide

July 31, 2017 | 0 Comments

As promised in my last post, here is a short guide with some tips and tricks for building a documentation website for an R package using pkgdown. In the end, this guide ended up way longer than I was expecting, but I hope you’ll find it useful, although it ...
[Read more...]

What analysis programs drive conservation science?

July 31, 2017 | 0 Comments

What analysis programs drive conservation science? With the International Congress for Conservation Biology on at the end of July I was wondering, what analysis programs are supporting conservation science? And, what programs support spatial analysis ... [Read more...]

How to use H2O with R on HDInsight

July 31, 2017 | 0 Comments

H2O.ai is an open-source AI platform that provides a number of machine-learning algorithms that run on the Spark distributed computing framework. Azure HDInsight is Microsoft's fully-managed Apache Hadoop platform in the cloud, which makes it easy to spin up and manage Azure clusters of any size. It's also ... [Read more...]

Counterfactual estimation on nonstationary data, be careful!!!

July 31, 2017 | 0 Comments

By Gabriel Vasconcelos In a recent paper that can be downloaded here, Carvalho, Masini and Medeiros show that estimating counterfactuals in a non-stationary framework (when I say non-stationary it means integrated) is a tricky task. It is intuitive that the … Continue reading →
[Read more...]

Machine Learning Explained: Dimensionality Reduction

July 31, 2017 | 0 Comments

Dealing with a lot of dimensions can be painful for machine learning algorithms. High dimensionality will increase the computational complexity, increase the risk of overfitting (as your algorithm has more degrees of freedom) and the sparsity of the data will grow. Hence, dimensionality reduction will project the data in a ...
[Read more...]

Google Vision API in R – RoogleVision

July 31, 2017 | 0 Comments

Using the Google Vision API in R Utilizing RoogleVision After doing my post last month on OpenCV and face detection, I started looking into other algorithms used for pattern detection in images. As it turns out, Google has done a phenomenal job with their Vision API. It’s absolutely incredible ...
[Read more...]

Upcoming Talk at the Bay Area R Users Group (BARUG)

July 31, 2017 | 0 Comments

Next Tuesday (August 8) I will be giving a talk at the Bay Area R Users Group (BARUG). The talk is titled Beyond Popularity: Monetizing R... The post Upcoming Talk at the Bay Area R Users Group (BARUG) appeared first on AriLamstein.com.
[Read more...]

sparklyr 0.6

July 30, 2017 | 0 Comments

We’re excited to announce a new release of the sparklyr package, available in CRAN today! sparklyr 0.6 introduces new features to: Distribute R computations using spark_apply() to execute arbitrary R code across your Spark cluster. You can now use all of your favorite R packages and functions in a ... [Read more...]

Data visualization with googleVis exercises part 9

July 30, 2017 | 0 Comments

Histogram & Calendar chart This is part 9 of our series and we are going to explore the features of two interesting types of charts that googleVis provides like histogram and calendar charts. Read the examples below to understand the logic of what we are going to do and then test yous ... [Read more...]

Matching, Optimal Transport and Statistical Tests

July 30, 2017 | 0 Comments

To explain the “optimal transport” problem, we usually start with Gaspard Monge’s “Mémoire sur la théorie des déblais et des remblais“, where the the problem of transporting a given distribution of matter (a pile of sand for instance) into another (an excavation for instance). This problem ...
[Read more...]

Scripting for data analysis (with R)

July 30, 2017 | 0 Comments

Course materials (GitHub) This was a PhD course given in the spring of 2017 at Linköping University. The course was organised by the graduate school Forum scientium and was aimed at people who might be interested in using R for data analysis. The materials developed from a part of a ...
[Read more...]

Understanding Overhead Issues in Parallel Computation

July 29, 2017 | 0 Comments

In my talk at useR! earlier this month, I emphasized the fact that a major impediment to obtaining good speed from parallelizing an algorithm is systems overhead of various kinds, including: Contention for memory/network. Bandwidth limits — CPU/memory, CPU/network, CPU/GPU. Cache coherency problems. Contention for I/O ...
[Read more...]

Forecasting workshop in Perth

July 29, 2017 | 0 Comments

On 26-28 September 2017, I will be running my 3-day workshop in Perth on “Forecasting: principles and practice” based on my book of the same name. Topics to be covered include seasonality and trends, exponential smoothing, ARIMA modelling, dynamic regression and state space models, as well as forecast accuracy methods and ... [Read more...]

More documentation for Win-Vector R packages

July 29, 2017 | 0 Comments

The Win-Vector public R packages now all have new pkgdown documentation sites! (And, a thank-you to Hadley Wickham for developing the pkgdown tool.) Please check them out (hint: vtreat is our favorite). The package sites: cdata replyr seplyr sigr vtre...
[Read more...]

Updated overbought/oversold plot function

July 29, 2017 | 0 Comments

A good six years ago I blogged about plotOBOS() which charts a moving average (from one of several available variants) along with shaded standard deviation bands. That post has a bit more background on the why/how and motivation, but as a teaser here is the resulting chart of the ... [Read more...]

R Markdown exercises part 1

July 29, 2017 | 0 Comments

INTRODUCTION R Markdown is one of the most popular data science tools and is used to save and execute code, create exceptional reports whice are easily shareable. The documents that R Markdown provides are fully reproducible and support a wide variety of static and dynamic output formats. Using markdown syntax, ... [Read more...]

Stan Weekly Roundup, 28 July 2017

July 28, 2017 | 0 Comments

Here’s the roundup for this past week. Michael Betancourt added case studies for methodology in both Python and R, based on the work he did getting the ML meetup together: RStan workflow PyStan workflow Michael Betancourt, along with Mitzi Morris, Sean Talts, and Jonah Gabry taught the women in ... [Read more...]
1 2 3 14

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)