January 2017

Extracting and Enriching Ocean Biogeographic Information System (OBIS) Data with R

January 25, 2017 | 0 Comments

Programmatic access to biodiversity data is revolutionising large-scale, reproducible biodiversity research. In the marine realm, the largest global database of species occurrence records is the Ocean Biogeographic Information System, OBIS. As of January 2017, OBIS contains 47.78 million occurrences of 117,345 species, all openly available and accessible via the OBIS API. The number ... [Read more...]

Modelling extremes using generalized additive models

January 25, 2017 | 0 Comments

Quite some years ago, whilst working on the EU Sixth Framework project Euro-limpacs, I organized a workshop on statistical methods for analyzing time series data. One of the sessions was on the analysis of extremes, ably given by Paul Northrop (UCL Department of Statistical Science). That intro certainly whetted my ...
[Read more...]

A Glimpse into The Daily Life of a Data Scientist

January 24, 2017 | 0 Comments

A couple of weeks ago, I had a discussion with a co-worker regarding a project I was involved in, and I felt that there was no clear understanding of the daily challenges data scientists face. A few days later, I was at rstudio::conf 2017, where I met lots of data scientists from ... [Read more...]

a typo that went under the radar

January 24, 2017 | 0 Comments

A chance occurrence on X validated: a question on an incomprehensible formula for Bayesian model choice: which, most unfortunately!, appeared in Bayesian Essentials with R! Eeech! It looks like one line in our LaTeX file got erased and the likelihood part in the denominator altogether vanished. Apologies to all readers ...
[Read more...]

Building a machine learning model with the MicrosoftML package

January 24, 2017 | 0 Comments

Microsoft R Server 9 includes a new R package for machine learning: MicrosoftML. (So do the Data Science Virtual Machine and the free Microsoft R Client edition, incidentally.) This package includes a suite of fast predictive modeling functions implemented by Microsoft Research, including: Linear (rxFastLinear) and logistic (rxLogisticRegression) model functions based ... [Read more...]

Descriptive Analysis of MLST Data for MRSA

January 24, 2017 | 0 Comments

During one of my summers, I had the opportunity to conduct some research on the prevalence of methicillin-resistant Staphylococcus aureus (MRSA) in vulnerable populations, examining US emergency department data, and I thought this would be a pretty interesting topic to expand on for my thesis in lieu of the ...
[Read more...]

Building Shiny App Exercises (part 5)

January 24, 2017 | 0 Comments

RENDER FUNCTIONS In the fourth part of our series we just “scratched the surface” of reactivity by analyzing some of the properties of the renderTable function. Now it is time to get deeper and learn how to use the rest of the render functions that shiny provides. As you were ... [Read more...]

I’m a cowboy…

January 24, 2017 | 0 Comments

No, I don't ride on a steel horse. This post is about data wrangling with ChainLadder.
[Read more...]

Distribution of Mean of the Combinations of a Set.

January 24, 2017 | 0 Comments

For some purpose I found myself generating and analyzing the average of the combinations of a set, and when I generated the corresponding histogram I was surprised by its shape. It should be remembered that the combinations C(m, n) of a set are the number of subsets of a ... [Read more...]

xml2 1.1.1

January 24, 2017 | 0 Comments

Today we are pleased to release version 1.1.1 of xml2. xml2 makes it easy to read, create, and modify XML with R. You can install it with: install.packages("xml2"). As well as fixing many bugs, this release: makes it easier to create and modify XML; improves roundtrip support between XML ...
[Read more...]

sparklyr 0.5

January 24, 2017 | 0 Comments

We’re happy to announce that version 0.5 of the sparklyr package is now available on CRAN. The new version comes with many improvements over the first release, including: Extended dplyr support by implementing: do() and n_distinct(). New functions including sdf_quantile(), ft_tokenizer() and ft_regex_tokenizer(). Improved compatibility, ...
[Read more...]

Euler Problem 9 : Special Pythagorean Triple

January 24, 2017 | 0 Comments

Solution to Euler Problem 9 in the R Language: Find the Pythagorean triple for which a+b+c equals 1000. The post Euler Problem 9 : Special Pythagorean Triple appeared first on The Devil is in the Data. [Read more...]

How to do an analysis in R (part 2, visualization and analysis)

January 24, 2017 | 0 Comments

In several recent blog posts, I've emphasized the importance of data analysis. My main point has been that if you want to learn data science, you need to learn data analysis. Data analysis is the foundation of practical data science. With that statement in mind, I want to show you ... [Read more...]

How to use viridis colors with plotly and leaflet?

January 24, 2017 | 0 Comments

“… avoiding catastrophe becomes the first principle in bringing color to information: Above all, do no harm.” - Envisioning Information, Edward Tufte, Graphics Press, 1990 Choosing colors for your plot is not so simple. Why is that so? First of all, it depends on numerous things… What plot are you creating? What ... [Read more...]

Parallel Computation with R and XGBoost

January 23, 2017 | 0 Comments

XGBoost is a comprehensive machine learning library for gradient boosting. It originated in the Kaggle community for online machine learning challenges and has since been maintained through the collaborative efforts of developers in the community. It is well known for its accuracy, efficiency and flexibility for various interfaces: the ... [Read more...]

French villages and a sort of resolution

January 23, 2017 | 0 Comments

Sort of introduction to this post and hopefully the next ones: I usually don’t make any New Year’s resolutions. However, recent tweets about productivity – from people I actually find productive and inspiring – made me ponder a bit on my unfinished... [Read more...]