Boxplots are a good way to get some insight in your data, and while R provides a fine ‘boxplot’ function, it doesn’t label the outliers in the graph. However, with a little code you can add labels yourself:The numbers plotted next to ...

Introducing: Machine Learning in R Machine learning is a branch in computer science that studies the design of algorithms that can learn. Typical machine learning tasks are concept learning, function learning or “predictive modeling”, clustering and finding predictive patterns. These tasks are learned through available data that were observed through experiences or instructions, for example. The post

by Herman Jopia What is Binning? Binning is the term used in scoring modeling for what is also known in Machine Learning as Discretization, the process of transforming a continuous characteristic into a finite number of intervals (the bins), which allows for a better understanding of its distribution and its relationship with a binary variable. The bins generated by...

Yesterday, I did upload a post where I tried to show that “standard” regression models where not performing bad. At least if you include splines (multivariate splines) to take into accound joint effects, and nonlinearities. So far, I do not discuss the possible high number of features (but with boostrap procedures, it is possible to assess something related to...

(This is a guest post by my friend Yoni Sidi, a PhD candidate in statistics at the Hebrew University) Background The Israeli elections are coming up this Tuesday, 17/3/2015 (i.e.: tomorrow!). They are a bit more complicated than your average US presidential race. The elections in Israel are based on nationwide proportional representation. The electoral … Continue reading...

by Matt Sundquist co-founder of Plotly Domino's new R Notebook and Plotly's R API let you code, make interactive R and ggplot2 graphs, and collaborate entirely online. Here is the Notebook in action: Published R Notebook To execute this Notebook, or to build your own, head to Domino's Plotly Project. The GIF below shows how to get started: choose...

On March 20, Thomas Lumley, the creator of the R Package “Survey”, will give an online course (in statistics.com) titled “Survey Analysis in R” The purpose of this 4-week online course, is to teach survey researchers who are familiar with R how to use it in survey research. The course uses Lumley’s Survey package. You will learn how to...

This post summarizes and visualizes eight key ideas we’ve heard from data visualization experts. Check out our first Case Study to learn more about using Plotly Enterprise on-premise, on your servers. To get started on free online graphing like in this post, check out our tutorials. Make Interactive Graphs Pictures of graphs in PowerPoints, dashboards, and emails can...