R bloggers

A guide to working with character data in R

June 22, 2018
By
A guide to working with character data in R

R is primarily a language for working with numbers, but we often need to work with text as well. Whether it's formatting text for reports, or analyzing natural language data, R provides a number of facilities for working with character data. Handling Strings with R, a free (CC-BY-NC-SA) e-book by UC Berkeley's Gaston Sanchez, provides an overview of the...

Read more »

Using DataCamp’s Autograder to Teach R

June 22, 2018
By
Using DataCamp’s Autograder to Teach R

Immediate and personalized feedback has been central to the learning experience on DataCamp since we launched the first courses. If students submit code that contains a mistake, they are told where they made a mistake, and how they can fix this. You can play around with it in our free Introduction to R course. The screenshot below is from...

Read more »

Melt and cast the shape of your data.frame – Exercises

June 22, 2018
By
Melt and cast the shape of your data.frame – Exercises

  Datasets often arrive to us in a form that is different from what we need for our modelling or visualisations functions who in turn don’t necessary require the same format. Reshaping data.frames is a step that all analysts need but many struggle with. Practicing this meta-skill will in the long-run result in more time Related exercise sets:Spatial Data...

Read more »

Creating Slopegraphs with R

June 22, 2018
By
Creating Slopegraphs with R

Presenting data results in the most informative and compelling manner is part of the role of the data scientist. It's all well and good to master the arcana of some algorithm, to manipulate and master the numbers and bend them to your will to produce a “solution” that is both accurate and useful. But, those Related PostHow to use...

Read more »

Parallelizing Linear Regression or Using Multiple Sources

June 21, 2018
By
Parallelizing Linear Regression or Using Multiple Sources

My previous post was explaining how mathematically it was possible to parallelize computation to estimate the parameters of a linear regression. More speficially, we have a matrix which is matrix and a -dimensional vector, and we want to compute by spliting the job. Instead of using the observations, we’ve seen that it was to possible to compute “something” using...

Read more »

Announcing new software review editors: Anna Krystalli and Lincoln Mullen

Announcing new software review editors: Anna Krystalli and Lincoln Mullen

Part of rOpenSci’s mission is to create technical infrastructure in the form of carefully vetted R software tools that lower barriers to working with data sources on the web. Our open peer software review system for community-contributed tools is a key component of this. As the rOpenSci community grows and more package authors submit their work for peer review,...

Read more »

Idle thoughts lead to R internals: how to count function arguments

June 21, 2018
By
Idle thoughts lead to R internals: how to count function arguments

“Some R functions have an awful lot of arguments”, you think to yourself. “I wonder which has the most?” It’s not an original thought: the same question as applied to the R base package is an exercise in the Functions chapter of the excellent Advanced R. Much of the information in this post came from … Continue reading Idle...

Read more »

A Comparative Review of the BlueSky Statistics GUI for R

June 21, 2018
By
A Comparative Review of the BlueSky Statistics GUI for R

Introduction BlueSky Statistics’ desktop version is a free and open source graphical user interface for the R software that focuses on beginners looking to point-and-click their way through analyses.  A commercial version is also available which includes technical support and a … Continue reading →

Read more »

Non-Linear Model in R Exercises

June 21, 2018
By

A mechanistic model for the relationship between x and y sometimes needs parameter estimation. When model linearisation does not work,we need to use non-linear modelling. There are three main differences between non-linear and linear modelling in R: 1. specify the exact nature of the equation 2. replace the lm() with nls() which means nonlinear least Related exercise sets:Spatial Data...

Read more »

Scraping Responsibly with R

Scraping Responsibly with R

I recently wrote a blog post here comparing the number of CRAN downloads an R package gets relative to its number of stars on GitHub. What I didn’t really think about during my analysis was whether or not scraping CRAN was a violation of its Terms and Conditions. I simply copy and pasted some code from R-bloggers that...

Read more »

Search R-bloggers


Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)