Blog Archives

Package Spotlight: anim.plots

May 28, 2019
By

The package anim.plots behaves like a sort of user-friendly shell on top of animate that makes animations of some of the most common types of plots in base R in a more intuitive fashion that animate. This package depends on two other important packages: -   magick, which is an R implementation of imageMagick, which itself is software used to create animated...

Read more »

Natural Language Processing in R: Edit Distance

April 9, 2019
By
Natural Language Processing in R: Edit Distance

These are the notes for the second lecture in the unit on text processing. Some useful ideas like exact string matching and the definitions of characters and strings are covered in the notes of Natural Language Processing in R: Strings and Regular Expressions Edit distance, also called Levenshtein distance, is a measure of the number of...

Read more »

Natural Language Processing in R: Strings and Regular Expressions.

April 9, 2019
By
Natural Language Processing in R: Strings and Regular Expressions.

In this post, I go through a lesson in natural language processing (NLP), in R. Specifically, it covers how strings operate in R, how regular expressions work in the stringr package by Hadley Wickham, and some exercises. Included with the exercises are a list of expected hang-ups, as well as an R function that can...

Read more »

Writing R documentation, simplified

April 7, 2019
By
Writing R documentation, simplified

A massive part of statistical software development is the documentation. Good documentation is more than just a help file, it serves as commentary on how the software works, includes use cases, and cites any relevant sources. One cool thing about R documentation is that it uses a system that allows it to be put into a variety of...

Read more »

Bingo analysis, a tutorial in R

April 1, 2019
By
Bingo analysis, a tutorial in R

I'm toying with the idea of writing a book about statistical analyses of classic games. The target audience would be mathematically interested laypeople, much like Jeffrey Rosenthal's book Struck by Lightning ( https://www.amazon.ca/Struck-Lightning-Jeffrey-S-Rosenthal/dp/0006394957 ). The twist would be that chapter would contain step-by-step R code or Python code so that the reader could do the same analysis and make...

Read more »

Dataset – The Giant Marmots of Moscow

March 26, 2019
By

Stat 403/640/890 Analysis Assignment 3: Polluted Giant Marmots Due Wednesday, April 3rd Drop off in the dropbox by the stats workshop, or hand in in class. For this assignment, use the Marmots_Real.csv dataset. Main goal: The giant marmots of Moscow have a pollution problem. Find a model to predict the pollutant concentration (mg per kg) in the local population without resorting to measuring...

Read more »

Parameter Estimation of Binned Data

October 31, 2018
By
Parameter Estimation of Binned Data

Section 1: Introduction – The Problem of Binned Data Hypothetically, say you’re given data like this in Table 1 below, and you’re asked to find the mean: Group Frequency 0 to 25 114 25 to 50 ...

Read more »

Analyzing Jeopardy in R – Part 2

May 25, 2018
By
Analyzing Jeopardy in R – Part 2

My previous Jeopardy analyzer was built using a base of about 30 daily Coryat scores. This one has more than 1600 scores that were either recorded directly, e-mailed to me, or scraped from the forum at jboard.tv . Here we look at the consistency of tou...

Read more »

Search R-bloggers

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)