ROC for Decision Trees – where did the data come from?

August 8, 2020 | 0 Comments

ROC for Decision Trees – where did the data come from? By Jerry Tuttle       In doing decision tree classification problems, I have often graphed the ROC (Receiver Operating Characteristic) curve. The True Positive Rate (TPR) is on the y-axis, and the False Positive Rate (FPR) is on the x-axis. True Positive ... [Read more...]

Quick Hit: Speeding Up Data Frame Creation

August 8, 2020 | 0 Comments

(This is part 2 of n “quick hit” posts, each walking through some approaches to speeding up components of an iterative operation. Go here for part 1). Thanks to the aforementioned previous post, we now have a super fast way of reading individual text files containing HTTP headers from HEAD requests into ... [Read more...]

Characterisation of human metabolic phenotypes using R

August 8, 2020 | 0 Comments

Characterising and understanding how human phenotypes relates to populations requires being able to count the occurrence of certain traits in individuals from different populations. Using NMR and MS based metabolic profiles, this means being able to estimate the presence of a feature, aka a signal, across the whole dataset aggregate ...
[Read more...]

RVowpalWabbit 0.0.15: Some More CRAN Build Issues

August 7, 2020 | 0 Comments

Another maintenance RVowpalWabbit package update brought us to version 0.0.15 earlier today. We attempted to fix one compilation error on Solaris, and addressed a few SAN/UBSAN issues with the gcc build. As noted before, there is a newer package rvw ... [Read more...]

Why I don’t use R Markdown’s ref.label

August 7, 2020 | 0 Comments

Outline R Markdown provides the chunk option ref.label to reuse chunks. In this post, I'll show potential problems with this approach and present an easy and safe alternative. If you don't bother with the detailed Explanation, feel free to jump right to the Summary section. Explanation Consider you have ... [Read more...]

Will Netflix Renew the Show?

August 7, 2020 | 0 Comments

In last couple of years, Netflix has become a part of my lifestyle. At the end of my day when I turn on my TV, by default i’m tuned to check out Netflix. I always look forward for Friday when they release their original content and make sure I ...
[Read more...]

Quick Hit: Comparison of “Whole File Reading” Methods

August 7, 2020 | 0 Comments

(This is part 1 of n posts using this same data; n will likely be 2-3, and the posts are more around optimization than anything else.) I recently had to analyze HTTP response headers (generated by a HEAD request) from around 74,000 sites (each response stored in a text file). They look ... [Read more...]

High School Swimming State-Off Tournament Florida (3) vs. Illinois (6)

August 6, 2020 | 0 Comments

This week in the High School Swimming State-Off Tournament we have the third seeded Sunshine State, Florida (3) taking on the sixth seeded Illinois (6) aka the Prairie State. library(SwimmeR) library(dplyr) library(stringr) library(purrr) library(flextable) Florida Results Florida has a nice results repository, with Hy-Tek real-time results. This ... [Read more...]

nanotime 0.3.0: Yuge New Features!

August 6, 2020 | 0 Comments

A fresh major release of the nanotime package for working with nanosecond timestamps is hitting CRAN mirrors right now. nanotime relies on the RcppCCTZ package for (efficient) high(er) resolution time parsing and formatting up to nanosecond resolutio... [Read more...]

Join us at R/Medicine – Aug 27-29, 2020

August 6, 2020 | 0 Comments

August 27-29, 5:30am PDT / 8:30am EDT / 2:30pm CEST – Register now! Brought to you by the Children’s Hospital of Philadelphia, Yale School of Public Health, and the R... The post Join us at R/Medicine – Aug 27-29, 2020 appeared first on R Consortium.
[Read more...]

Basic manipulation of GIF frames with magick

August 6, 2020 | 0 Comments

The magick package is a really powerful package for image processing in R. The official vignette is a great place to start learning how to use the package. I’ve been playing around with using magick for manipulating GIFs and found … Continue reading → [Read more...]

Useful YAML options for generating HTML reports in R

August 5, 2020 | 0 Comments

I think most people start using R with scripts initially. Scripts are great for modular computational tasks, but for generating informative reports, Rmarkdown is a must. Rmd makes it possible to use a YAML header to specify certain parameters right at the beginning of the document. Built-in YAML parameters make ... [Read more...]

Tutorial: Getting Started with R and RStudio

August 5, 2020 | 0 Comments

Get your R programming journey off on the right foot with this RStudio tutorial that walks through everything from installation to best practices. The post Tutorial: Getting Started with R and RStudio appeared first on Dataquest.
[Read more...]

Image contours in R

August 5, 2020 | 0 Comments

I recently came across this short fun post on R-bloggers that demonstrated how to use the image.ContourDetector package (available on CRAN) to extract contours from an image. The image of the contours looked really cool so I thought I would … Continue reading →
[Read more...]

RcppCCTZ 0.2.8: Minor API Extension

August 4, 2020 | 0 Comments

A new minor release 0.2.8 of RcppCCTZ is now on CRAN. RcppCCTZ uses Rcpp to bring CCTZ to R. CCTZ is a C++ library for translating between absolute and civil times using the rules of a time zone. In fact, it is two libraries. One for dealing with civ... [Read more...]

Predicting pneumonia outcomes: EDA part 1

August 4, 2020 | 0 Comments

Intro Data dictionary EDA blueprint Outcome Discard the noise 1 Other_ related category 2 Pt_ Patient related category Appropriate patients Case_number Age 3 R_ Radiology related category Effusion and effusion site On chest x-ray ( R_CXR_effusion, R_CXR_effusionSite) On CT chest (R_CT_effusion, R_CT_effusionSite) 4 SS_ Category related ...
[Read more...]

Cheesecake Diagrams: Pie Charts with a Different Flavour

August 4, 2020 | 0 Comments

Part of my job at a regional water utility involves visualising operational data. We manage water and sewerage services for a large number of small and medium-sized towns in regional Victoria (Australia). Traditionally, performance reports consist of extensive tables filled with numbers with a line for each city. To make ...
[Read more...]

Do, Share, Teach, and Learn Data Science with RStudio Cloud

August 4, 2020 | 0 Comments

RStudio is proud to announce the general availability of RStudio Cloud, its cloud-based platform for doing, teaching, and learning data science using only a browser. This general release incorporates feedback from thousands of users, based on more than 3.5 million hours of compute time. What is RStudio Cloud? RStudio Cloud is ... [Read more...]
1 2 3 1,681