May 2020

Looking Normal(ly Distributed)

May 12, 2020 | R on Data & The World

Among all probability distributions, the normal distribution is probably the most well-established and well-characterized. The importance of things like the central limit theorem and the normality assumptions in linear regression highlight it well. One of the more interesting ones is the fact that you can approximate a binomial distribution with ...
[Read more...]

deep learning model for titanic data

May 12, 2020 | Modeling with R

Introducction Data preparation Partition the data & impute the missing values. Convert the data into a numeric matrix. Train the model. Create the model Compile the model Fit the model The model evaluation model tuning Conclusion Introducction Deep learning model belongs to the area of machine learning models which can be ... [Read more...]

Bayesian hyperparameters optimization

May 12, 2020 | Modeling with R

Introduction Bayesian optimization Acquisition functions Data preparation Random forest model The true distribution of the hyperparameters random search bayesian optimization UCB bayesian optimization PI bayesian optimization EI Contrast the results deep learning model Random search Bayesian optimization UCB Bayesian optimization PI Bayesian optimization EI Contrast the results Conclusion Session info ...
[Read more...]

Calculating ratios with Tidyverse

May 12, 2020 | R | TypeThePipe

Calculating percentages is a fairly common operation, right? However, doing it without leaving the pipeflow always force me to do some bizarre piping such as double grouping and summarise. I am using again the nuclear accidents dataset, and trying to calculate the percentage of accidents that happened in Europe each ... [Read more...]

coronavirus v0.2.0 is now on CRAN

May 12, 2020 | Rami Krispin

Version 0.2.0 of the coronavirus R data package was pushed today to CRAN. The coronavirus package provides a tidy format for Johns Hopkins University Center for Systems Science and Engineering (JHU CCSE) Coronavirus dataset. Version 0.2.0 catch up with the significant changes in the data that took place since the initial release ... [Read more...]

One-proportion and goodness of fit test (in R and by hand)

May 12, 2020 | R on Stats and R

Introduction In R Data One-proportion test Assumption of prop.test() and binom.test() Chi-square goodness of fit test Does my distribution follow a given distribution? Observed frequencies Expected frequencies Observed vs. expected frequencies By hand One-proportion test Verification in R Goodness of fit test Verification in R Introduction In a ...
[Read more...]

Filtering with string statements in dplyr

May 12, 2020 | R on Alan Yeung

A question came up recently at work about how to use a filter statement entered as a complete string variable inside dplyr’s filter() function – for example dplyr::filter(my_data, "var1 == 'a'"). There does not seem to be much out there on this and I was not sure how ... [Read more...]

Building R 4+ for Windows with OpenBLAS

May 12, 2020 | Avi

This post outlines the steps needed to build R 4+ for Windows with OpenBLAS. The release of R 4.0 includes significant changes to the Windows build system from prior versions—for the better! Before anything, we all owe Jeroen Ooms significant gratitude for the many hours he spent working on the build ...
[Read more...]

Will AI become conscious any time soon?

May 12, 2020 | Learning Machines

We all know the classical Sci-Fi trope of intelligent machines becoming conscious and all the potential ramifications that could follow from there (free will, fighting their human creators, ethical dilemmas and so forth). Now, is this a realistic scenario? As a researcher in the area of AI (see e.g. ... [Read more...]

How to schedule R scripts

May 11, 2020 | Andrew Treadway

Running R with taskscheduleR and cronR In a previous post, we talked about how to run R from the Windows Task Scheduler. This article will talk about two additional approaches to schedule R scripts, including using the taskscheduleR package on Windows and the cronR package for Linux. For scheduling Python ...
[Read more...]

In defence of the 95% CI

May 11, 2020 | R on easystats

TLDR: BayestestR currently uses a 89% threshold by default for Credible Intervals (CI). Should we change that? If so, by what? Join the discussion here. Magical numbers, or conventional thresholds, have bad press in statistics, and there are many of them. For instance, .05 (for the p-value), or the 95% range for the ...
[Read more...]
1 9 10 11 12 13 17

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)