Data Science to Analyze Big Genomic Data

July 28, 2018
By
Data Science to Analyze Big Genomic Data

Finding the neural stem cell populations in mouse brain   Introduction The main objective of this project is to identify new stem cell populations in mouse brain. Characterizing the unique gene expression signature of these cells could be the starting point for finding and defining cancer stem cells (CSC) in human tumors. In particular, glioblastoma,

Read more »

Mysteriously Slow sample

July 28, 2018
By
Mysteriously Slow sample

Hi everyone, I'm at JSM 2018 right now, so feel free to drop by my session or drop by in the halls! Just give me a tweet! Back to the meat-and-potatoes of this post. A while ago I was running good old sample and comparing its performance to my lpm2_kdtree function in the BalancedSampling package (Grafström and Lisic,...

Read more »

Le Monde puzzle [#1062]

July 27, 2018
By
Le Monde puzzle [#1062]

A simple Le Monde mathematical puzzle none too geometric: Find square triangles which sides are all integers and which surface is its perimeter. Extend to non-square rectangles. No visible difficulty by virtue of Pythagore’s formula: for (a in 1:1e4) for (b in a:1e4) if (a*b==2*(a+b+round(sqrt(a*a+b*b)))) print(c(a,b)) produces two answers 5 12 6 8 and in

Read more »

aRt with code

July 27, 2018
By
aRt with code

Looking for something original to decorate your wall? Art With Code, created by Harvard University bioinformatician Jean Fan, provides a collection of R scripts to generate artistic images in the style of famous artworks, for example this randomly-generated piece in the style of Mondrian: Other art generators include "Tunnel" (rotated and scaled designs in the style of Päivi Julin),...

Read more »

No worries! Afterthoughts from UseR 2018

July 27, 2018
By
No worries! Afterthoughts from UseR 2018

This year the UseR conference took place in Brisbane, Australia. UseR is my favorite conference and this one was mine 11th (counting from Dortmund 2008).  Every UseR is unique. Every UseR is great. But my feelings are that European UseRs are (on average) more about math, statistics and methodology while US UseRs are more about … Czytaj dalej No...

Read more »

Weight loss in the U.S. – An analysis of NHANES data with tidyverse

July 27, 2018
By
Weight loss in the U.S. – An analysis of NHANES data with tidyverse

Based on a paper published in JAMA last year, the weight gain is increasing among US adults while there is no difference in the percentage of people that were trying to lose weight. The authors used the data from the National Health and Nutrition Examination Survey NHANES from 1988 to 2014 and calculated the proportion Related PostMachine Learning Results...

Read more »

The Ten Rules of Defensive Programming in R

July 27, 2018
By
The Ten Rules of Defensive Programming in R

When you think of R, defensive coding may not be your first thought. But writing code that fails well & is easy to debug is more important than you'd think. The post The Ten Rules of Defensive Programming in R appeared first on Doodling in Data.

Read more »

How to use Covariates to Improve your MaxDiff Model

July 27, 2018
By
How to use Covariates to Improve your MaxDiff Model

MaxDiff is a type of best-worst scaling. Respondents are asked to compare all choices in a given set and pick their best and worse (or...

Read more »

Using themes in ggplot2

July 27, 2018
By
Using themes in ggplot2

As noted elsewhere, sometimes beauty matters. A plot that’s pleasing to the eye will be considered more gladly, and thus might be understood more thoroughly. Also, since we at STATWORX oftentimes need to subsume and communicate our results, we have come to appreciate how a nice plot can upgrade any presentation. So how make a plot look good? How...

Read more »

Cucumber time, food on a 2D plate / plane

July 27, 2018
By
Cucumber time, food on a 2D plate / plane

Introduction It is 35 degree Celsius out side, we are in the middle of the ‘slow news season’, in many countries also called cucumber time.  A period typified by the appearance of less informative and frivolous news in the media. … Continue reading →

Read more »

EARL London interviews – Patrik Punco, NOZ Medien

July 27, 2018
By

Our next interviewee is Patrik Punco, Marketing Analyst at German media company, NOZ Medien. Patrik is presenting a lighting talk ‘Subscription Analytics with focus on Churn Pattern Recognition in a German News Company’ at EARL London. Ruth Thomson, Mango’s Practice Lead for Strategic Advice chatted to Patrik about the business need for his project, what value it created for the...

Read more »

Two new Apache Drill UDFs for Processing UR[IL]s and Internet Domain Names

July 26, 2018
By
Two new Apache Drill UDFs for Processing UR[IL]s  and Internet Domain Names

Continuing the blog’s UDF theme of late, there are two new UDF kids in town: drill-url-tools🔗 for slicing & dicing URI/URLs (just going to use ‘URL’ from now on in the post) drill-domain-tools🔗 for slicing & dicing internet domain names (IDNs). Now, if you’re an Apache Drill fanatic, you’re likely thinking “Hey hrbrmstr: don’t you... Continue reading →

Read more »

Hacking our way through UpSetR

Hacking our way through UpSetR

For our club meeting today we were going to summarize the Demystifying Data Science conference but we forgot that the videos are not released yet. Oops, we'll have to postpone our blog post. We didn't read the fine print that talk recordings will be available sometime next week. Sorry about that!— LIBD rstats club (@LIBDrstats) July 27, 2018 So we adjusted...

Read more »

CHAID v ranger v xgboost – a comparison – July 27, 2018

July 26, 2018
By
CHAID v ranger v xgboost – a comparison – July 27, 2018

In an earlier post, I focused on an in depth visit with CHAID (Chi-square automatic interaction detection). Quoting myself, I said “As the name implies it is fundamentally based on the venerable Chi-square test – and while not the most powerful (in terms of detecting the smallest possible differences) or the fastest, it really is easy to manage and...

Read more »

Announcing the 1st Bookdown Contest

July 26, 2018
By
Announcing the 1st Bookdown Contest

Since the release of the bookdown package in 2016, there have been a large number of books written and published with bookdown. Currently there are about 200 books (including tutorials and notes) listed on bookdown.org alone! We have also heard about other applications of bookdown based on custom templates (e.g., dissertations). As popular as bookdown is becoming, especially with teachers,...

Read more »

How to use rquery with Apache Spark on Databricks

July 26, 2018
By
How to use rquery with Apache Spark on Databricks

A big thank you to Databricks for working with us and sharing: rquery: Practical Big Data Transforms for R-Spark Users How to use rquery with Apache Spark on Databricks rquery on Databricks is a great data science tool.

Read more »

Stan Pharmacometrics conference in Paris July 24 2018

July 25, 2018
By

I just got back from attending this amazing conference in Paris:http://www.go-isop.org/stan-for-pharmacometrics---paris-franceA few people were disturbed/surprised by the fact that I am linguist ("what are you doing at an pharmacometrics conference?")....

Read more »

RStudio Connect 1.6.6 – Custom Emails

July 25, 2018
By
RStudio Connect 1.6.6 – Custom Emails

We are excited to announce RStudio Connect 1.6.6! This release caps a series of improvements to RStudio Connect’s ability to deliver your work to others. Custom Email The most significant change in RStudio Connect 1.6.6 is the new ability for publishers to customize the emails sent to others when they update their data products. In RStudio Connect, it is already...

Read more »

Explaining Black-Box Machine Learning Models – Code Part 2: Text classification with LIME

July 25, 2018
By
Explaining Black-Box Machine Learning Models – Code Part 2: Text classification with LIME

This is code that will encompany an article that will appear in a special edition of a German IT magazine. The article is about explaining black-box machine learning models. In that article I’m showcasing three practical examples: Explaining supervised classification models built on tabular data using caret and the iml package Explaining image classification models with keras and lime Explaining text classification...

Read more »

Singularity as a software distribution / deployment tool

July 25, 2018
By

In this blog post, I’ll explain how someone can take advantage of Singularity to make R or Python packages available as an image file to users. This is a necessity if the specific R or Python package is difficult to install across different operating systems making that way the installation process cumbersome. Lately, I’ve utilized the reticulate package in...

Read more »

rOpenSci Educators Collaborative: How Can We Develop a Community of Innovative R Educators?

rOpenSci Educators Collaborative: How Can We Develop a Community of Innovative R Educators?

tl;dr: we propose three calls to action: Share your curricular materials in the open. Participate in the rOpenSci Education profile series. Discuss with us how you want to be involved in rOpenSci Educators’ Collaborative. In previous posts in this series, we identified challenges that individual instructors typically face when teaching science with R, and shared characteristics of effective educational resources to help address...

Read more »

New Course: Structural Equation Modeling with lavaan in R

July 25, 2018
By
New Course: Structural Equation Modeling with lavaan in R

Here is the course link. Course Description When working with data, we often want to create models to predict future events, but we also want an even deeper understanding of how our data is connected or structured. In this course, you will explore ...

Read more »

New Course: Experimental Design in R

July 25, 2018
By
New Course: Experimental Design in R

Here is the course link. Course Description Experimental design is a crucial part of data analysis in any field, whether you work in business, health or tech. If you want to use data to answer a question, you need to design an experiment! In this c...

Read more »

New Project: Visualizing Inequalities in Life Expectancy

July 25, 2018
By
New Project: Visualizing Inequalities in Life Expectancy

Here is the project link. Project Description Do women live longer than men? How long? Does it happen everywhere? Is life expectancy increasing? Everywhere? Which is the country with the lowest life expectancy? Which is the one with the highest? In...

Read more »

Hi Pawel, I’m glad you enjoyed it.

July 25, 2018
By
Hi Pawel, I’m glad you enjoyed it.

Hi Pawel, I’m glad you enjoyed it. I was trying to play around with facet_grid() earlier but I guess I didn’t stumble upon the proper parameters. Your suggestion works perfectly; not only does it keep each grid x-axis width proportional to its length, but it also keeps appropriate space-between-variables. Thank you for sharing that!

Read more »

Forcasting the price of bitcoin with the CRAN forecast package

July 25, 2018
By
Forcasting the price of bitcoin with the CRAN forecast package

There is interest in bitcoin at the moment because it is displaying signs of steady year to year growth with brief boosts followed by rapid declines. It is considered a risky investment by investors yet, has the potential for high returns in a fairly short duration (1-2 years). John McAfee, inventor of McAfee anti virus

Read more »

Gender diversity in the film industry

July 25, 2018
By
Gender diversity in the film industry

The year 2017 has completely turned the film industry upside down. The allegations of harassment and sexual assault against Harvey Weinstein have raised the issue of sexism and misogyny in this industry to the eyes of the general public. In addition, it has helped raise awareness of the poor gender diversity and under-representation of women in Hollywood. One of the...

Read more »

JSM 2018 Itinerary

July 24, 2018
By
JSM 2018 Itinerary

JSM 2018 is almost here! Usually around this time, I comb through the entire program manually making an itinerary for myself. But this year I decided to try something new – a programmatic way of going through the program, and then building a Shiny app that helps me better navigate the online program. The end result of the app is...

Read more »

The Revamped bookdown.org Website

July 24, 2018
By
The Revamped bookdown.org Website

Since we announced the bookdown package in 2016, there have been a large number of books, reports, notes, and tutorials written with this package and published to https://bookdown.org. We were excited to see that! At the same time, however, ...

Read more »

Search R-bloggers


Sponsors

Mango solutions





Zero Inflated Models and Generalized Linear Mixed Models with R



datasciencego.com

Quantide: statistical consulting and training

ODSC2 west

ODSC1_london

datasociety

http://www.eoda.de

max kuhn









Six Sigma Online Training



mljar.com

computationalanalytics.com

Our ads respect your privacy. Read our Privacy Policy page to learn more.

Contact us if you wish to help support R-bloggers, and place your banner here.