t-sne dimension reduction on Spotify mp3 samples

January 31, 2018
By
t-sne dimension reduction on Spotify mp3 samples

Introduction Not long ago I was reading on t-Distributed Stochastic Neighbor Embedding (t-sne), a very interesting dimension reduction technique, and on Mel frequency cepstrum a sound processing technique. Details of both techniques can be found here and here. Can we combine … Continue reading →

Read more »

Last call for the course on Advanced R programming

January 31, 2018
By
Last call for the course on Advanced R programming

Last call for the course on Advanced R programming scheduled in Leuven, Belgium on Febuary 20-21 2018. Register at: https://lstat.kuleuven.be/training/coursedescriptions/AdvancedprogramminginR.html You'll learn during that course: The apply family of functions, basic parallel programming for these functions and commonly needed data manipulation skills Making a basic reproducible report using Sweave and knitr including tables, graphs and literate programming How to create an R package Understand how...

Read more »

Connecting to SQL Server on shinyapps.io

January 31, 2018
By

If you use SQL Server (or Azure SQL DB) as your data store and you need to connect to the databasse from shinyapps.io, you’re presently stuck with FreeTDS. If you have any control over infrastructure I cannot recommend highly enough the actual ODBC Driver on Linux for ease. Alas, shinyapps.io does not let you control the infrastructure. We have...

Read more »

Where do you run to? Map your Strava activities on static and Leaflet maps.

January 31, 2018
By
Where do you run to? Map your Strava activities on static and Leaflet maps.

So, Strava's heatmap made quite a stir the last few weeks. I decided to give it a try myself. I wanted to create some kind of "personal heatmap" of my runs, using Strava's API. Also, combining the data with Leaflet maps allows us to make use of the beautiful map tiles supported by Leaflet and to zoom and move...

Read more »

Fair communication requires mutual consent

January 30, 2018
By
Fair communication requires mutual consent

I was pleased to read Shirish Agarwal's blog in reply to the blog I posted last week Do the little things matter? Given the militaristic theme used in my own post, I was also somewhat amused to see news this week of the Strava app leaking locations and...

Read more »

Create your Machine Learning library from scratch with R ! (1/3)

January 30, 2018
By
Create your Machine Learning library from scratch with R ! (1/3)

When dealing with Machine Learning problems in R, most of the time you rely on already existing libraries. This fastens the analysis process, but do you really understand what is behind the algorithms? Could you implement a logistic regression from scratch with R? The goal of this post is to create our own basic machine The post Create your...

Read more »

Deep Learning from first principles in Python, R and Octave – Part 3

January 30, 2018
By
Deep Learning from first principles in Python, R and Octave – Part 3

“Once upon a time, I, Chuang Tzu, dreamt I was a butterfly, fluttering hither and thither, to all intents and purposes a butterfly. I was conscious only of following my fancies as a butterfly, and was unconscious of my individuality as a man. Suddenly, I awoke, and there I lay, myself again. Now I do … Continue reading Deep...

Read more »

A smooth transition between chloropleth and cartogram

January 30, 2018
By

Read more »

PK/PD reserving models

January 29, 2018
By
PK/PD reserving models

This is a follow-up post on hierarchical compartmental reserving models using PK/PD models. It will show how differential equations can be used with Stan/ brms and how correlation for the same group level terms can be modelled. PK/ PD is usually short for pharmacokinetic/ pharmacodynamic models, but as Eric Novik of Generable pointed out to me, it could also be...

Read more »

Scraping Wikipedia Tables from Lists for Visualisation

January 29, 2018
By
Scraping Wikipedia Tables from Lists for Visualisation

Get WikiTables from Lists Recently I was asked to submit a short take-home challenge and I thought what better excuse for writing a quick blog post! It was on short notice so initially I stayed within the confines of my comfort zone and went for something safe and bland. However, I alleviated that rather fast; I guess you want to...

Read more »

JAX 2018 talk announcement: Deep Learning – a Primer

January 29, 2018
By
JAX 2018 talk announcement: Deep Learning – a Primer

I am happy to announce that on Tuesday, April 24th 2018 Uwe Friedrichsen and I will give a talk about Deep Learning - a Primer at the JAX conference in Mainz, Germany. Deep Learning is one of the “hot” topics in the AI area – a lot of hype, a lo...

Read more »

Hardwired..for tidy text

January 29, 2018
By
Hardwired..for tidy text

Song lyric and sentiment analysis for all So - a while back I did a tidy text analysis on Faith No More lyrics. I had thought about doing this with Metallica album lyrics, as they have had a long career, spanning thier late teens/twenties to...

Read more »

Introducing DataFramed, a Data Science Podcast

January 29, 2018
By
Introducing DataFramed, a Data Science Podcast

We are super pumped to be launching a weekly data science podcast called DataFramed, in which Hugo Bowne-Anderson (me), a data scientist and educator at DataCamp, speaks with industry experts about what data science is, what it's capable of, what it looks like in practice and the direction it is heading over the next decade … Continue reading Introducing...

Read more »

Speed up R with Parallel Programming in the Cloud

January 29, 2018
By

This past weekend I attended the R User Day at Data Day Texas in Austin. It was a great event, mainly because so many awesome people from the R community came to give some really interesting talks. Lucy D’Agostino McGowan has kindly provided a list of the talks and links to slides, and I thoroughly recommend checking it out:...

Read more »

The EARLy career scholarship

January 29, 2018
By
The EARLy career scholarship

...

Read more »

The “cluster of six”

January 29, 2018
By
The “cluster of six”

Unsupervised machine learning looks for hidden structure in "unlabeled" data, i.e. a classification or categorisation that is not included in the observations. Hierarchical clustering will reveal a "cluster of six" Labour MPs as being the most "distant" from the wider party. The post The “cluster of six” appeared first on thinkr.

Read more »

How do I calculate the R squared metric for a Bayesian model

January 28, 2018
By

How do I calculate the R squared metric for a Bayesian model? A good friend I met on a field trip long ago, Dominique Roche, recently emailed me to ask about evaluation of Bayesian models. He has been delving into generalized linear models, using Ba...

Read more »

Deep Learning for Cancer Immunotherapy

January 28, 2018
By
Deep Learning for Cancer Immunotherapy

This is a guest post from Leon Eyrich Jessen, a postdoctoral researcher in the Immunoinformatics and Machine Learning Group at the Technical University of Denmark. Introduction Simon Caulton, Adoptive T-cell therapy, CC BY-SA 3.0 In my...

Read more »

Introducing Maëlle Salmon, rOpenSci’s new Research Software Engineer

Introducing Maëlle Salmon, rOpenSci’s new Research Software Engineer

We’re very pleased to be introducing someone who needs no introduction in the R community. Join us in welcoming Maëlle Salmon to rOpenSci as a Research Software Engineer (part time, working from Nancy, France). We’d like to formally introduce her here and share a bit about the kinds of things she’ll be working on. Maëlle did a B.Sc. in Biology...

Read more »

sparklyr 0.7

January 28, 2018
By
sparklyr 0.7

We are excited to share that sparklyr 0.7 is now available on CRAN! Sparklyr provides an R interface to Apache Spark. It supports dplyr syntax for working with Spark DataFrames and exposes the full range of machine learning algorithms available in Spark. You can also learn more about Apache Spark and sparklyr in spark.rstudio.com and our new webinar series...

Read more »

Year 2 of Locke Data

January 28, 2018
By

Hey folks, I wanted to give y’all an update about Locke Data one year on from when I started it up. In the past year, I’ve delivered more than 32 days of training, wrote and published 2 books, worked with 3 clients, and generally whimpered at my schedule. It has been amazing how much support the community has given me,...

Read more »

Moving parts of a country over a map

January 28, 2018
By
Moving parts of a country over a map

I love making maps, I also love making gifs. In this short post I make an animated gif of parts of a map moving. In this case the parts of the map only move in the xy direction, but you can also turn them, and make them bigger or smaller. Today I ...

Read more »

Exploring Embeddings for Categorical Variables with Keras

January 28, 2018
By
Exploring Embeddings for Categorical Variables with Keras

In order to stay up to date, I try to follow Jeremy Howard on a regular basis. In one of his recent videos, he shows how to use embeddings for categorical variables (e.g. weekdays). First off; what are embeddings? An embedding is a mapping of a catego...

Read more »

Type I error rates in two-sample t-test by simulation

January 28, 2018
By

What do you do when analyzing data is fun, but you don't have any new data? You make it up. This simulation tests the type I error rates of two-sample t-test in R and SAS. It demonstrates efficient methods for simulation, and it reminders the reader not to take the result of any single hypothesis test as gospel truth....

Read more »

digest 0.6.15

January 28, 2018
By

And yet another small maintenance release, now at version 0.6.15, of the digest package arrived on CRAN and in Debian today. digest creates hash digests of arbitrary R objects (using the 'md5', 'sha-1', 'sha-256', 'sha-512', 'crc32', 'xxhash32', 'xxh...

Read more »

RVowpalWabbit 0.0.11

January 28, 2018
By

Another boring little RVowpalWabbit package update to version 0.0.11 came in response to another CRAN request: We were writing temporary output (a cache file for the fit/prediction, to be precise) to a non-temporary directory, which is now being caug...

Read more »

Log shiny app visitors and R usage to Google Analytics

January 28, 2018
By
Log shiny app visitors and R usage to Google Analytics

If you work on applications for clients or have open sourced some shiny apps, a question that arises is how is your application being used. What you can do in order to find out how your hard work is being consumed is putting your code in logs and then ...

Read more »

R or Python? Python or R? The ongoing debate.

January 28, 2018
By
R or Python? Python or R? The ongoing debate.

On every SQL community event, where there could be a cluster of sessions dedicated to BI or analytics, I would have people asking me, “which one would you recommend?” or “which one I  prefer?” So, questions about recommendation and preferences are in my opinion the hardest one. And not that I would know my preferences … Continue reading R...

Read more »

Modeling LGD with Proportional Odds Model

January 28, 2018
By

The LGD model is an important component in the expected loss calculation. In https://statcompute.wordpress.com/2015/11/01/quasi-binomial-model-in-sas, I discussed how to model LGD with the quasi-binomial regression that is simple and makes no distributional assumption. In the real-world LGD data, we usually would observe 3 ordered categories of values, including 0, 1, and in-betweens. In cases with a

Read more »

Search R-bloggers


Sponsors

Mango solutions





Zero Inflated Models and Generalized Linear Mixed Models with R



Quantide: statistical consulting and training

ODSC2 west

ODSC1_jobs

datasociety

http://www.eoda.de



CRC R books series







Six Sigma Online Training



mljar.com



Contact us if you wish to help support R-bloggers, and place your banner here.