3151 search results for "Map"

Exploiting Heterogeneity to Reveal Consumer Preference: Data Matrix Factorization

August 11, 2014
By
Exploiting Heterogeneity to Reveal Consumer Preference: Data Matrix Factorization

We begin with a data matrix, a set of numbers arrayed so that each row contains information from a different consumer. Marketing research focuses on the consumer, but the columns are permitted more freedom, although they ought to tell us something abou...

Read more »

Vtreat: designing a package for variable treatment

August 7, 2014
By
Vtreat: designing a package for variable treatment

When you apply machine learning algorithms on a regular basis, on a wide variety of data sets, you find that certain data issues come up again and again: Missing values (NA or blanks) Problematic numerical values (Inf, NaN, sentinel values like 999999999 or -1) Valid categorical levels that don’t appear in the training data (especially Related posts:

Read more »

why clusterProfiler fails

August 6, 2014
By

Recently, there are some comments said that sometimes clusterProfiler failed in KEGG enrichment analysis. kaji331 compared cluserProfiler with GeneAnswers and found that clusterProfiler gives larger p values. The result forces me to test it. Read More: 251 Words Totally

Read more »

Results of the Readers’ Survey

August 5, 2014
By
Results of the Readers’ Survey

 First of all, let me say “Thank You” to all of the 357 people who completed the survey. I was hoping for 100, so needless to say the response blew away my expectations. This endeavor seems like a worthwhile effort to do once a year. Next year I will refine the...

Read more »

Introducing rlist 0.3

August 5, 2014
By

rlist 0.3 is released! This package now provides a wide range of functions for dealing with list objects. It can be especially useful when they are used to store non-tabular data. Two notable features are added in this version. First, list.search and equal() are added in support of fuzzy filtering and searching. Second, List object is added to provide object-based,...

Read more »

Clarifying difference between Ratio and Interval Scale of Measurement

August 5, 2014
By

Clarifying difference between Ratio and Interval Scale of Measurement Clarifying difference between Ratio and Interval Scale of Measurement Introduction Recently while preparing lecture on scales of measurements and types of statistical data, I came across two scales of measurement when numbers are used to denote a quantitative variable. ...

Read more »

Parameterized SQL queries

August 5, 2014
By

Mateusz Żółtak asked me to spread the word about his new R package for parameterized SQL queries. Below you can find the copy of package vignette. If you work with SQL in R you may find it useful. Mateusz Żółtak The package RODBCext is an extension of the RODBC database connectivity package. It provides support

Read more »

Parsing Domain Names in R with tldextract

August 4, 2014
By

The R Language is really good at data and statistical analysis, but when it comes to working with information security data it has a few holes that need plugging up. Bob has been doing a couple of posts using Rcpp to do things like Basic DNS Lookups, TXT lookups, and IPv4 Conversions. I wanted to add to some of that work with a quick package...

Read more »

Customer Segmentation Using Purchase History: Another Example of Matrix Factorization

August 2, 2014
By
Customer Segmentation Using Purchase History: Another Example of Matrix Factorization

As promised in my last post, I am following up with another example of how to perform market segmentations with nonnegative matrix factorization. Included with the R package bayesm is a dataset called Scotch containing the purchase history for 21 brands of whiskey over a one year time period from 2218 respondents. The brands along with some features...

Read more »

The odds of a cluster of airplane accidents

August 2, 2014
By
The odds of a cluster of airplane accidents

Recently, there have been a lot of airplane accidents. July, 17th 2014, Hrabove, Ukraine, Malaysia Airlines, Boeing 777, fatalities 298 (/298) July, 23rd 2014, Magong, Taiwan, TransAsia Airways, ATR 72-500, fatalities 47 (/58) July, 24th 2014, Aguelhok, Mali, Air Algerie, Mc Donnell Douglas MD-83, fatalities 116 (/116) It is simple to find a lot of datasets about airplane crashes....

Read more »

Sponsors

Mango solutions



plotly webpage

dominolab webpage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

datasociety

http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training









Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)