2955 search results for "ggplot"

Code as Magic, and the Vernacular of Data Wrangling Verbs

February 11, 2015
By
Code as Magic, and the Vernacular of Data Wrangling Verbs

It’s been some time now since I drafted most of my early unit contributions to the TM351 Data management and analysis course. Part of the point (for me) in drafting that material was to find out what sorts of thing we actually wanted to say and help identify the sorts of abstractions we wanted to

Read more »

Autocommit with ceODBC is slow

February 10, 2015
By
Autocommit with ceODBC is slow

You already know that a SQL INSERT is faster with bulk loading than inserting a record at a time, but what about the effect of autocommit on performance? While this is probably not specific to ceODBC, using autocommit is astonishingly slow. Here is how slow. First, the Python code to run the benchmark:import ceODBCimport datetimeimport osimport timeconnection_string="driver=sql...

Read more »

Why you should start by learning data visualization and manipulation

February 10, 2015
By

One of the biggest issues that comes up when I talk to people who want to get started learning data science is the following: I don’t know where to get started! Recently, I argued that R is the best programming language to learn when you’re getting started with data science. While this helps you select The post

Read more »

Mixing Waves

February 10, 2015
By
Mixing Waves

Fill a cocktail shaker with ice; add vodka, triple sec, cranberry, and lime, and shake well; strain into a chilled cocktail glass and garnish with orange twist (Cosmopolitan Cocktail Recipe) This is a tribute to Blaise Pascal and Joseph Fourier, two of the greatest mathematicians in history. As Pascal did in his famous triangle, I generate a set of … Continue reading...

Read more »

[ggtree] updating a tree view using %<% operator

February 9, 2015
By
[ggtree] updating a tree view using %<% operator

I am very exciting that I have received very positive feedback from Ahmed Moustafa and Simon Frost. Read More: 401 Words Totally

Read more »

Hierarchical log odds model example

February 8, 2015
By
Hierarchical log odds model example

I am working through Bayesian Approaches to Clinical Trials and Health-Care Evaluation (David J. Spiegelhalter, Keith R. Abrams, Jonathan P. Myles) (referred to as BACTHCE from here on). In chapter three I saw an example (3.13) where I wanted to d...

Read more »

Introducing stackr: An R package for querying the Stack Exchange API

February 3, 2015
By
Introducing stackr: An R package for querying the Stack Exchange API

There’s no end of interesting data analyses that can be performed with Stack Overflow and the Stack Exchange network of Q&A sites. Earlier this week I posted a Shiny app that visualizes the personalized prediction data from their machine learning system, Providence. I’ve also looked at whether high-reputation users were decreasing their answering activity over time,...

Read more »

QuickTip: Utilizing Machine Learning Methods to Identify Important Variables

February 2, 2015
By
QuickTip: Utilizing Machine Learning Methods to Identify Important Variables

Machine Learning is the field of scientific study that concentrates on induction algorithms and on other algorithms that can be said to “learn.” In order to identify important variables in a multivariate dataset one can utilize machine learning methods. There are many different machine learning algorithms for different tasks. One common task is to decide if a feature vector...

Read more »

Spearman Correlation Heat Map with Correlation Coefficients and Significance Levels in R

February 2, 2015
By
Spearman Correlation Heat Map with Correlation Coefficients and Significance Levels in R

Figure 1: Spearman correlation heat map with correlation coefficient and significance levels based on the mtcars data set.In a recent paper we included data from a survey we conducted. During the publication process, one of the reviewers asked for a...

Read more »

Should you teach Python or R for data science?

February 2, 2015
By

Last week, I published a post titled Lessons learned from teaching an 11-week data science course, detailing my experiences and recommendations from teaching General Assembly's 66-hour introductory data science course. In the comments, I received the ...

Read more »

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)