Document Classification using R

July 25, 2013

September 23, 2013. Recently I have developed an interest in analyzing data to find trends, predict future events, and so on, and I have started working on a few POCs in data analytics, such as predictive analysis and text mining. My next blog post is on data mining, more specifically document classification using the R programming language, one of the powerful languages...

shinyPsychometric: simulating how experimental choices affect results uncertainty

July 25, 2013

In the R community, Shiny is trending a lot. Shiny is an R package for developing interactive web applications. I wanted to give it a try, and ended up with an app to simulate the impact of data collection choices on the outcome reliability of a curve fitting process. I wrote the code behind it


Development Update

July 25, 2013

This is a quick update regarding the status of my R packages on Google Code. Since Google decided to disallow uploads from January 2014 for existing projects, and immediately for new ones (meaning that tarballs and zips can no longer be hosted on their servers), I have had no choice but to return the development

Deferred evaluation in Renjin, Riposte, and pqR

July 24, 2013

The previously sleepy world of R implementation is waking up. Shortly after I announced pqR, my “pretty quick” implementation of R, the Renjin implementation was announced at useR! 2013. Work also proceeds on Riposte, with release planned for a year from now. These three implementations differ greatly in some respects, but interestingly they all try

Another great year together. Have a good vacation!

July 24, 2013

Dear R users, August is just around the corner and MilanoR closes for vacation. Since September 2012, we have organized two meetings in Milano, the first Italian bioR day and two R courses. We published 14 posts on the blog. …

A Note About proj4 in R

July 24, 2013

It's been a long time since I had to transform some coordinates 'manually'. I mean I had a list of coordinates that I needed to project. There are some nice proprietary tools for it, but who needs them when open source is everywhere? Proj4 is an ...

What is R? A new video on the history, community and applications of R

July 24, 2013

We meet a lot of R users on our travels, and something we often hear from them is that while they're doing amazing things with R (incredible data visualizations, statistical analysis, and data science applications), their supervisors or peers may not know that the R language is involved, or that others could benefit from using it. It would be...


Selecting subset of variables in data frame

July 24, 2013

I frequently work with datasets that have many variables, and I often need to apply some function to a subset of the variables in a data frame. To simplify this task, I wrote a short function that lets me specify which variables to include and which to exclude. I choose the subset of variables based on the following condition types: variable/column...

Making infographics using R and Inkscape

July 24, 2013

I have been making charts with R for almost as long as I have been using R, and with good reason: R is an amazing tool for filtering and visualizing data. With R, and particularly if we use the excellent ggplot2 library, we can go from raw data to compelling visualization in minutes. But what if we want...


Archival, Analysis, and Visualization of #ISMBECCB 2013 Tweets

July 24, 2013

As the 2013 ISMB/ECCB meeting is winding down, I archived and analyzed the 2000+ tweets from the meeting using a set of bash and R scripts I previously blogged about. The archive of all the tweets tagged #ISMBECCB from July 19-24, 2013 is and ...

Electricity Usage in a High-rise Condo Complex pt 4

July 24, 2013

This is the fourth article in the series, where the techiness builds to a crescendo. If this is too statistical/programming geeky for you, the next posting will return to a more investigative and analytical flavor. Last time, we looked at …

Sure, this is silly, but this makes me feel a little bit cooler

July 24, 2013

Look at this nice video on R statistics. It really advertises doing statistics in a way that is open to anyone!


Power Analysis by Simulation: R, RCT, Malaria Example

July 24, 2013

I have received a number of requests for demonstration code on how to perform a power analysis using simulation in R. I have already demonstrated how to do this in Stata but lacked the easy-to-use Stata command “simulate” that I preferred. However, in a recent post I have written up a command very similar to simulate in...

Postscript to Data Visualization

July 23, 2013

Much to my chagrin, I realized I forgot to include one of the more interesting features in the lattice package. You can quickly turn a quantitative variable into a set of levels with equal counts. This provides a nice way of looking at slices of your...

R Credit Scoring – WoE & Information Value in woe Package

July 23, 2013

In credit scoring, Information Value (IV) is frequently used to compare predictive power among variables. When developing new scorecards using logistic regression, variables are often binned and recoded using the Weight of Evidence (WoE) concept. Package riv will help you to a...

Using survival models for marketing attribution

July 23, 2013

By Andrie de Vries. Prior to joining Revolution Analytics in March this year, I spent several years in the field of market research and survey analytics. During this period, I spent a few months consulting to a digital marketing agency based in London. My role was to help build their capability in building customer surveys and integrating these into...

A Beginner’s Look at Julia

July 23, 2013

Over the past month or so, I’ve been playing with a new scientific programming language called ‘Julia’, which aims to be a high-level language with performance approaching that of C. With that goal in mind, Julia could be a solution to the ‘multi-language’ problem of needing to move between R, Python, MATLAB, C, Fortran, Scala, ...

The R User Conference 2013: Albacete, Spain

July 23, 2013

I was fortunate enough to attend the 2013 UseR! conference in Albacete, Spain this year. I had a great time meeting fellow R users and exchanging ideas on R implementations. The conference is also one of the few opportunities to gain exposure to uses of R in other disciplines because there are so many talks


How to run R in the cloud (for teaching)

July 23, 2013

Last week, we launched the early stage beta version of our interactive online learning platform for R: DataMind.org. The development of this educational platform required the creation of a new IT infrastructure able to run R in the cloud. In this post, we share our approach and insights on the design of such an application and hope


Generating Sankey Diagrams from rCharts

July 23, 2013

A couple of weeks or so ago, I picked up an inlink from an OCLC blog post about Visualizing Network Flows: Library Inter-lending. The post made use of Sankey diagrams to represent borrowing flows, and by implication suggested that the creation of such diagrams is not as easy as it could be… Around the same


Review: Kölner R Meeting 19 July 2013

July 23, 2013

Despite the hot weather and the beginning of the school holiday season in North Rhine-Westphalia, the Cologne R user group met yet again for two fascinating talks, with beer and schnitzel afterwards. Analysing Twitter data to evaluate the US Dollar / Euro exchange rates: Dietmar Janetzko presented ideas to forecast US Dollar / Euro exchange rate movements...

Vectors of S4 classes with non-trivial slots

July 22, 2013

Here’s another rabbit hole where I spent a bit of time this evening. I like OOP and I like the way R uses vectors. I’ve created a few classes and had started to code a function which would plot a set of them. It all seemed straightforward until I realized that the infrastructure for treating


Visual debugging with RStudio

July 22, 2013

Introduction: From release 0.98.208, the latest RStudio IDE comes with a visual debugger. Debugging with R and RStudio now becomes a simple and efficient task. This short post is not meant to be a crash course on “debugging with R” nor …

Bike sharing in 100 cities

July 22, 2013

Many cities around the world have bike sharing programs: pick up a bike at a docking station, ride it across town and drop it off at another station, and just pay for the time you use. (Even Albacete, the Spanish college town hosting last month's useR! conference, had one.) Most of these systems provide open data feeds of bike...

More goodies from rCharts

July 22, 2013

The guys developing rCharts continue to release enhancements by the day, and I have taken advantage to update a couple of Shiny apps. The CRAN download app now sports the new exporter feature, so that any chart a user comes up with can be saved as an SVG vector, PNG or JPEG image or as

Hierarchical Linear Model

July 22, 2013

Linear regression is probably the most familiar technique in data analysis, but its application is often hamstrung by model assumptions. For instance, if the data has a hierarchical structure, quite often the assumptions of linear regression are feas...

Create R package – Rstudio, github, devtools

July 22, 2013

If you are going to create your first package in R, there is a common set of tools you will probably use: RStudio, the devtools package and GitHub. You don't have to, but it will save you a lot of time and your code will be versioned and better understandabl...

David vs. Goliath in Men’s Professional Tennis

July 22, 2013

David dances lightly from side to side, his small feet stirring up wisps of dust from the clay surface. Twirling his racket in anticipation, he peers intently at his colossal foe hoping to spot some clue where the first serve will go. Across the net is...


Loading up your custom toolkits at startup – R

July 22, 2013

If you are anything like me, then you have dozens if not hundreds of personal functions that you have written to accomplish a number of tasks. Many of the tasks that you would like to do are similar to previous tasks that you have already done...
