RStudio Overview
RStudio has four panes:
1) Script pane - to write and save the programming script
2) Console pane - where all the code gets executed
3) Environment/history pane - displays all the variables created and functions used within the current session
4) Helper pane - contains multiple tabs to install/display packages, ...

Introduction: In this post, we learn about building a basic search engine or document retrieval system using the vector space model. This use case is widely used in information retrieval systems. Given a set of documents and search term(s)/query, we need to retrieve relevant documents that are similar to ...
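
The retrieval idea in this excerpt can be sketched in a few lines. The block below is a minimal illustration, not the post's own implementation: it assumes TF-IDF term weighting with smoothed IDF and cosine similarity for ranking, and it is written in Python rather than R. All function names (tokenize, build_idf, tfidf_vector, cosine, search) are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def build_idf(docs_tokens):
    """Inverse document frequency for every term seen in the corpus."""
    n_docs = len(docs_tokens)
    doc_freq = Counter()
    for tokens in docs_tokens:
        doc_freq.update(set(tokens))  # count each term once per document
    # Smoothed IDF (an assumption): terms found in every document keep a weight.
    return {t: math.log((1 + n_docs) / (1 + df)) + 1.0 for t, df in doc_freq.items()}


def tfidf_vector(tokens, idf):
    """Sparse TF-IDF vector as a dict; terms outside the corpus are ignored."""
    counts = Counter(tokens)
    return {t: c * idf[t] for t, c in counts.items() if t in idf}


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def search(query, docs):
    """Return (score, document index) pairs, most similar document first."""
    docs_tokens = [tokenize(d) for d in docs]
    idf = build_idf(docs_tokens)
    doc_vecs = [tfidf_vector(tokens, idf) for tokens in docs_tokens]
    query_vec = tfidf_vector(tokenize(query), idf)
    scores = [(cosine(query_vec, dv), i) for i, dv in enumerate(doc_vecs)]
    return sorted(scores, key=lambda pair: pair[0], reverse=True)


if __name__ == "__main__":
    corpus = [
        "the cat sat on the mat",
        "dogs chase cats in the park",
        "stock prices rose sharply today",
    ]
    for score, idx in search("cat mat", corpus):
        print(round(score, 3), corpus[idx])
```

Documents that share no term with the query score zero, so a real system would normally add stemming and stop-word removal before building the vectors.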

As part of Data Science with R, this is the third tutorial, after basic data types and control structures in R. One of the issues with the for loop is its memory consumption and its slowness in executing a repetitive task. Often, when dealing with large data and iterating over it, the for loop ...

As part of the Data Science tutorial series, in my previous post I covered basic data types in R. I have kept the tutorial very simple so that beginners in R programming can take off immediately. Please find the online R editor at the end of the post...

Curse of Dimensionality: One of the most commonly faced problems while dealing with data analytics problems such as recommendation engines and text analytics is high-dimensional and sparse data. Many times, we face a situation where we have a large set of features and fewer data points, or we have data ...

As part of the tutorial series on Data Science with R from Data Perspective, this first tutorial introduces the very basics of the R programming language: basic data types in R. What we learn: Assignment Operator, Numeric, Integer, Complex number, Logical, Character, Factor, Vector, Data Frame. After the end of the chapter, you are provided with an R console so that ...

As the R programming language becomes more and more popular among the data science community, with industries, researchers, and companies embracing R, going forward I will be writing posts on learning data science using R. The tutorial course will include topics on data types of R, handling data using R, probability theory, Machine Learning, ...

In the series on implementing recommendation engines, my previous blog post about recommendation systems in R explained how to implement the user-based collaborative filtering approach using R. In this post, I will explain a basic implementation of item-based collaborative filtering recommender systems in R. Intuition: Item based ...
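
As a rough illustration of the item-based idea mentioned in this excerpt, here is a minimal sketch in Python (not the post's R code). It assumes cosine similarity between item rating vectors, with each norm taken over all users who rated the item, and a similarity-weighted average for prediction. The function names and the toy ratings are hypothetical.

```python
import math


def item_vectors(ratings):
    """Turn {user: {item: rating}} into {item: {user: rating}}."""
    items = {}
    for user, user_ratings in ratings.items():
        for item, rating in user_ratings.items():
            items.setdefault(item, {})[user] = rating
    return items


def cosine(u, v):
    """Cosine similarity between two sparse rating vectors stored as dicts."""
    dot = sum(r * v.get(user, 0.0) for user, r in u.items())
    norm_u = math.sqrt(sum(r * r for r in u.values()))
    norm_v = math.sqrt(sum(r * r for r in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def predict(ratings, user, target_item):
    """Predict a rating as a similarity-weighted average of the user's own ratings.

    Returns None when no rated item has any similarity to the target item.
    """
    items = item_vectors(ratings)
    target_vec = items.get(target_item, {})
    weighted_sum = 0.0
    weight_total = 0.0
    for item, rating in ratings.get(user, {}).items():
        if item == target_item:
            continue
        sim = cosine(target_vec, items[item])
        weighted_sum += sim * rating
        weight_total += abs(sim)
    if weight_total == 0.0:
        return None
    return weighted_sum / weight_total


if __name__ == "__main__":
    # Toy data, invented purely for illustration.
    toy_ratings = {
        "ann": {"A": 5, "B": 4},
        "bob": {"A": 4, "B": 5, "C": 1},
        "cat": {"A": 1, "C": 5},
    }
    print(predict(toy_ratings, "ann", "C"))
```

Because the similarities here are non-negative, the prediction always falls between the lowest and highest rating the user has already given.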

Recently I came across a term, CRISP-DM, a data mining standard. Though this process is not new, I felt every analyst should know about this commonly used industry-wide process. In this post I will explain the different phases involved in creating a data mining solution. ...

In my previous blog post I explained linear regression. In today’s post I will explain logistic regression. Consider a scenario where we need to predict a medical condition of a patient (HBP), HAVE HIGH BP or NO HIGH BP, based on some observed symptoms – Age, weight, Issmoking, ...
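
To make the scenario concrete, the sketch below (in Python, not the post's own code) shows how a logistic model turns predictors into a probability between 0 and 1 by passing a linear combination through the sigmoid function. The coefficient values are hypothetical and not fitted to any data; the function and parameter names are assumptions made for illustration.

```python
import math


def sigmoid(z):
    """Logistic function: maps any real number into the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_high_bp(age, weight, is_smoking, coef):
    """Probability of HIGH BP from a linear combination of the predictors."""
    z = (
        coef["intercept"]
        + coef["age"] * age
        + coef["weight"] * weight
        + coef["is_smoking"] * is_smoking
    )
    return sigmoid(z)


# Hypothetical coefficients, chosen only to illustrate the mechanics.
EXAMPLE_COEF = {"intercept": -6.0, "age": 0.05, "weight": 0.03, "is_smoking": 0.8}

if __name__ == "__main__":
    p = predict_high_bp(age=50, weight=80, is_smoking=1, coef=EXAMPLE_COEF)
    label = "HAVE HIGH BP" if p >= 0.5 else "NO HIGH BP"
    print(round(p, 3), label)
```

The 0.5 cut-off used to turn the probability into a class label is itself a choice; in practice it is tuned to the cost of false positives versus false negatives.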

R is becoming a popular programming language in the area of Data Science. Integrating R scripts with web UI pages is a challenge that many application developers face. In this blog post I will explain how we can expose an R script as an API, using rApache and the Apache web server. rApache is ...

What is a Prediction Problem? A business problem that involves predicting future events by extracting patterns from historical data. Prediction problems are solved using statistical techniques, mathematical models, or machine learning techniques. For example: forecasting the stock price for the next week, predicting which football team wins the World Cup, ...

In our day-to-day life, we come across a large number of recommendation engines, like the Facebook recommendation engine for friend suggestions and suggestions of similar Like Pages, and the YouTube recommendation engine suggesting videos similar to our previous searches/preferences. In today’s blog post I will explain how to build ...

In today’s blog post, we shall look into time series analysis using the R package forecast. The objective of the post is to explain the different methods available in the forecast package which can be applied when dealing with time series analysis/forecasting. What is Time Series? A time series is a ...

Ever since I started working with R, I have wondered how I can present the results of my statistical models as web applications. After doing some research on the internet I came across ShinyR, a new package from RStudio which can be used to develop interactive web applications with R. Before ...

In my previous blog post I explained the steps needed to solve a data analysis problem. Going further, I will discuss each step of data analysis in detail. In this post, we shall discuss exploratory analysis. What is Exploratory Analysis? “Understanding data visually.” Exploratory Analysis means ...

After going through the overview of tools & technologies needed to become a data scientist in my previous blog post, in this post we shall understand how to tackle a data analysis problem. Any data analysis project starts with identifying a business problem where historical data exists. A business problem can ...

As mentioned in my previous post, in this post I will list the tools, blogs and forums, and online courses that I have gathered over the past year, which I found necessary in my journey and which will be helpful to my fellow data science aspirants. Skillset Required: Knowledge ...

As we all march into the New Year, I would like to post about my plans to become a Data Scientist, my 2014 resolution on the professional front. The term Data Science was first introduced to me a year ago, around this time. Since then I have started researching and gathering necessary ...

