Therefor in R, there is predict() function of making a prediction with this relation. But for making prediction we require input data. We will store request data to input_data. Now, suppose we have an input dataset ( href="http://www.tatvic.com/blog/product-revenue-prediction-with-r/#codesnap" >description ) like
- xcartaddtotalrs_out = 0
- xcartremove_out = 0
- xproductviews_out = 47
- xuniqprodview_out = 38
- xprodviewinrs_out = 5828
> input_data <- data.frame(xcartaddtotalrs_out=0, xcartremove_out=0, xprodviews_out=47, xuniqprodview_out=38, xprodviewinrs_out=5828)
This means we want to predict the transactional product revenue on the base of xcartaddtotalrs, xcartremove, xproductviews, xuniqprodview and xprodviewinrs. Now, we will make a prediction by predict function and model_out prediction model (which we have already developed in href="http://www.tatvic.com/blog/?p=3013">Product revenue prediction with R – part 2).
style="text-align: justify">We style="text-align: justify"> can do style="text-align: justify">the same prediction activity style="text-align: justify"> on google prediction API with less effort. When we processed the same href="http://www.tatvic.com/blog/product-revenue-prediction-with-r/#codesnap" >dataset with google prediction API for predictive modeling, it’s model summary would look likestyle="text-align: center"> href="http://www.tatvic.com/blog/wp-content/uploads/2012/09/get1.jpg"> class="aligncenter wp-image-3293" src="http://www.tatvic.com/blog/wp-content/uploads/2012/09/get1.jpg" alt="" width="700" height="270" />
Let’s identify above attributes from prediction result. The id is unique model identity information for model identification. In model information, there are numberInstances which describes total numbers of rows are 4061 in the dataset, modelType attribute describes the type of model (either regression or categorical) which is regression and meanSquaredError which is 1606123.17. Root of mean squared error is the cost of this regression model in the Google Prediction API model which is 1267.33.
In R, our href="http://www.tatvic.com/blog/product-revenue-prediction-with-r-part-2/#formula" >predictive model has cost 656.4 which is lower than the Google Prediction API. The reason behind the reduced cost is variable selection and removal of outliers from our dataset. I style="text-align: justify">n style="text-align: justify"> R, style="text-align: justify">we style="text-align: justify"> can improve our prediction accuracy with improving model as well as improving our style="text-align: justify">dataset style="text-align: justify"> quality as per model type. But in google prediction API, we can improve prediction accuracy only by dataset quality, we can’t update model.style="text-align: justify">Don’t think this stuff is more complex, it’s pretty interesting once you are used to developing it. To start learning this predictive modeling, just start with rough implementation and improve step by step as per your requirement. If you need to do it yourself you can href="http://www.tatvic.com/blog/downloads/product_revenue_r.rar?" onclick="_gaq.push(['_trackEvent','Downloads','Product Revenue-r','Blog',,1]);">download this R code + sample dataset. In next of my blog- title="Product revenue prediction with Prediction API" href="http://www.tatvic.com/blog/product-revenue-prediction-with-prediction-api/" >Product revenue prediction with Prediction API, I will discuss about generating prediction with Google Prediction API with more description. style="text-align: justify">Want us to help you implement or analyze the data for your visitors. href="http://www.tatvic.com/contact/?ref=blogpost">Contact us class="wp-about-author-containter-top" style="background-color:#FFEAA8;"> class="wp-about-author-pic"> src="http://www.tatvic.com/blog/wp-content/uploads/userphoto/15.jpg" alt="Vignesh Prajapati" width="60" class="photo" />