Site icon R-bloggers

The Data Mining Process: Modeling

[This article was first published on ThinkToStart » R Tutorials, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Happy new year, everyone! Continuing this series on the data mining process that has previously examined understanding business problems and associated data as well as data preparation,  this post focuses on modeling.

Developing models calls for using specific algorithms to explore, recognize, and ultimately output any patterns or themes in your data.  The two goals of modeling are to classify or predict. Some algorithms specialize in either classifying or predicting while others can be applied to do both. Choosing which algorithms to employ in developing models depends on the goals of the business, the nature of the data (structured versus unstructured), and the quantity as well as quality of the data.

There are many popular algorithms that are often seen to develop models for specific types of business problems:

It is important to mention that, while specific algorithms are often used to develop specific types of models, there are typically multiple types of algorithms that might be the best option in any specific instance. Choosing which algorithm is best for any specific modeling task requires evaluation. Evaluating models will be covered in the next post.

The post The Data Mining Process: Modeling appeared first on ThinkToStart.

To leave a comment for the author, please follow the link and comment on their blog: ThinkToStart » R Tutorials.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.