Articles by Nagdev Amruthnath

Multi-Output Regression using Sklearn

October 6, 2021 | Nagdev Amruthnath

Regression analysis is a process of building a linear or non-linear fit for one or more continuous target variables. That’s right! there can be more than one target variable. Multi-output machine learning problems are more common in classification than regression. In classification, the categorical target variables are encoded to ... [Read more...]

Visualizing trees with Sklearn

October 3, 2021 | Nagdev Amruthnath

Tree-based models are probably the second easiest ML technique for explaining the model to a non-data scientist. I am a big fan of tree-based models because of their simplicity and interpretability. But, when I try to visualize them is, when it gets my nerves. There are so many packages out ...
[Read more...]

Sentiment Analysis on Reddit using R

June 24, 2021 | Nagdev Amruthnath

According to Wikipedia, Reddit is an American social news aggregation, web content rating, and discussion website. Registered members submit content to the site such as links, text posts, images, and videos, which are then voted up or down by other members. Posts are organized by subject into user-created boards called “... [Read more...]

Big Data Ignite 2020 Webinar Series

September 15, 2020 | Nagdev Amruthnath

Big Data Ignite (BDI) was born out of a shared vision: To foster a local center of excellence in advanced computing technologies and practice. After initial success in organizing local Meetup groups, co-founders Elliott and Tuhin realized that to achieve their goal, the scope and scale of activism would need ... [Read more...]

Benford’s Law: Applying to Existing Data

August 18, 2020 | Nagdev Amruthnath

Benford’s Law is one of the most underrated and widely used techniques that are commonly used in various applications. United States IRS neither confirms nor denies their use of Benford’s law to detect any number of manipulations in income tax filing. Across the Atlantic, the EU is very ...
[Read more...]

How to use CI/CD for your ML Projects?

August 13, 2020 | Nagdev Amruthnath

The terms CI/CD stands for Continuous Integration and Continuous Delivery – Deployment. Before we jump into how all these work, let’s take a step back and walk through the process of ML. Most of the data scientists do their data analytics on their laptops. For every data analytics projects ...
[Read more...]

Will Netflix Renew the Show?

August 7, 2020 | Nagdev Amruthnath

In last couple of years, Netflix has become a part of my lifestyle. At the end of my day when I turn on my TV, by default i’m tuned to check out Netflix. I always look forward for Friday when they release their original content and make sure I ...
[Read more...]

How to become a data scientist in 30 days?

June 29, 2020 | Nagdev Amruthnath

On a late evening, I was scrolling through Reddit and came across a news article about “Why Bill Gates wants us all to get vaccinated?”. The news site looked legitimate. I was half way through the article and saw quite a few grammatical errors. Me being lurker, I switched to ...
[Read more...]

Visualizing Principle Components for Images

June 28, 2020 | Nagdev Amruthnath

Principle Component Analysis (PCA) is a great tool for a data analysis projects for a lot of reasons. If you have never heard of PCA, in simple words it does a linear transformation of your features using covariance or correlation. I will add a few links below if you want ...
[Read more...]

Data Science Application in Manufacturing

June 22, 2020 | Nagdev Amruthnath

Last week, I had a great opportunity to give a talk on data science application in manufacturing at Acharya Institute of Technology(AIT), Bangalore. Being an alumni, AIT has a special place in my heart. A lot of curious young minds who attended my session had great questions. Some of ... [Read more...]

COVID-19 Data and Prediction for Michigan

March 26, 2020 | Nagdev Amruthnath

Every country is facing a global pandemic caused by COVID19 and it’s quite scary for everyone. Unlike any other pandemic we faced before, COVID19 is providing plenty of quality data in near real time. Making this available for general public has helped citizen data scientists to share their reports, ...
[Read more...]

Convolutional Neural Network under the Hood

February 27, 2020 | Nagdev Amruthnath

Neural networks have really taken over for solving image recognition and high sample rate data problems in the last couple of years. In all honesty, I promise I won’t be teaching you what neural networks are or CNN’s are. There are hundred’s of resources that are published ...
[Read more...]

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)