Articles by Fisseha Berhane

Machine Learning with R Caret – Part 1

January 25, 2018 | Fisseha Berhane

This blog post series is on machine learning with R. We will use the Caret package in R. In this part, we will first perform exploratory Data Analysis (EDA) on a real-world dataset, and then apply non-regularized linear regression to solve a supervised regression problem on the dataset. We will ...
[Read more...]

Using MongoDB with R

March 2, 2017 | Fisseha Berhane

MongoDB is a NoSQL database program which uses JSON-like documents with schemas. It is free and open-source cross-platform database. MongoDB, top NoSQL database engine in use today, could be a good data storage alternative when analyzing large volume data. To use MongoDB with R, first, we have to download and ...
[Read more...]

Machine Learning for Drug Adverse Event Discovery

September 26, 2016 | Fisseha Berhane

We can use unsupervised machine learning to identify which drugs are associated with which adverse events. Specifically, machine learning can help us to create clusters based on gender, age, outcome of adverse event, route drug was administered, purpose the drug was used for, body mass index, etc. This can help ... [Read more...]

The importance of Data Visualization

September 2, 2016 | Fisseha Berhane

Before we perform any analysis and come up with any assumptions about the distributions of and relationships between variables in our datasets, it is always a good idea to visualize our data in order to understand their properties and identify appropriate analytics techniques. In this post, let’s see the ... [Read more...]

Best packages for data manipulation in R

May 17, 2016 | Fisseha Berhane

There are 2 packages that make data manipulation in R fun. These are dplyr and data.table. Both packages have their strengths. While dplyr is more elegant and resembles natural language, data.table is succinct and we can do a lot with data.table in just a single line. Further, data.... [Read more...]

Working with databases in R

April 2, 2016 | Fisseha Berhane

The dplyr package, which is one of my favorite R packages, works with in-memory data and with data stored in databases. In this extensive and comprehensive post, I will share my experience on using dplyr to work with databases. The basic functions of dplyr package are covered by Teja in ... [Read more...]

Performing SQL selects on R data frames

March 13, 2016 | Fisseha Berhane

For anyone who has SQL background and who wants to learn R, I guess the sqldf package is very useful because it enables us to use SQL commands in R. One who has basic SQL skills can manipulate data frames in R using their SQL skills. You can read more ... [Read more...]

Sentiment Analysis on Donald Trump using R and Tableau

January 2, 2016 | Fisseha Berhane

Recently, the presidential candidate Donal Trump has become controversial. Particularly, associated with his provocative call to temporarily bar Muslims from entering the US, he has faced strong criticism. Some of the many uses of social media analytics is sentiment analysis where we evaluate whether posts on a specific issue are ... [Read more...]

Google scholar scraping with rvest package

January 1, 2016 | Fisseha Berhane

In this post, I will show how to scrape google scholar. Particularly, we will use the 'rvest' R package to scrape the google scholar account of my PhD advisor. We will see his coauthors, how many times they have been cited and their affiliations. “rvest, inspired by libraries like beautiful ... [Read more...]

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)