280 search results for "web scraping"

Scraping R-Bloggers with Python

January 4, 2012
By

In this post I promised to show how I use Python with the BeautifulSoup and Mechanize modules to scrape information from different websites. As a fun exercise, and something that should interest the readers of R-bloggers, I thought it would be interest...

Read more »

R-Function GScholarScraper to Webscrape Google Scholar Search Result

November 9, 2011
By
R-Function GScholarScraper to Webscrape Google Scholar Search Result

Based on my previous post on Web Scraping I coded and uploaded the Function "GScholarScraper" HERE for testing!The function will pull all (!) results, processing pages in chunks of 100 results/titles, and return a file with all titles, links, etc. It w...

Read more »

Interacting with bioinformatics webservers using R

September 8, 2011
By
Interacting with bioinformatics webservers using R

In an ideal world, all bioinformatics tools would be made available via the Web as a web service with an API, as well as a standalone package to download for local use. This is rarely the case and sometimes, even where one or the other is available, factors such as cost come into play. So

Read more »

R Screen Scraping: 105 Counties of Election Data

February 18, 2011
By

by Earl F. Glynn, Kansas Watchdog The goal of this article is to show how to visit 105 online web pages programmatically and “scrape” data from them to form a statewide summary of election data in Kansas. An earlier article gave details of ...

Read more »

Simple R Screen Scraping Example

February 18, 2011
By

by Earl F. Glynn, Kansas Watchdog The goal of this exercise is to show how to “screen scrape” data from an online web page using R. Additional articles will extend this example to scrape data from 105 Kansas county pages to form a statewide...

Read more »

Scrape Web data using R

August 13, 2010
By
Scrape Web data using R

Plenty of people have been scraping data from the web using R for a while now, but I just completed my first project and I wanted to share the code with you.  It was a little hard to work through some of the “issues”, but I had some great help from @DataJunkie on twitter. As

Read more »

Russell 2000 Quantitative Stock Analysis in R: Six Stocks with Amazing, Consistent Growth

November 29, 2016
By
Russell 2000 Quantitative Stock Analysis in R: Six Stocks with Amazing, Consistent Growth

The Russell 2000 Small-Cap Index, ticker symbol: ^RUT, is the hottest index of 2016 with YTD gains of over 18%. The index components are interesting not only because of recent performance, but because the top performers either grow to become mid-cap stocks or are bought by large-cap companies at premium prices. This means selecting the best components...

Read more »

Sentimental Analysis of the First Presidential Debate of 2016 Using Machine Learning

September 27, 2016
By
Sentimental Analysis of the First Presidential Debate of 2016 Using Machine Learning

This post analyzes sentiment of people towards the 2016 presidential candidates post the First Presidential Debate using Tweets. We focus on Tweets containing the names of Hillary Clinton and Donald Trump.Let’s load the required packageslibrary(twitteR)library(ROAuth)require(RCurl)library(stringr)library(tm)library(plyr)library(tm)library(wordcloud)Setting up Twitter Authenticationconsumer_key = "consumer_key"consumer_secret = "consumer_secret"token_secret = "token_secret"access_token = "access_token"authenticate <- OAuthFactory$new(consumerKey = consumer_key,consumerSecret = consumer_secret,requestURL="https://api.twitter.com/oauth/request_token",accessURL="https://api.twitter.com/oauth/access_token",authURL="https://api.twitter.com/oauth/authorize")setup_twitter_oauth(consumer_key, consumer_secret, access_token, token_secret)Web ScrapingLet’s search what has been tweeted...

Read more »

Mastering the art of Data Preparation II

September 15, 2016
By
Mastering the art of Data Preparation II

It is not the beauty of a building you should look at; it is the construction of the foundation that will stand the test of time. ~ David Allan CoeThis week we focus on getting data from MYSQL, HDF5, API and the WEB.Extracting Data from MYSQLMysql is one of the most widely used open source databases.The data is stored in tabular format...

Read more »

Mastering the Art of Data Preparation I

September 6, 2016
By
Mastering the Art of Data Preparation I

Without a solid foundation, you’ll have trouble creating anything of value. ~ Erika OppenheimerAlthough building models to predict the future outcome is the ultimate goal of any study, getting the data and preparing it for analysis is the foundation based on which the models are built. Although often neglected, this quintessential step is of paramount importance for the success...

Read more »

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)