A follow up note on our web scraping tutorial
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
We had published a web scraping tutorial a couple of days back and it had received a good response from the #rstats community. While we thank you for that, we made a mistake in choosing one of the case study as pointed out by @hrbrmstr in this tweet:
Whomever runs “R Squared Academy” needs to _really_ learn more about web scraping. https://t.co/jOQRAxFVro clearly prohibits the activity in their recent blog post and puts #rstats users in harm's way. Robots check is _not_ enough. This is seriously uncool.
— boB • Everywhere truly is Baltimore • Rudis (@hrbrmstr) April 11, 2019
We choose the case study to appeal to a wide audience but in doing so we set a bad example. We have removed the case study from our post and apologize to the #rstats community for our oversight and promise to be more responsible in the future.
We reiterate it is very important to read the terms and conditions before scraping data from a website and checking the robots.txt file is not sufficient.
Team Rsquared Academy
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
