Articles by Rstats on Julia Silge

Text Mining of Stack Overflow Questions

July 5, 2017 | Rstats on Julia Silge

Note: Cross-posted with the Stack Overflow blog. This week, my fellow Stack Overflow data scientist David Robinson and I are happy to announce the publication of our book Text Mining with R with O’Reilly. We are so excited to see this project out in the world, and so relieved ...
[Read more...]

Using tidycensus and leaflet to map Census data

June 23, 2017 | Rstats on Julia Silge

Recently, I have been following the development and release of Kyle Walker’s tidycensus package. I have been filled with amazement, delight, and well, perhaps another feeling… There should be a word for “the regret felt when an R package, which would have saved untold hours of your life, is released”… #... [Read more...]

tidytext 0.1.3

June 17, 2017 | Rstats on Julia Silge

I am pleased to announce that tidytext 0.1.3 is now on CRAN! In this release, my collaborator David Robinson and I have fixed a handful of bugs, added tidiers for LDA models from the mallet package, and updated functions for changes to quanteda’s API. You can check out the NEWS ...
[Read more...]

Mining CRAN DESCRIPTION Files

May 3, 2017 | Rstats on Julia Silge

A couple of weeks ago, I saw on Dirk Eddelbuettel’s blog that R 3.4.0 was going to include a function for obtaining information about packages currently on CRAN, including basically everything in DESCRIPTION files. When R 3.4.0 was released, this was one of the things I was most immediately excited about ...
[Read more...]

How Do You Discover R Packages?

March 19, 2017 | Rstats on Julia Silge

Like I mentioned in my last blog post, I am contributing to a session at useR! 2017 this coming July that will focus on discovering and learning about R packages. This is an increasingly important issue for R users as we all decide which of the 10,000+... [Read more...]

Scraping CRAN with rvest

March 5, 2017 | Rstats on Julia Silge

I am one of the organizers for a session at useR! 2017 this coming July that will focus on discovering and learning about R packages. How do R users find packages that meet their needs? Can we make this process easier? As somebody who is relatively new... [Read more...]

Women in the 2016 Stack Overflow Survey

January 18, 2017 | Rstats on Julia Silge

Note: Cross-posted with the Stack Overflow blog. The 2017 Stack Overflow Developer Survey opened last week, and we on the Data Team are looking forward to analyzing the survey results to better understand our developer community. I am particularly interested in women in tech, for probably obvious reasons, and recently I ... [Read more...]

Text Mining in R: A Tidy Approach

January 13, 2017 | Rstats on Julia Silge

I spoke on approaching text mining tasks using tidy data principles at rstudio::conf yesterday. I was so happy to have the opportunity to speak and the conference has been a great experience. If you want to catch up on what has been going on at rstudio::conf, Karl Broman ... [Read more...]

Reddit Responds to the Election

December 5, 2016 | Rstats on Julia Silge

It’s been about a month since the U.S. presidential election, with Donald Trump’s victory over Hillary Clinton coming as a surprise to most. Reddit user Jason Baumgartner collected and published every submission and comment posted to Reddit on the day of (and a bit surrounding) the U.... [Read more...]

Measuring Gobbledygook

November 24, 2016 | Rstats on Julia Silge

In learning more about text mining over the past several months, one aspect of text that I’ve been interested in is readability. A text’s readability measures how hard or easy it is for a reader to read and understand what a text is saying; it depe... [Read more...]
