Posts Tagged ‘latent dirichlet allocation’

Text Mining

October 15, 2012

When it comes down to it, R does a really good job of handling structured data like matrices and data frames. However, its ability to work with unstructured data is still a work in progress. It can and does handle text mining, but the documentation is incomplete and the capabilities still don’t compare to other…


Topic Modeling the Sarah Palin Emails

June 27, 2011

tl;dr: Browse through Sarah Palin’s emails, automagically organized by topic, here.

LDA-based Email Browser

Earlier this month, several thousand emails from Sarah Palin’s time as governor of Alaska were released. The emails weren’t organized in any fashion, though, so to make them easier to browse, I did some topic modeling (in particular, using latent Dirichlet…
