Blog Archives

Neural Text Modelling with R package ruimtehol

January 15, 2019
By

Last week the R package ruimtehol was released on CRAN (https://github.com/bnosac/ruimtehol) allowing R users to easily build and apply neural embedding models on text data. It wraps the 'StarSpace' library https://github.com/facebookresearch/StarSpace allowing users to calculate word, sentence, article, document, webpage, link and entity 'embeddings'. By using the 'embeddings', you can perform text based multi-label classification, find similarities between texts and...

Read more »

You did a sentiment analysis with tidytext but you forgot to do dependency parsing to answer WHY is something positive/negative

January 8, 2019
By
You did a sentiment analysis with tidytext but you forgot to do dependency parsing to answer WHY is something positive/negative

A small note on the growing list of users of the udpipe R package. In the last month of 2018, we've updated the package on CRAN with some noticeable changes The default models which are now downloaded with the function udpipe_download_model are now models built on Universal Dependencies 2.3 (released on 2018-11-15) This means udpipe now has models for 60 languages....

Read more »

Starspace for NLP #nlproc

December 4, 2018
By
Starspace for NLP #nlproc

Our recent addition to the NLP R universe is called R package ruimtehol which is open sourced at https://github.com/bnosac/ruimtehol This R package is a wrapper around Starspace which provides a neural embedding model for doing the following on text: Text classification Learning word, sentence or document level embeddings Finding sentence or document similarity Ranking web documents Content-based recommendation (e.g. recommend text/music based on the...

Read more »

crfsuite for natural language processing

October 29, 2018
By
crfsuite for natural language processing

A new R package called crfsuite supported by BNOSAC landed safely on CRAN last week. The crfsuite package (https://github.com/bnosac/crfsuite) is an R package specific to Natural Language Processing and allows you to easily build and apply models for named entity recognition text chunking part of speech tagging intent recognition or classification of any category you have in mind The focus of the...

Read more »

Last call for the course on text mining of next week

October 2, 2018
By
Last call for the course on text mining of next week

Last call for the 2-day course on Text Mining with R, held next week (08-09 October 2018) in Brussels, Belgium. Subscribe at https://www.eventbrite.co.uk/e/dsb2018-text-mining-with-r-jan-wijffels-bnosac-session-03-04-tickets-50586501588 You'll learn during that course the following: Cleaning of text data, regular expressions String distances Graphical displays of text data Natural language processing: stemming, parts-of-speech tagging, tokenization, lemmatisation, dependency parsing, noun phrase detection and keyword extraction Entity recognition & chunking using Conditional...

Read more »

udpipe version 0.7 for Natural Language Processing (#NLP) alongside #tidytext, #quanteda, #tm

September 11, 2018
By
udpipe version 0.7 for Natural Language Processing (#NLP) alongside #tidytext, #quanteda, #tm

This blogpost announces the release of the udpipe R package version 0.7 on CRAN. udpipe is an R package which does tokenization, parts of speech tagging, lemmatization, morphological feature tagging and dependency parsing. It's main feature is that it is a lightweight R package which works on more than 50 languages and gives you rich NLP output out of...

Read more »

How to detect hatespeech in plain text #schildnvrienden

September 7, 2018
By
How to detect hatespeech in plain text #schildnvrienden

Yesterday there was a pretty controversial Pano TV documentary called 'Wie is Schild & Vrienden echt' at the national television channel 'één' (https://www.vrt.be/vrtnu/a-z/pano/2018/pano-s2018a10). The documentary revealed the internal communication of a right-wing group from Belgium, called #schildnvrienden. After that, there was a show by Van Gils & gasten where a representative of the police explained or tried not to explain...

Read more »

Upcoming public courses on Text mining with R, Statistical machine learning with R, Applied Spatial Modelling with R, Advanced R programming, Computer Vision and Image Recognition

September 6, 2018
By
Upcoming public courses on Text mining with R, Statistical machine learning with R, Applied Spatial Modelling with R, Advanced R programming, Computer Vision and Image Recognition

I'm happy to announce that the following list of courses for R users is ready to be booked. All courses are face-to-face courses held in Belgium. 08-09/10/2018: Text mining with R. Brussels (Belgium). http://di-academy.com/bootcamp + send mail to [email protected] 15-16/10/2018: Statistical machine learning with R. Leuven (Belgium). Subscribe here 05-06/11/2018: Text mining with R. Leuven (Belgium). Subscribe here 19-20/12/2018: Applied spatial modelling with R....

Read more »

Basic R Automation

May 11, 2018
By

Last Wednesday, a small presentation was given at the RBelgium meetup in Brussels on Basic R Automation. For those of you who could not attend, here are the slides of that presentation which showed the use of the cronR and taskscheduleR R packages for ...

Read more »

An overview of keyword extraction techniques

April 3, 2018
By
An overview of keyword extraction techniques

In this blogpost, we will show 6 keyword extraction techniques which allow to find keywords in plain text. Keywords are frequently occuring words which occur somehow together in plain text. Common examples are New York, Monte Carlo, Mixed Models, Brussels Hoofdstedelijk Gewest, Public Transport, Central Station, p-values, ... If you master these techniques, it will allow you to easily step...

Read more »

Search R-bloggers


Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)