Blog Archives

Transfer learning and semi-supervised learning with ruimtehol

May 13, 2019
By

Last week the R package ruimtehol was updated on CRAN giving R users who perform Natural Language Processing access to the possibility to Allow to do semi-supervised learning (learning where you have both text as labels but not always both of them on ...

Read more »

Koning Filip lijkt op …

March 26, 2019
By
Koning Filip lijkt op …

Last call for the course on Text Mining with R, held next week in Leuven, Belgium on April 1-2. Viewing the course description as well as subscription can be done at https://lstat.kuleuven.be/training/coursedescriptions/text-mining-with-r Some things you'll learn ... is that King Filip of Belgium is similar to public expenses if we just look at open data from questions and answers in...

Read more »

Human Face Detection with R

March 22, 2019
By
Human Face Detection with R

Doing human face detection with computer vision is probably something you do once unless you work for police departments, you work in the surveillance industry or for the Chinese government. In order to reduce the time you lose on that small exercise, bnosac created a small R package (source code available at https://github.com/bnosac/image) which wraps the weights of a...

Read more »

Making thematic maps for Belgium

February 26, 2019
By
Making thematic maps for Belgium

For people from Belgium working in R with spatial data, you can find excellent workshop material on creating thematic maps for Belgium at https://workshop.mhermans.net/thematic-maps-r/index.html by Maarten Hermans (researcher at the HIVA - Onderzoeksinstituut voor Arbeid en Samenleving - https://mhermans.net). The plots are heavily based on BelgiumMaps.Statbel - an R package from bnosac released 2 years ago (more info at...

Read more »

An overview of the NLP ecosystem in R (#nlproc #textasdata)

February 4, 2019
By
An overview of the NLP ecosystem in R (#nlproc #textasdata)

At BNOSAC, R is used a lot to perform text analytics as it is an excellent tool that provides anything a data scientist needs to perform data analysis on text in a business settings. For users unfamiliar with all the possibilities that the wealth of R ...

Read more »

Neural Text Modelling with R package ruimtehol

January 15, 2019
By

Last week the R package ruimtehol was released on CRAN (https://github.com/bnosac/ruimtehol) allowing R users to easily build and apply neural embedding models on text data. It wraps the 'StarSpace' library https://github.com/facebookresearch/StarSpace allowing users to calculate word, sentence, article, document, webpage, link and entity 'embeddings'. By using the 'embeddings', you can perform text based multi-label classification, find similarities between texts and...

Read more »

You did a sentiment analysis with tidytext but you forgot to do dependency parsing to answer WHY is something positive/negative

January 8, 2019
By
You did a sentiment analysis with tidytext but you forgot to do dependency parsing to answer WHY is something positive/negative

A small note on the growing list of users of the udpipe R package. In the last month of 2018, we've updated the package on CRAN with some noticeable changes The default models which are now downloaded with the function udpipe_download_model are now models built on Universal Dependencies 2.3 (released on 2018-11-15) This means udpipe now has models for 60 languages....

Read more »

Starspace for NLP #nlproc

December 4, 2018
By
Starspace for NLP #nlproc

Our recent addition to the NLP R universe is called R package ruimtehol which is open sourced at https://github.com/bnosac/ruimtehol This R package is a wrapper around Starspace which provides a neural embedding model for doing the following on text: Text classification Learning word, sentence or document level embeddings Finding sentence or document similarity Ranking web documents Content-based recommendation (e.g. recommend text/music based on the...

Read more »

crfsuite for natural language processing

October 29, 2018
By
crfsuite for natural language processing

A new R package called crfsuite supported by BNOSAC landed safely on CRAN last week. The crfsuite package (https://github.com/bnosac/crfsuite) is an R package specific to Natural Language Processing and allows you to easily build and apply models for named entity recognition text chunking part of speech tagging intent recognition or classification of any category you have in mind The focus of the...

Read more »

Last call for the course on text mining of next week

October 2, 2018
By
Last call for the course on text mining of next week

Last call for the 2-day course on Text Mining with R, held next week (08-09 October 2018) in Brussels, Belgium. Subscribe at https://www.eventbrite.co.uk/e/dsb2018-text-mining-with-r-jan-wijffels-bnosac-session-03-04-tickets-50586501588 You'll learn during that course the following: Cleaning of text data, regular expressions String distances Graphical displays of text data Natural language processing: stemming, parts-of-speech tagging, tokenization, lemmatisation, dependency parsing, noun phrase detection and keyword extraction Entity recognition & chunking using Conditional...

Read more »

Search R-bloggers

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)