Review: Kölner R Meeting 19 July 2013

July 23, 2013

(This article was first published on mages' blog, and kindly contributed to R-bloggers)

Despite the hot weather and the beginning of the school holiday season in North Rhine Westphalia the Cologne R user group met yet again for two fascinating talks and beer and schnitzel afterwards.

Analysing Twitter data to evaluate the US Dollar / Euro exchange rates

Dietmar Janetzko presented ideas to forecast US Dollar / Euro exchange rate movements for the following day.

To forecast exchange rate movements, Dietmar distinguishes two school of thoughts. The first one is based on the analysis of fundamental analysis, e.g. figures of GDP, debt, unemployment, etc. and the other one is based on news, e.g. announcements from central banks, e.g. from Ben Bernanke and other industry experts.

While the data for the fundamental analysis is usually updated slowly, e.g. annually or quarterly, news can be of higher frequency and less regular. As a result the forecasting horizon in a very liquid market, as the forex market, can vary from one minute or less to next year or decade.

Dietmar’s aim was to forecast the exchange rate for the following day and to outperform the forecast of a random walk. For his experiment he used daily exchange rates from Quandl, which has a nice R interface, and Twitter data from topsy, which gives him access to Twitter’s ‘firehose’.

For his analysis Dietmar focused on the number of tweets of the terms Euro + Crisis + <concept>, whereby he used a dictionary of nearly 600 different concept words.

His training algorithms used functions of the following packages: forecast, caret and car, looking for predictors that have a smaller error than a random error.

It goes without saying that Dietmar hasn’t made millions from his algorithms yet, but the discussion and the end of his presentation will hopefully have given him a few pointers to do just that.

Graphs in R

Afshin Sadeghi, who has a background in Steiner tree methods for Protein-Protein interaction networks, gave an overview of the various graph packages in R. He started his talk with a little overview of the graph terminology of nodes, edges, trees, directed and undirected graphs.

Afshin then gave a brief overview of the various graph packages in R and the different visualisation options. The most popular package seems to be igraph, maintained by Gabor Csardi. Although different packages use sometimes different graph objects, there are often conversation tools available, e.g., allowing users to use the best algorithms from all packages.

You can access Afshin’s slides via our Meetup site.

Next Kölner R meeting

The next meeting has been scheduled for 18 October 2013.

Please get in touch if you would like to present and share your experience, or indeed if you have a request for a topic you would like to hear more about. For more details see also our Meetup page.

Thanks again to Bernd Weiß for hosting the event and Revolution Analytics for their sponsorship.

To leave a comment for the author, please follow the link and comment on their blog: mages' blog. offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...

If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Comments are closed.


Mango solutions

RStudio homepage

Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training


CRC R books series

Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)