Articles by Dzidorius Martinaitis

Machine learning for hackers

October 23, 2012 | Dzidorius Martinaitis

Which way do you prefer to learn a new material – deep theoretical background first and practice later or do you like to break things in order to fix them? If latter is your way of learning things, then most likely you will enjoy Machine Learning for Hackers. The book has ... [Read more...]

Garmin data visualization

October 4, 2012 | Dzidorius Martinaitis

People go on rage, when governments initiate surveillance projects like CleanIT, nevertheless share very private data without a doubt. I have to admit, that some data leaks are well buried in the process. Take for example Garmin which produces GPS training devices for runners. In order to see your workouts ... [Read more...]

RStudio server through ssh

August 10, 2012 | Dzidorius Martinaitis

R language has numerous IDEs – RStudio, Vim plugin, Eclipse plugin. RStudio really shines for R language, nevertheless Vim plugin might be well adapted for R if you are Vim guru. Eclipse? Who needs such behemoth? Turns out a student in Ljubljana badly needs it. Most of the time I use ... [Read more...]

Building a presentation, report or paper in R

August 1, 2012 | Dzidorius Martinaitis

If you need to build a presentation, obviously you have following options: Powerpoint alike presentation Online engines LaTex The first two are beloved by business people and the third one is widely used in academia. The objective of the first group is shiny presentation, contrary to the second where asceticism ... [Read more...]

How to track Twitter unfollowers in R

July 18, 2012 | Dzidorius Martinaitis

I have Twitter account and it is relatively easy to see new followers or subscribers. However, I was looking for ways to know who are the unfollowers. I have noticed, that some (un)subscriptions happen in bulks, which made me thinking that either I tweeted some bullshit and upset bunch ...
[Read more...]

My first competition at Kaggle

July 2, 2012 | Dzidorius Martinaitis

For me Kaggle becomes a social network for data scientist, as stackoverflow.com or github.com for programmers. If you are data scientist, machine learner or statistician you better off to have a profile there, otherwise you do not exist. Nevertheless, I won’t bet on rosy future for data ... [Read more...]

GitHub data analysis

May 15, 2012 | Dzidorius Martinaitis

Few weeks ago GitHub announced, that its timeline data is available on bigquery for analysis. Moreover, it offers prizes for the best visualization of the data. Despite my art skills and minimal chances to win beauty contest, I decided to crunch GitHub data and run data analysis. After initial trial ... [Read more...]

How to organize R user group

April 18, 2012 | Dzidorius Martinaitis

The first thing, what you have to do is to estimate how many users will be interested in local R group. I would say, that out of one million inhabitants you can expect 10-20 users. Based on this raw number, you can know, what challenges are waiting for you. If ... [Read more...]

Levenshtein distance in C++ and code profiling in R

March 25, 2012 | Dzidorius Martinaitis

At work, the client requested, if existing search engine could accept singular and plural forms equally, e. g. “partner” and “partners” would lead to the same result. The first option – stemming. In that case, search engine would use root of a word, e. g. “partn”. However, stemming has many weaknesses: ... [Read more...]

I see high frequency data

March 1, 2012 | Dzidorius Martinaitis

In the previous post I shared an example how to get high frequency data from IB broker (well, it is retail version of HFD – it has only best bid/ask and the trades). Now, once you saved some data – what should you do next? Next logical step would be data ... [Read more...]

Vectorized R vs Rcpp

February 1, 2012 | Dzidorius Martinaitis

In my previous post, I tried to show, that Rcpp is 1000 faster than pure R and that generated the fuss in the comments. Being lazy, I didn’t vectorize R code and at the end I was comparing apples vs oranges. To fix that problem, I built a new script, ... [Read more...]

The power of Rcpp

January 30, 2012 | Dzidorius Martinaitis

While ago I built two R scripts to track OMX Baltic Benchmark Fund against the index. One script returns the deviation of  fund from the index and it works fast enough. The second calculates the value of the fund every minute and it used to take for while. For example, ... [Read more...]

ai-class.com vs ml-class.com

December 16, 2011 | Dzidorius Martinaitis

For those who did not know, Stanford university offered free off charge 3 courses at beginning of the autumn. It is kind of shocking – US based institution offers education for free! Take any socialism oriented country and one of the promises is education for free. But it seems, that the argument ... [Read more...]

C++ is dead. Long live C++

December 1, 2011 | Dzidorius Martinaitis

During the summer I was contacted by a hedge fund from Bahamas. The fund was looking for someone with R language skills on-site and insisted for phone interview. Besides obvious questions about finance, statistics, coding and how many tennis balls can fit in Boeing 747 (ok, this question was omitted), they ... [Read more...]

Trading volume forecast for an illiquid stock

August 8, 2011 | Dzidorius Martinaitis

When dealing with transaction cost analysis, a stock’s volume is assumed to be stable or foreseeable.  However, there is different picture, then we are dealing with an illiquid stock. It is relatively easy to forecast the volume of a liquid stock, because trading volume has high autocorrelation – the volumes ... [Read more...]

How big block trades affect stock market prices?

July 27, 2011 | Dzidorius Martinaitis

I will be giving a presentation on “Optimal transaction cost” in Vilnius on  16  August. While preparing the presentation and looking for an optimal execution solution, a natural question arises: does the size of the trade affect stock market price? I’m sure, you would say 100 % yes. Well, you would be ... [Read more...]
1 2

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)