Articles by Jan Górecki - R

Boost Your Data Munging with R

June 29, 2016 | Jan Górecki - R

This article was first published on the toptal.com blog. Additionally be noticed that my blog is migrating to new host due to GitHub Pages drops support for RDiscount, Redcarpet, and RedCloth (Textile) markup engines. Old host will be still available but new posts will be published on jangorecki.gitlab....
[Read more...]

Boost Your Data Munging with R

June 29, 2016 | Jan Górecki - R

This article was first published on the toptal.com blog. Additionally be noticed that my blog is migrating to new host due to GitHub Pages drops support for RDiscount, Redcarpet, and RedCloth (Textile) markup engines. Old host will be still available but new posts will be published on jangorecki.gitlab....
[Read more...]

Scaling data.table using index

November 22, 2015 | Jan Górecki - R

R can handle fairly big data working on a single machine, 2B (2E9) rows and couple of columns require about 100 GB of memory. This is already well enough to care about performance. With this post I'm going discuss scalability of filter queries. The index has been introduced to data.table ...
[Read more...]

Scaling data.table using index

November 22, 2015 | Jan Górecki - R

R can handle fairly big data working on a single machine, 2B (2E9) rows and couple of columns require about 100 GB of memory. This is already well enough to care about performance. With this post I'm going discuss scalability of filter queries. The index has been introduced to data.table ...
[Read more...]

Accept payments in shiny app

August 3, 2015 | Jan Górecki - R

Have you ever think about accepting payments in your shiny app? Probably not, but now you can start ;) Shiny apps are usually single task, not very heavy websites. It may be not so easy to turn them into online shop/service provider. Anyway you can find this post interesting as ...
[Read more...]

Accept payments in shiny app

August 3, 2015 | Jan Górecki - R

Have you ever think about accepting payments in your shiny app? Probably not, but now you can start ;) Shiny apps are usually single task, not very heavy websites. It may be not so easy to turn them into online shop/service provider. Anyway you can find this post interesting as ...
[Read more...]

Data Warehousing with R

June 29, 2015 | Jan Górecki - R

Under this link you can find today's slides from the Cardiff R User Group meeting. On the slides you may find interesting packages from the Data Warehousing / ETL perspective. Including few examples and a lot of links to packages repositories. Slides are fully reproducible, including connection via DBI, RJDBC and ... [Read more...]

Auditing data transformation

June 2, 2015 | Jan Górecki - R

Auditing data transformation can be simply described as gathering metadata about the transformation process. The most basics metadata would be a timestamp, atomic transformation description, data volume on input, data volume on output, time elapsed. If you work with R only interactively you may find it more like a fancy ... [Read more...]

R in Business Intelligence

January 18, 2015 | Jan Górecki - R

Business Intelligence (BI) can be simply described as extracting useful informations from the data. This is quite a broad process as the source data structure (and quality) can vary, as well the useful information structure can vary. More technically process of such transformation can be described as ETL (extract, transform, ... [Read more...]

Data anonymization in R

November 6, 2014 | Jan Górecki - R

Use cases Public reports. Public data sharing, e.g. R packages download logs from CRAN's RStudio mirror - cran-logs.rstudio.com - mask ip addresses. Reports or data sharing for external vendor. Development works can operate on anonymized PRODUCTION data. Manually or semi-manually populated data can often brings some new ... [Read more...]

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)