Here is a small overview of some selected public projects.
Open source projects
Currently I am the maintainer of the following R packages:
parallelDist | R, C++11
The parallelDist package provides a fast parallelized alternative to R’s native ‘dist’ function to calculate distance matrices for continuous, binary, and multi-dimensional input matrices and offers a broad variety of distance functions from the ‘stats’, ‘proxy’ and ‘dtw’ R packages. For ease of use, the ‘parDist’ function extends the signature of the ‘dist’ function and uses the same parameter naming conventions as distance methods of existing R packages. Currently 39 different distance methods are supported.
ibmdbR | R, SQL
Functionality required to efficiently use R with IBM(R) Db2(R) Warehouse offerings (formerly IBM dashDB(R)) and IBM Db2 for z/OS(R) in conjunction with IBM Db2 Analytics Accelerator for z/OS. Many basic and complex R operations are pushed down into the database, which removes the main memory boundary of R and allows to make full use of parallel processing in the underlying database.
IBM Db2/dashDB Warehouse Spark integration | Java 7, Docker, REST Implemented distributed multi-tenant Apache Spark cluster manager, which allows to spawn resource restricted Spark clusters for single users.
Data Mining Cup 2014 | R, SPSS Modeler, Python
Second place (best german team) in international DMC student contest. Objective: Minimizing prediction error of a classification problem.
SAP Hana: In-database KORD association rule mining | C++11
While working as a student at SAP, I've implemented a version of the "K-optimal rule discovery" association rule mining algorithm, which is now part of the predictive analysis library (PAL) of SAP HANA.
Data Mining Cup 2013 | R, SPSS Modeler, Python
Third place in international DMC student contest. Objective: Minimizing prediction error of a classification problem.
Battlefield Stats Viewer | Delphi
Multilingual statistics GUI app for the game "Battlefield 2" developed in my free time around 2006. More than 150.000 downloads from my project page and feature on gamespy.com.