.github/profile/README.md

Hi there 👋. Thanks for stopping by.

I suppose you came here to find out who I am and what I do.

My wife says I am a tree-hugging hacker (in Dutch: geitenwollensokken nerd), which I think is pretty accurate.

I spend most of my working days on something related to data science, data architecture and platform engineering. I teach at EAISI Academy Professional Education. I try to keep track of interesting teaching materials by collating an anthology of data science.

I mostly work in the healthcare domain, trying to move the needle towards better data interoperability, with open source development as a lever. I am particularly interested in implementing decentralized data processing networks and have contributed to standardizing the concept of data stations (in Dutch) within the context of the European Health Data Space. As the lead architect at PLUGIN (also in Dutch), I am involved in deploying a nationwide federated learning network in the Netherlands.

I am a big open source advocate, particularly of the PyData ecosystem. With a gang of like-minded open source enthusiasts, I am developing a 'data-platform-in-a-single-repo' that combines the best components from the composable data stack into a sovereign data stack. As of January 2026 we are testing the platform on Scaleway. The project is named wisent, after the European bison, which is currently being reintroduced in the Netherlands. I like to think that we should be able to regain our independence and strategic autonomy in data here in the Low Countries, given our long-standing heritage with heroes like Guido van Rossum, Edsger Dijkstra and, more recently, Ritchie Vink, Hannes Mühleisen and Maarten Grootendorst, to name a few. Drop me a note if you are interested in collaborating.

Pinned

  1. anthology-of-data-science/anthology-of-data-science.github.io Public

    Source code of https://anthology-of-data.science

    Jupyter Notebook 2

  2. hands-on-federated-analytics Public

    Working on a new book

    TeX

  3. dkapitan.github.io Public

    My professional website

    Jupyter Notebook 6 2

  4. srdp Public

    The Single Repo Data Platform

    HCL 8

  5. data-station-specification Public

    Forked from Health-RI/data-station-specification

    Specification of composable data stations

    TeX

  6. dagster-data-station Public

    Python 1