Skip to content
Change the repository type filter

All

    Repositories list

    • Metis

      Public
      Metis is a framework to automatically assess the quality of tabular data across multiple dimensions.
      Python
      1513Updated Mar 23, 2026Mar 23, 2026
    • 0000Updated Mar 20, 2026Mar 20, 2026
    • MELArt_experiments

      Public
      Experiments in the evaluation of multimodla entity linking models on MELArt
      Python
      MIT License
      0000Updated Mar 19, 2026Mar 19, 2026
    • DisMis

      Public
      Disguised MIssing Value Detection & Benchmarking
      Python
      0000Updated Mar 1, 2026Mar 1, 2026
    • wf-optimization

      Public
      C++
      MIT License
      0000Updated Feb 10, 2026Feb 10, 2026
    • hamilton

      Public
      Python
      0000Updated Feb 9, 2026Feb 9, 2026
    • burr

      Public
      Python
      1300Updated Feb 5, 2026Feb 5, 2026
    • MELArt

      Public
      Jupyter Notebook
      MIT License
      0200Updated Feb 1, 2026Feb 1, 2026
    • dependency-based-qo

      Public
      Python
      0300Updated Dec 12, 2025Dec 12, 2025
    • dependency-based-optimization-hyrise

      Public
      C++
      MIT License
      0000Updated Dec 12, 2025Dec 12, 2025
    • hypex

      Public
      A Framework for Hyperparameter Optimization in Time Series Anomaly Detection
      Jupyter Notebook
      MIT License
      0000Updated Dec 1, 2025Dec 1, 2025
    • schuyler

      Public
      Python
      0000Updated Nov 7, 2025Nov 7, 2025
    • Progressive HAC system for variable-length time series
      TypeScript
      MIT License
      0000Updated Oct 1, 2025Oct 1, 2025
    • SHACL-DQA

      Public
      Prototype for SHACL-based data quality assessment
      Python
      0200Updated Sep 12, 2025Sep 12, 2025
    • MetaSynth

      Public
      Metadata-based Synthesis of Realistic Tabular Data using Large Language Models
      Python
      0000Updated Sep 1, 2025Sep 1, 2025
    • Armadillo

      Public
      Table Overlap Approximation and Datasets
      Jupyter Notebook
      1510Updated Jun 21, 2025Jun 21, 2025
    • Metanome

      Public
      The source repository of the Metanome tool
      Java
      Apache License 2.0
      67190307Updated Jun 5, 2025Jun 5, 2025
    • Java
      2100Updated Apr 29, 2025Apr 29, 2025
    • Pollock

      Public
      Pollock is a benchmark for data loading on character-delimited files.
      Python
      52600Updated Apr 9, 2025Apr 9, 2025
    • Java
      0000Updated Apr 2, 2025Apr 2, 2025
    • Strudel

      Public
      Python
      Apache License 2.0
      0000Updated Apr 2, 2025Apr 2, 2025
    • AggreCol

      Public
      Python
      Apache License 2.0
      0000Updated Apr 2, 2025Apr 2, 2025
    • hopf

      Public
      Holistic primary key and foreign key detection
      Java
      0200Updated Apr 2, 2025Apr 2, 2025
    • pyro

      Public
      Pyro is an algorithm to detect approximate keys and functional dependencies in relational datasets.
      Java
      Apache License 2.0
      5600Updated Mar 24, 2025Mar 24, 2025
    • DQ4AI

      Public
      Experimental study of the effects of data quality dimensions on machine learning performance
      Jupyter Notebook
      MIT License
      51020Updated Jan 27, 2025Jan 27, 2025
    • ReCLAIM

      Public
      Digital platform to explore Nazi-looted cultural artefacts (bachelors project with JDCRP)
      Python
      0000Updated Nov 5, 2024Nov 5, 2024
    • prisma

      Public
      Repository for schema matching data and source code, used for PRISMA
      Java
      5000Updated Oct 24, 2024Oct 24, 2024
    • AutoTSAD

      Public
      Unsupervised Anomaly Detection System for Univariate Time Series
      Jupyter Notebook
      MIT License
      22000Updated Sep 25, 2024Sep 25, 2024
    • Weever

      Public
      Java
      GNU General Public License v3.0
      2200Updated Sep 17, 2024Sep 17, 2024
    • jet

      Public
      Jaunty Estimation of Hierarchical Time Series Clustering
      Python
      1000Updated Aug 2, 2024Aug 2, 2024