Capture, organize, and reuse knowledge from your data science experiments.
KMDS is a Python-based tool designed for systematic knowledge management in data science projects. It helps you document the incremental process of experimentation, including context, decisions, and rationale, ensuring that valuable insights are not lost over time.
Data scientists live by experimentation. However, the context and rationale behind each experiment are often documented in an ad-hoc manner. When it's time to revisit a question or build upon previous work, it's difficult to piece together the research and its results. KMDS addresses this by providing a structured way to log your findings.
π₯ Watch a quick overview of KMDS: YouTube Video
- Structured Logging: Log findings from your exploratory data analysis, data representation, and modeling phases.
- Knowledge Base Export: Export your knowledge base to communicate your findings to your team or management.
- Integration with Generative AI: Use generative AI tools like NotebookLM to create reports, videos, and other documentation artifacts from your exported knowledge base.
- Process Agnostic: Complements process guidelines like CRISP-DM and semantic vocabularies like OpenML by capturing the "why" behind your data science tasks.
Install KMDS in your Python environment:
pip install kmdsAs you work through your analysis, log your findings to kmds. Check out the examples below.
This repository includes two detailed examples:
-
Analytics Example: Evaluates the effectiveness of a ticket resolution help desk.
-
Machine Learning Example: Uses Principal Component Analysis (PCA) to summarize online store sales activity.
We welcome contributions! If you have an idea for a new feature or would like to report a bug, please open an issue. If you'd like to contribute code, please fork the repository and submit a pull request.
This project is licensed under the Apache 2.0 License. See the LICENSE file for details.
If you have questions or are interested in the following, please schedule a meeting:
- Help with a data analysis task for your use case.
- Developing a custom ontology-based solution.
- Integrating KMDS with other tools in your data science stack.