You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+14-9Lines changed: 14 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,16 +1,21 @@
1
1
# DataJoint for Python
2
2
3
-
DataJoint is an open-source Python framework for building scientific data pipelines.
4
-
It implements the **Relational Workflow Model**—a paradigm that extends relational
5
-
databases with native support for computational workflows.
3
+
DataJoint is a framework for scientific data pipelines that introduces the **Relational Workflow Model**—a paradigm where your database schema is an executable specification of your workflow.
6
4
7
-
**Key Features:**
5
+
Traditional databases store data but don't understand how it was computed. DataJoint extends relational databases with native workflow semantics:
8
6
9
-
-**Declarative schema design** — Define tables and relationships in Python
-**Built-in computation** — Imported and Computed tables run automatically
12
-
-**Data integrity** — Referential integrity and transaction support
13
-
-**Reproducibility** — Immutable data with full provenance
7
+
-**Tables represent workflow steps** — Each table is a step in your pipeline where entities are created
8
+
-**Foreign keys encode dependencies** — Parent tables must be populated before child tables
9
+
-**Computations are declarative** — Define *what* to compute; DataJoint determines *when* and tracks *what's done*
10
+
-**Results are immutable** — Computed results preserve full provenance and reproducibility
11
+
12
+
### Object-Augmented Schemas
13
+
14
+
Scientific data includes both structured metadata and large data objects (time series, images, movies, neural recordings, gene sequences). DataJoint solves this with **Object-Augmented Schemas (OAS)**—a unified architecture where relational tables and object storage are managed as one system with identical guarantees for integrity, transactions, and lifecycle.
15
+
16
+
### DataJoint 2.0
17
+
18
+
**DataJoint 2.0** solidifies these core concepts with a modernized API, improved type system, and enhanced object storage integration. Existing users can refer to the [Migration Guide](https://docs.datajoint.com/migration/) for upgrading from earlier versions.
0 commit comments