DataJoint Documentation¶

Documentation for DataJoint 2.0+

Python code examples in this documentation use DataJoint 2.0 syntax. If you're using DataJoint pre-2.0, see the Migration Guide or visit legacy docs.

About DataJoint¶

DataJoint is a framework for scientific data pipelines built on the Relational Workflow Model—a paradigm where your database schema is an executable specification of your workflow.

pipeline

Unlike traditional databases that merely store data, DataJoint pipelines process data: tables represent workflow steps, foreign keys encode computational dependencies, and the schema itself defines what computations exist and how they relate. Combined with Object-Augmented Schemas for seamless large-data handling, DataJoint delivers reproducible, scalable scientific computing with full provenance tracking.

Concepts

Understand the Relational Workflow Model and DataJoint's core principles

Learn the concepts

Tutorials

Build your first pipeline with hands-on Jupyter notebooks

Start learning

How-To Guides

Practical guides for common tasks and patterns

Find solutions

Reference

Specifications, API documentation, and technical details

Look it up

DataJoint Elements

Reusable pipeline modules for neurophysiology experiments

Explore Elements

DataJoint Platform

A cloud platform for automated analysis workflows. It relies on DataJoint Python and DataJoint Elements.

Learn more | Sign-in

New to DataJoint? Start with the Quick Start tutorial.