Metadata & tags#

One of the core benefits of Dagster is fostering collaboration between the engineers who build data pipelines and the end users who consume the data pipelines produce.

Dagster offers several ways to provide useful information and documentation alongside your data pipelines, including metadata and tagging.

Benefits#

Using metadata and tags helps you:

Create built-in documentation that makes your pipelines easy to understand
Provide useful context for other users of your project (and your future self!)
Improve the ease of debugging when issues arise
Organize definitions in your Dagster project and improve filtering in the Dagster UI

How metadata works#

There are two main types of metadata in Dagster: definition and runtime. In code, you can attach metadata to definitions like assets, jobs, and runs. You'll then be able to view the metadata in the Dagster UI.

Definition metadata#

Definition metadata is information that's fixed or doesn't frequently change. For example, definition metadata could be:

The columns an asset is expected to contain
A link to source file in Github that contains the code that defines the asset or job

Refer to the Attaching definition metadata section of the Software-defined Assets documentation for more information.

Runtime metadata#

Runtime metadata is information that changes after an action occurs. This could be how many records were processed, how long an asset took to materialize, etc. Refer to the Recording materialization metadata section of the Software-defined Assets documentation for more information.

How tags work#