One of the core benefits of Dagster is fostering collaboration between the engineers who build data pipelines and the end users who consume the data pipelines produce.
Dagster offers several ways to provide useful information and documentation alongside your data pipelines, including metadata and tagging.
Using metadata and tags helps you:
There are two main types of metadata in Dagster: definition and runtime. In code, you can attach metadata to definitions like assets, jobs, and runs. You'll then be able to view the metadata in the Dagster UI.
Definition metadata is information that's fixed or doesn't frequently change. For example, definition metadata could be:
Refer to the Attaching definition metadata section of the Software-defined Assets documentation for more information.
Runtime metadata is information that changes after an action occurs. This could be how many records were processed, how long an asset took to materialize, etc. Refer to the Recording materialization metadata section of the Software-defined Assets documentation for more information.
In addition to definition metadata, you can also use tags to label and organize definitions in your Dagster project. Definitions like assets, jobs, and runs can have multiple tags, which are applied in code. Refer to the Tags documentation for more information and examples.