Compute and Process Configuration Overview

Terms and definitions

Object | Description
Compute Configuration | Stores all configuration settings required for the Databricks job: the compute configuration, the job configuration, and a few DataForge-specific parameters that control job execution. Each compute configuration record is linked directly to its Databricks job through the unique job_id attribute.
Process Configuration | Consists of one default compute configuration and an optional set of compute configurations for specific process types. A process configuration is attached to each Source in DataForge.
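The relationship between the two objects can be sketched in a few lines of Python. This is only an illustrative model, not DataForge's actual schema or API: the class names, the `name` field, the `overrides` mapping, the `resolve` method, and the process type strings are all hypothetical, and the fallback to the default configuration when no process-specific one exists is an assumption drawn from the definitions above. Only `job_id` comes from the text.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass(frozen=True)
class ComputeConfiguration:
    """Hypothetical stand-in for a compute configuration record."""
    name: str    # illustrative label, not a documented attribute
    job_id: int  # unique ID of the linked Databricks job (from the text)


@dataclass
class ProcessConfiguration:
    """Hypothetical stand-in: one default plus optional per-process-type configurations."""
    default: ComputeConfiguration
    overrides: Dict[str, ComputeConfiguration] = field(default_factory=dict)

    def resolve(self, process_type: str) -> ComputeConfiguration:
        # Assumed behavior: use the process-specific configuration if one
        # is defined, otherwise fall back to the default.
        return self.overrides.get(process_type, self.default)


# Example values are made up for illustration.
default_cc = ComputeConfiguration(name="default", job_id=101)
heavy_cc = ComputeConfiguration(name="large-cluster", job_id=202)

process_config = ProcessConfiguration(
    default=default_cc,
    overrides={"ingestion": heavy_cc},  # "ingestion" is a hypothetical process type
)

ingestion_job = process_config.resolve("ingestion").job_id  # process-specific configuration
fallback_job = process_config.resolve("parse").job_id       # no override, so the default applies
```

In this sketch a Source would hold a reference to one `ProcessConfiguration`, and each process run would look up its compute configuration, and therefore its Databricks job, by process type.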

Below is a high-level diagram showing how compute and process configurations relate to other DataForge metadata tables and Databricks objects.