Acceldata Orchestrator
Enterprise
| Image | $DATAOPS_ACCELDATA_RUNNER_IMAGE |
|---|
The Acceldata orchestrator enables automated creation and management of data quality (DQ) policies in Acceldata's Data Observability Platform. This orchestrator integrates with your DataOps pipelines to publish data quality rules based on your dbt test definitions.
Usage
"Acceldata DQ Policy Creation":
extends:
- .agent_tag
stage: "Data Quality"
image: $DATAOPS_ACCELDATA_RUNNER_IMAGE
variables:
DATAOPS_ACCELDATA_HOST: "your-host.acceldata.com"
DATAOPS_ACCELDATA_ACCESS_KEY: DATAOPS_VAULT(ACCELDATA.ACCESS_KEY)
DATAOPS_ACCELDATA_SECRET_KEY: DATAOPS_VAULT(ACCELDATA.SECRET_KEY)
DATAOPS_ACCELDATA_ASSEMBLY_NAME: "Your_Assembly"
SNOWFLAKE_DATABASE: "YOUR_DATABASE"
script:
- /dataops
icon: ${ACCELDATA_ICON}
The Acceldata orchestrator assumes that a DataOps modeling and transformation job completed running — including the dbt test execution — in an earlier stage of the DataOps pipeline. It uses the dbt test definitions to automatically create corresponding data quality policies in Acceldata.
Supported parameters
Core Configuration Parameters
| Parameter | Required/Default | Description |
|---|---|---|
DATAOPS_ACCELDATA_HOST | REQUIRED | Acceldata host URL (without https://), e.g., your-host.acceldata.com |
DATAOPS_ACCELDATA_ACCESS_KEY | REQUIRED | API access key for authentication |
DATAOPS_ACCELDATA_SECRET_KEY | REQUIRED | API secret key for authentication |
DATAOPS_ACCELDATA_ASSEMBLY_NAME | REQUIRED | Spark assembly name in Acceldata |
SNOWFLAKE_DATABASE | Optional, defaults to OPS_DP_PUB | Snowflake database name |
DATAOPS_ACCELDATA_TAG_NAME | Optional, auto-detected | Domain tag name for policies/assets |
DATAOPS_ACCELDATA_NOTIFICATION_CHANNEL_ID | Optional | Notification channel ID for alerts |
DATAOPS_ACCELDATA_DBT_MODELS_PATH | Optional, defaults to ${CI_PROJECT_DIR}/dataops/modelling/ | Path to dbt schema YAML files |
ADOC_POLICY_ACTION | Optional | Action: LIST, EXECUTE, DELETE, GET_ASSETS, or CRAWLER |
ADOC_RULE_IDS | Optional | Comma-separated rule IDs for specific operations |
DATAOPS_ACCELDATA_DISABLE_SCHEDULING | Optional, defaults to false | Set to true to create unscheduled policies |
DATAOPS_ACCELDATA_SCHEDULE_START_TIME | Optional, defaults to 2 | Schedule start hour (0-23) |
DATAOPS_ACCELDATA_SCHEDULE_END_TIME | Optional, defaults to 10 | Schedule end hour (0-23) |
DATAOPS_ACCELDATA_SCHEDULE_INTERVAL_MINUTES | Optional, defaults to 10 | Minutes between schedule slots |
Standard Policies and Monitoring Parameters
| Parameter | Required/Default | Description |
|---|---|---|
DATAOPS_ACCELDATA_ENABLE_STANDARD_POLICIES | Optional, defaults to false | Enable schema drift, freshness, and anomaly policies |
DATAOPS_ACCELDATA_PROFILING_SCHEDULE | Optional, defaults to 0 0 0,8,16 * * ? | Cron schedule for profiling |
DATAOPS_ACCELDATA_ANOMALY_SENSITIVITY | Optional, defaults to MEDIUM | Anomaly sensitivity: LOW, MEDIUM, or HIGH |
DATAOPS_ACCELDATA_ANOMALY_TRAINING_WINDOW_DAYS | Optional, defaults to 7 | Anomaly training window (days) |
DATAOPS_ACCELDATA_FRESHNESS_CRON_SCHEDULE | Optional, defaults to 0 12 0/4 * * ? | Cron schedule for freshness checks |
DATAOPS_ACCELDATA_FRESHNESS_POLICY_TYPES | Optional | Comma-separated: data_freshness, row_count, row_count_drift, asset_size, asset_size_drift. See Freshness Configuration |
DATAOPS_ACCELDATA_ENABLE_DATA_FRESHNESS | Optional, defaults to true | Enable Data Freshness policy |
DATAOPS_ACCELDATA_ENABLE_ABSOLUTE_ROW_COUNT | Optional, defaults to true | Enable Absolute Row Count policy |
DATAOPS_ACCELDATA_ENABLE_ROW_COUNT_DRIFT | Optional, defaults to false | Enable Row Count Drift policy |
DATAOPS_ACCELDATA_ENABLE_ABSOLUTE_ASSET_SIZE | Optional, defaults to true | Enable Absolute Asset Size policy |
DATAOPS_ACCELDATA_ENABLE_ASSET_SIZE_DRIFT | Optional, defaults to false | Enable Asset Size Drift policy |
DATAOPS_ACCELDATA_PROFILING_TYPE | Optional, defaults to FULL | Profiling type: FULL or INCREMENTAL |
DATAOPS_ACCELDATA_INCREMENTAL_STRATEGY_TYPE | Optional | Strategy: id, datetime, or partition. See Incremental Profiling |
DATAOPS_ACCELDATA_INCREMENTAL_STRATEGY_COLUMN | Optional | Column name for incremental strategy |
DATAOPS_ACCELDATA_INCREMENTAL_STRATEGY_FORMAT | Optional, defaults to yyyy-mm-dd | Date/time format |
DATAOPS_ACCELDATA_TABLE_INCREMENTAL_COLUMNS | Optional | JSON mapping table names to columns |
ADOC_RUN_CRAWLER | Optional, auto-enabled | Enable/disable metadata crawler |
ADOC_CRAWLER_START_TIMEOUT | Optional, defaults to 120 | Crawler start timeout (seconds) |
ADOC_CRAWLER_COMPLETION_TIMEOUT | Optional, defaults to 1800 | Crawler completion timeout (seconds) |
Freshness Policy Threshold Parameters
| Parameter | Required/Default | Description |
|---|---|---|
DATAOPS_ACCELDATA_FRESHNESS_LOOKBACK_WINDOW | Optional, defaults to 24 | Data freshness lookback window |
DATAOPS_ACCELDATA_FRESHNESS_LOOKBACK_WINDOW_TYPE | Optional, defaults to HOURS | Window type: HOURS or DAYS |
DATAOPS_ACCELDATA_ROW_COUNT_LOOKBACK_WINDOW | Optional, defaults to 4 | Row count lookback window |
DATAOPS_ACCELDATA_ROW_COUNT_LOOKBACK_WINDOW_TYPE | Optional, defaults to HOURS | Window type: HOURS or DAYS |
DATAOPS_ACCELDATA_ROW_COUNT_CHANGE_THRESHOLD | Optional, defaults to 10.0 | Row count change threshold (%) |
DATAOPS_ACCELDATA_ROW_COUNT_DRIFT_LOOKBACK_WINDOW | Optional, defaults to 4 | Row count drift lookback window |
DATAOPS_ACCELDATA_ROW_COUNT_DRIFT_LOOKBACK_WINDOW_TYPE | Optional, defaults to HOURS | Window type: HOURS or DAYS |
DATAOPS_ACCELDATA_ROW_COUNT_DRIFT_THRESHOLD | Optional, defaults to 10.0 | Row count drift threshold (%) |
DATAOPS_ACCELDATA_ASSET_SIZE_LOOKBACK_WINDOW | Optional, defaults to 24 | Asset size lookback window |
DATAOPS_ACCELDATA_ASSET_SIZE_LOOKBACK_WINDOW_TYPE | Optional, defaults to HOURS | Window type: HOURS or DAYS |
DATAOPS_ACCELDATA_ASSET_SIZE_CHANGE_THRESHOLD | Optional, defaults to 10.0 | Asset size change threshold (%) |
DATAOPS_ACCELDATA_ASSET_SIZE_DRIFT_LOOKBACK_WINDOW | Optional, defaults to 24 | Asset size drift lookback window |
DATAOPS_ACCELDATA_ASSET_SIZE_DRIFT_LOOKBACK_WINDOW_TYPE | Optional, defaults to HOURS | Window type: HOURS or DAYS |
DATAOPS_ACCELDATA_ASSET_SIZE_DRIFT_THRESHOLD | Optional, defaults to 10.0 | Asset size drift threshold (%) |
For multi-environment deployments, append _PROD or _QA to: DATAOPS_ACCELDATA_HOST, DATAOPS_ACCELDATA_ACCESS_KEY, DATAOPS_ACCELDATA_SECRET_KEY, DATAOPS_ACCELDATA_ASSEMBLY_NAME, DATAOPS_ACCELDATA_NOTIFICATION_CHANNEL_ID
Incremental Profiling
Configure incremental profiling strategies for efficient data processing:
ID-Based Strategy:
DATAOPS_ACCELDATA_PROFILING_TYPE: "INCREMENTAL"
DATAOPS_ACCELDATA_INCREMENTAL_STRATEGY_TYPE: "id"
DATAOPS_ACCELDATA_INCREMENTAL_STRATEGY_COLUMN: "customer_id"
Datetime Strategy:
DATAOPS_ACCELDATA_PROFILING_TYPE: "INCREMENTAL"
DATAOPS_ACCELDATA_INCREMENTAL_STRATEGY_TYPE: "datetime"
DATAOPS_ACCELDATA_INCREMENTAL_STRATEGY_COLUMN: "created_date"
DATAOPS_ACCELDATA_INCREMENTAL_STRATEGY_FORMAT: "yyyy-MM-dd"
Per-Table Configuration:
DATAOPS_ACCELDATA_TABLE_INCREMENTAL_COLUMNS: |
{
"CUSTOMERS": "updated_at",
"ORDERS": "order_date"
}
Example jobs
Basic DQ Policy Creation
"Acceldata DQ Policies":
extends:
- .agent_tag
stage: "Data Quality"
image: $DATAOPS_ACCELDATA_RUNNER_IMAGE
variables:
DATAOPS_ACCELDATA_HOST: "your-host.acceldata.com"
DATAOPS_ACCELDATA_ACCESS_KEY: DATAOPS_VAULT(ACCELDATA.ACCESS_KEY)
DATAOPS_ACCELDATA_SECRET_KEY: DATAOPS_VAULT(ACCELDATA.SECRET_KEY)
DATAOPS_ACCELDATA_ASSEMBLY_NAME: "Your_Assembly"
SNOWFLAKE_DATABASE: "YOUR_DATABASE"
script:
- /dataops
icon: ${ACCELDATA_ICON}
With Standard Policies
"Acceldata DQ with Monitoring":
extends:
- .agent_tag
stage: "Data Quality"
image: $DATAOPS_ACCELDATA_RUNNER_IMAGE
variables:
DATAOPS_ACCELDATA_HOST: "your-host.acceldata.com"
DATAOPS_ACCELDATA_ACCESS_KEY: DATAOPS_VAULT(ACCELDATA.ACCESS_KEY)
DATAOPS_ACCELDATA_SECRET_KEY: DATAOPS_VAULT(ACCELDATA.SECRET_KEY)
DATAOPS_ACCELDATA_ASSEMBLY_NAME: "Your_Assembly"
DATAOPS_ACCELDATA_NOTIFICATION_CHANNEL_ID: "11345"
SNOWFLAKE_DATABASE: "YOUR_DATABASE"
DATAOPS_ACCELDATA_ENABLE_STANDARD_POLICIES: "true"
DATAOPS_ACCELDATA_FRESHNESS_POLICY_TYPES: "data_freshness,row_count,asset_size"
script:
- /dataops
icon: ${ACCELDATA_ICON}
List Policies
"List Acceldata Policies":
extends:
- .agent_tag
stage: "Data Quality"
image: $DATAOPS_ACCELDATA_RUNNER_IMAGE
variables:
DATAOPS_ACCELDATA_HOST: "your-host.acceldata.com"
DATAOPS_ACCELDATA_ACCESS_KEY: DATAOPS_VAULT(ACCELDATA.ACCESS_KEY)
DATAOPS_ACCELDATA_SECRET_KEY: DATAOPS_VAULT(ACCELDATA.SECRET_KEY)
DATAOPS_ACCELDATA_ASSEMBLY_NAME: "Your_Assembly"
ADOC_POLICY_ACTION: "LIST"
script:
- /dataops
icon: ${ACCELDATA_ICON}
Execute Policies
"Execute Acceldata Policies":
extends:
- .agent_tag
stage: "Data Quality"
image: $DATAOPS_ACCELDATA_RUNNER_IMAGE
variables:
DATAOPS_ACCELDATA_HOST: "your-host.acceldata.com"
DATAOPS_ACCELDATA_ACCESS_KEY: DATAOPS_VAULT(ACCELDATA.ACCESS_KEY)
DATAOPS_ACCELDATA_SECRET_KEY: DATAOPS_VAULT(ACCELDATA.SECRET_KEY)
DATAOPS_ACCELDATA_ASSEMBLY_NAME: "Your_Assembly"
ADOC_POLICY_ACTION: "EXECUTE"
ADOC_RULE_IDS: "123,456,789"
script:
- /dataops
icon: ${ACCELDATA_ICON}
Get Policy Assets
"Get Policy Assets":
extends:
- .agent_tag
stage: "Data Quality"
image: $DATAOPS_ACCELDATA_RUNNER_IMAGE
variables:
DATAOPS_ACCELDATA_HOST: "your-host.acceldata.com"
DATAOPS_ACCELDATA_ACCESS_KEY: DATAOPS_VAULT(ACCELDATA.ACCESS_KEY)
DATAOPS_ACCELDATA_SECRET_KEY: DATAOPS_VAULT(ACCELDATA.SECRET_KEY)
DATAOPS_ACCELDATA_ASSEMBLY_NAME: "Your_Assembly"
ADOC_POLICY_ACTION: "GET_ASSETS"
ADOC_RULE_IDS: "123,456,789"
script:
- /dataops
icon: ${ACCELDATA_ICON}
Delete Policies
"Delete Acceldata Policies":
extends:
- .agent_tag
stage: "Data Quality"
image: $DATAOPS_ACCELDATA_RUNNER_IMAGE
variables:
DATAOPS_ACCELDATA_HOST: "your-host.acceldata.com"
DATAOPS_ACCELDATA_ACCESS_KEY: DATAOPS_VAULT(ACCELDATA.ACCESS_KEY)
DATAOPS_ACCELDATA_SECRET_KEY: DATAOPS_VAULT(ACCELDATA.SECRET_KEY)
DATAOPS_ACCELDATA_ASSEMBLY_NAME: "Your_Assembly"
ADOC_POLICY_ACTION: "DELETE"
ADOC_RULE_IDS: "123,456,789"
script:
- /dataops
icon: ${ACCELDATA_ICON}