How-To Guides

This section offers several valuable and useful "How-To" guides:

📄️ Add Constraints to Databases

How-to guide for adding primary key constraints in tables and views

📄️ Add Tables/Views to an Existing Share

How-to guide for adding tables or views to an existing share in Snowflake using the MATE (Modelling and Transformation) orchestrator

📄️ Auto-Cancel Duplicate Pipelines

How-to guide for configuring a check job to abort a pipeline if another is already running in the same ref

📄️ Auto-Cancel Running Queries

How-to guide for automatically canceling any running Snowflake queries in the event of pipeline failure

📄️ Build Only Changed MATE Models

How-to guide for adding a runner script that builds or tests only the models that have been changed since the previous pipeline commit

📄️ Change Column Datatype in a Table with data

How-to guide for changing the data type of a column of a table that is already populated with data

📄️ Clone Schemas in Feature Branches

How-to guide for cloning in feature branches only the schemas specified in a configurable list

📄️ Configure a Production-Only Runner

How-to guide for creating a DataOps runner that only runs jobs in the production environment on the main branch

📄️ Configure Select Statement in a Snowflake PIPE

How-to guide for configuring the select_statement parameter of the Snowflake PIPE object using the Snowflake Lifecycle Engine.

📄️ Create Incremental Models in MATE

How-to guide for incrementally loading data from a staging table to a history table in the curation layer in a DataOps pipeline with the MATE orchestrator

📄️ Create a Custom Before Script

How-to guide for creating a custom before_script that sets dynamic variables to be used by the app and script that run in the job

📄️ Create a Custom Orchestrator Image

How-to guide for creating a custom DataOps.live orchestrator packing your prebuilt business logic and extending existing orchestrators or using generic docker images

📄️ Create a Custom Reference Project

How-to guide for creating a custom reference project that helps to avoid the needless repetition of content

📄️ Network rules with SOLE for Data Products

How-to guide for defining Snowflake network rules with SOLE for Data Products.

📄️ Detect if a File has Changed

How-to guide for triggering a specific action in a pipeline only if the state of a file has changed within a job

📄️ Create Fallback Jobs

How-to guide for creating fallback jobs. When a job fails, you can specify another image to perform the task.

📄️ Git Clone over HTTPS

How-to guide for performing a Git clone over HTTPS

📄️ Git Clone when SSO is Enabled

How-to guide for performing a Git clone when SSO is enabled where you need to create a personal access token for users that do not have a password set

📄️ Host a dbt Package

How-to guide for hosting a dbt package in the DataOps.live data product platform to easily manage common macros, models, and other modeling and transformation resources

📄️ Configure the Runner Health Check Script

How-to guide for configuring the health check script to monitor your DataOps runner.

📄️ Manage Log Output

How-to guide for configuring logging in DataOps.live when running pipelines and log size recommendations

📄️ Manage Multiple Databases

How-to guide for creating and working with more than just the default DataOps database in SOLE and MATE.

📄️ SOLE June 2025

Guide on migrating from SOLE prior June 2025 to SOLE June 2025 and later in DataOps projects

📄️ Mix and Match Database Roles with Account Roles

How-to guide for using Snowflake database roles and account roles with SOLE to improve overall security and adhere to strict naming converntions.

📄️ Pass Variables from a Pipeline to MATE

How-to guide for setting up a pipeline job to pass a variable from the DataOps pipeline to a MATE model

📄️ YAML CI/CD Config File Wildcards

How-to guide for supporting wildcards to break down your YAML configuration files into multiple smaller files and improve reusability

📄️ Prevent DataOps Pipelines from Running Concurrently

How-to guide for ensuring that a new DataOps pipeline does not start before the current pipeline run is completed

📄️ Automatic Retry of Failed Jobs

How-to guide for automatically retrying a failed job and self-healing pipelines. Showing a configuration for setting the number of retry attempts

📄️ Runner Disk Space Maintenance

How-to guide for keeping the disk space usage of your DataOps Runner within a quota

📄️ Git Error - Server Certificate Verification Failed

How-to guide for solving the error server certificate verification failed. CAfile: none CRLfile: none

📄️ Serve MATE Pipeline Documentation Offline

How-to guide for retrieving, storing, and restoring MATE documentation for future offline browsing

📄️ How to Set up Git Pre-Commit Hooks for a DataOps Project

How-to guide for setting up git pre-commit hooks automating quality checks of code and data before committing. Save development time and reduce errors.

📄️ Set up Multiple Pull Policies on the DataOps Runner

How-to guide for adding multiple pull policies. They provide mitigation strategies for container registry downtimes.

📄️ Using Snowflake Iceberg tables with SOLE for Data Products

How-to guide for using Snowflake Iceberg Table with SOLE for Data Product in DataOps.live.

📄️ Use a Third-Party Git Repository

How-to guide for using an external Git repository like GitHub, Bitbucket, or Azure DevOps for central code management

📄️ Update Tags on Existing Runners

How-to guide for updating the tags of existing runners

📄️ Use Datetime and Time Modules in Jinja

How-to guide using Python's datetime and time modules in your Jinja templates and dbt models.

📄️ Use Parent-Child Pipelines

How-to guide for triggering one pipeline from another using variables between parent and child pipelines

📄️ Use Snowflake Tags

How-to guide for creating and applying Snowflake tags in SOLE and MATE

📄️ Use SSH with Git

How-to guide for using SSH in DataOps.live to communicate with Git securely