A cross-tenant, metadata-driven processing framework for Azure Data Factory and Azure Synapse Analytics, achieved by coupling orchestration pipelines with a SQL database and a set of Azure Functions.

View the Project on GitHub mrpaulandrew/procfwk

Welcome to the Orchestrate .procfwk Home Page

This site provides details on the latest version of the processing framework (procfwk) code project, available on GitHub here, as a single source of all information needed to use and support this solution.

Please use the Contents page, also available in the sidebar, to navigate.

ProcFwk Has Become CF.Cumulus.Control

See blog: mrpaulandrew.com

See new product page: cloudformations.org/cumulus

ProcFwk will receive no further development beyond December 2023.

Code Project Overview

What is procfwk?

This open-source code project delivers a simple metadata-driven processing framework for Azure Data Factory and/or Azure Synapse Analytics (Integrate Pipelines). The framework is made possible by coupling the orchestration service with a SQL Database that houses execution batches, execution stages and pipeline metadata that is later called using an Azure Functions App. The execution batch, stage and worker pipeline structures allow concurrent overall batch handling, with the inner stages of process dependencies executed in sequence. Then, at the lowest level, all worker pipelines within a stage are executed in parallel, offering scaled-out control flows where no inter-dependencies exist.
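As a rough illustration of the stage and worker structure described above, the following minimal Python sketch runs stages in sequence and the workers inside each stage in parallel. It is not the framework's implementation: the `STAGES` dictionary, the pipeline names and the `run_worker` stand-in are all hypothetical, and in the real framework this metadata is held in the SQL Database and the workers are orchestrator pipelines.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical metadata: stage number -> worker pipeline names.
# The real framework stores this in SQL tables; these names are invented.
STAGES = {
    1: ["Ingest_Customers", "Ingest_Orders"],
    2: ["Transform_Sales"],
    3: ["Load_Warehouse", "Refresh_Model"],
}


def run_worker(pipeline_name: str) -> str:
    """Stand-in for triggering a worker pipeline and waiting for it to finish."""
    return f"{pipeline_name}: Succeeded"


def run_batch(stages: dict[int, list[str]]) -> list[tuple[int, list[str]]]:
    """Run stages in ascending order; run each stage's workers in parallel."""
    results = []
    for stage in sorted(stages):
        # Leaving the 'with' block waits for every worker in this stage,
        # so the next stage only starts once the current one has finished.
        with ThreadPoolExecutor() as pool:
            outcomes = list(pool.map(run_worker, stages[stage]))
        results.append((stage, outcomes))
    return results


if __name__ == "__main__":
    for stage, outcomes in run_batch(STAGES):
        print(f"Stage {stage}: {outcomes}")
```

The point of the sketch is the shape of the control flow: dependencies are expressed only between stages, so anything placed in the same stage is assumed to be independent and safe to scale out.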

The framework is designed to integrate with any existing set of modular processing pipelines by making the lowest level executor a standalone worker pipeline that is wrapped in a higher level of controlled (sequential) dependencies. This level of abstraction means that, operationally, nothing about the monitoring of the orchestration processes is hidden in multiple levels of dynamic activity calls. Instead, everything from the processing pipeline doing the work (the Worker) can be inspected using out-of-the-box Azure monitoring features.

This framework can also be used in any Azure Tenant and allows the creation of complex control flows across multiple orchestration resources, and even across Azure Tenants/Subscriptions, by connecting Service Principal details through metadata to targeted Tenants > Subscriptions > Resource Groups > Orchestration Services and Pipelines. This offers granular administration over any data processing components in a given environment from a single point of easily controlled metadata.
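To make the targeting hierarchy above more concrete, here is a small Python sketch of what one metadata record for a worker pipeline might carry. The `PipelineTarget` class, its field names and the example values are assumptions made for illustration only; they are not the framework's actual schema.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PipelineTarget:
    """Hypothetical metadata record locating one worker pipeline."""
    tenant_id: str
    subscription_id: str
    resource_group: str
    orchestrator_name: str
    pipeline_name: str
    # A reference to stored Service Principal details, never the secret itself.
    service_principal_ref: str


def resource_path(target: PipelineTarget) -> str:
    """Render the Tenant > Subscription > Resource Group > Orchestrator > Pipeline chain."""
    return " > ".join([
        target.tenant_id,
        target.subscription_id,
        target.resource_group,
        target.orchestrator_name,
        target.pipeline_name,
    ])


# Invented example values, used only to show the shape of the record.
example = PipelineTarget(
    tenant_id="tenant-a",
    subscription_id="sub-01",
    resource_group="rg-data",
    orchestrator_name="factory-dev",
    pipeline_name="Ingest_Orders",
    service_principal_ref="spn-tenant-a",
)

if __name__ == "__main__":
    print(resource_path(example))
```

Because each record carries its own tenant, subscription and credential reference, two workers in the same stage could, under these assumptions, run in entirely different Azure Tenants while still being administered from one set of metadata.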

Why use procfwk?

To understand why you might use a metadata-driven framework, please see the following YouTube video.

YouTube Video

Framework Capabilities

Deployment Steps

For details on how to deploy the processing framework to your Azure Tenant see Deploying ProcFwk.

Complete Orchestrator Activity Chain

The following offers a view of all pipeline activities at every level within the processing framework if flattened out onto a single canvas. This applies regardless of the orchestrator type.