Table: pipes_workspace_pipeline - Query Pipes Workspace Pipelines using SQL
A Pipes Workspace Pipeline is a crucial component within the Pipes service that allows for the execution of a sequence of data processing steps. These pipelines can be used to extract, transform, and load (ETL) data from various sources into a target destination. They are highly configurable and can be managed and monitored for efficient data processing and analytics.
Table Usage Guide
The pipes_workspace_pipeline
table provides insights into the Workspace Pipelines within the Pipes service. As a Data Engineer, explore pipeline-specific details through this table, including pipeline configuration, status, and related metadata. Utilize it to uncover information about pipelines, such as their current execution status, configuration details, and to manage and monitor them effectively.
Important Notes
This table supports optional quals. Queries with optional quals in the
where
clause are optimised to use Turbot Pipes filters.Optional quals are supported for the following columns:
created_at
id
identity_handle
identity_id
pipeline
query_where
- Allows use of query filters. For a list of supported columns for pipelines, please see Supported APIs and Columns. Please note that any query filter passed into thequery_where
qual will be combined with other optional quals.title
updated_at
workspace_handle
workspace_id
Examples
Basic info
Explore which workspace pipelines are active, how frequently they run, and any associated tags. This can be useful in understanding the flow of data and identifying any potential bottlenecks in the pipeline.
select id, identity_handle, workspace_handle, title, frequency, pipeline, args, tags, last_process_idfrom pipes_workspace_pipeline;
select id, identity_handle, workspace_handle, title, frequency, pipeline, args, tags, last_process_idfrom pipes_workspace_pipeline;
List pipelines for a specific workspace
Discover the segments that are part of a particular workspace in order to understand how frequently they run, their specific parameters, and their latest process. This is useful for assessing the operational efficiency and identifying any potential bottlenecks within a specific workspace.
select id, identity_handle, workspace_handle, title, frequency, pipeline, args, tags, last_process_idfrom pipes_workspace_pipelinewhere workspace_handle = 'dev';
select id, identity_handle, workspace_handle, title, frequency, pipeline, args, tags, last_process_idfrom pipes_workspace_pipelinewhere workspace_handle = 'dev';
List pipelines of frequency type interval
for a specific workspace
Analyze the settings to understand the pipelines configured to run at regular intervals in a specific workspace. This can be useful in managing and optimizing scheduled tasks within your workspace.
select id, identity_handle, workspace_handle, title, frequency, pipeline, args, tags, last_process_idfrom pipes_workspace_pipelinewhere workspace_handle = 'dev' and frequency ->> 'type' = 'interval';
select id, identity_handle, workspace_handle, title, frequency, pipeline, args, tags, last_process_idfrom pipes_workspace_pipelinewhere workspace_handle = 'dev' and json_extract(frequency, '$.type') = 'interval';
List pipelines for the AWS Compliance CIS v1.4.0
dashboard created in the last 7 days
Explore which pipelines have been created in the past week for the AWS Compliance CIS v1.4.0 dashboard. This is useful for auditing recent changes and ensuring compliance standards are being met.
select id, identity_handle, workspace_handle, title, frequency, pipeline, args, tags, last_process_idfrom pipes_workspace_pipelinewhere args ->> 'resource' = 'aws_compliance.benchmark.cis_v140' and created_at >= now() - interval '7 days';
select id, identity_handle, workspace_handle, title, frequency, pipeline, args, tags, last_process_idfrom pipes_workspace_pipelinewhere json_extract(args, '$.resource') = 'aws_compliance.benchmark.cis_v140' and created_at >= datetime('now', '-7 days');
List pipelines for the AWS Compliance CIS v1.4.0
dashboard created in the last 7 days using query filter
Explore recently created pipelines for a specific compliance dashboard. This is particularly useful for tracking the latest updates and changes made within the past week.
select id, identity_handle, workspace_handle, title, frequency, pipeline, args, tags, last_process_idfrom pipes_workspace_pipelinewhere query_where = 'title = ''Scheduled snapshot: CIS v1.4.0'' and created_at >= now() - interval ''7 days''';
select id, identity_handle, workspace_handle, title, frequency, pipeline, args, tags, last_process_idfrom pipes_workspace_pipelinewhere query_where = 'title = ''Scheduled snapshot: CIS v1.4.0'' and created_at >= datetime(' now ', ' -7 days ')';
Schema for pipes_workspace_pipeline
Name | Type | Operators | Description |
---|---|---|---|
_ctx | jsonb | Steampipe context in JSON form. | |
args | jsonb | Arguments to be passed to the pipeline. | |
created_at | timestamp with time zone | >, >=, =, <, <= | The time when the pipeline was created. |
created_by | jsonb | Information about the user who created the pipeline. | |
created_by_id | text | The unique identifier of the user who created the pipeline. | |
frequency | jsonb | The frequency at which the pipeline will be executed. | |
id | text | =, != | The unique identifier for the pipeline. |
identity_handle | text | = | The handle of the identity. |
identity_id | text | = | The unique identifier of the identity to which the pipeline belongs to. |
identity_type | text | The type of identity, can be org/user. | |
last_process | jsonb | Information about the process that was last executed for the pipeline. | |
last_process_id | text | The unique identifier of the last process that was executed for the pipeline. | |
pipeline | text | =, != | The name of the pipeline to be executed. |
query_where | text | = | The query where expression to filter pipelines. |
sp_connection_name | text | =, !=, ~~, ~~*, !~~, !~~* | Steampipe connection name. |
sp_ctx | jsonb | Steampipe context in JSON form. | |
tags | jsonb | The tags for the pipeline. | |
title | text | =, != | The title of the pipeline. |
updated_at | timestamp with time zone | >, >=, =, <, <= | The time when the pipeline was last updated. |
updated_by | jsonb | Information about the user who last updated the pipeline. | |
updated_by_id | text | The unique identifier of the user who last updated the pipeline. | |
user_id | text | =, !=, ~~, ~~*, !~~, !~~* | The unique identifier for the user. |
version_id | bigint | The current version ID for the pipeline. | |
workspace_handle | text | = | The handle of the workspace. |
workspace_id | text | = | The unique identifier for the workspace. |
Export
This table is available as a standalone Exporter CLI. Steampipe exporters are stand-alone binaries that allow you to extract data using Steampipe plugins without a database.
You can download the tarball for your platform from the Releases page, but it is simplest to install them with the steampipe_export_installer.sh
script:
/bin/sh -c "$(curl -fsSL https://steampipe.io/install/export.sh)" -- pipes
You can pass the configuration to the command with the --config
argument:
steampipe_export_pipes --config '<your_config>' pipes_workspace_pipeline