steampipe plugin install azure

Table: azure_databricks_workspace - Query Azure Databricks Workspaces using SQL

Azure Databricks Workspace is a feature within Microsoft Azure that offers an interactive workspace for big data analytics and machine learning. It provides a centralized environment for collaborative projects, allowing users to write in multiple languages, visualize data, and share insights. Azure Databricks Workspace supports the full lifecycle of big data analytics, from data preparation to exploration, and from model training to production.

Table Usage Guide

The azure_databricks_workspace table provides insights into Azure Databricks Workspaces within Microsoft Azure. As a data scientist or data analyst, you can explore workspace-specific details through this table, including configuration, status, and properties of each workspace. Use it to uncover information about workspaces, such as their location, SKU, managed private network, and provisioning status.
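As a starting point, the details mentioned above can be pulled together in one query. This is a minimal sketch that uses only columns listed in the schema for this table; the ordering is an arbitrary choice for readability.

```sql
-- Overview of each workspace: where it lives and whether it provisioned.
-- All column names come from the azure_databricks_workspace schema.
select
  name,
  region,
  resource_group,
  provisioning_state,
  managed_resource_group_id
from
  azure_databricks_workspace
order by
  region,
  name;
```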

Examples

Basic info

Explore the various Azure Databricks workspaces within your organization to gain insights into their creation dates and associated SKU details. This can be useful for tracking resource usage and understanding your workspace configuration.

Postgres:

  select
    name,
    id,
    workspace_id,
    workspace_url,
    created_date_time,
    sku
  from
    azure_databricks_workspace;

SQLite:

  select
    name,
    id,
    workspace_id,
    workspace_url,
    created_date_time,
    sku
  from
    azure_databricks_workspace;
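Because sku is a jsonb column, individual fields can be extracted instead of returning the whole object. The sketch below uses Postgres JSON operators and assumes the SKU object exposes name and tier keys; that key naming is an assumption, so inspect the raw sku value in your own data first. On SQLite, json_extract(sku, '$.name') would be the equivalent form.

```sql
-- Assumption: the sku JSON object contains "name" and "tier" keys.
-- Run "select sku from azure_databricks_workspace limit 1;" to confirm.
select
  name,
  sku ->> 'name' as sku_name,
  sku ->> 'tier' as sku_tier
from
  azure_databricks_workspace;
```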

List workspaces created in the last 30 days

Find the workspaces that were created within the past 30 days. This is useful for keeping track of new workspaces and managing growth in your environment.

Postgres:

  select
    name,
    id,
    workspace_id,
    workspace_url,
    created_date_time,
    sku
  from
    azure_databricks_workspace
  where
    created_date_time >= now() - interval '30' day;

SQLite:

  select
    name,
    id,
    workspace_id,
    workspace_url,
    created_date_time,
    sku
  from
    azure_databricks_workspace
  where
    created_date_time >= datetime('now', '-30 days');

List failed workspaces

Identify the Azure Databricks workspaces whose provisioning failed. This is useful for troubleshooting workspaces that were not successfully provisioned.

Postgres:

  select
    name,
    id,
    workspace_id,
    workspace_url,
    created_date_time,
    sku
  from
    azure_databricks_workspace
  where
    provisioning_state = 'Failed';

SQLite:

  select
    name,
    id,
    workspace_id,
    workspace_url,
    created_date_time,
    sku
  from
    azure_databricks_workspace
  where
    provisioning_state = 'Failed';
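To see how failures compare with other states, workspaces can also be grouped by provisioning_state. This is a minimal sketch using only columns from this table's schema; it uses standard SQL and should run unchanged on both Postgres and SQLite.

```sql
-- Count workspaces per provisioning state, most common state first.
select
  provisioning_state,
  count(*) as workspace_count
from
  azure_databricks_workspace
group by
  provisioning_state
order by
  workspace_count desc;
```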

List all encrypted workspaces

Identify instances where workspaces in Azure Databricks are encrypted. This is useful for ensuring data security and compliance with encryption standards.

Postgres:

  select
    name,
    id,
    workspace_id,
    workspace_url,
    created_date_time,
    sku
  from
    azure_databricks_workspace
  where
    parameters -> 'Encryption' is not null;

SQLite:

  select
    name,
    id,
    workspace_id,
    workspace_url,
    created_date_time,
    sku
  from
    azure_databricks_workspace
  where
    json_extract(parameters, '$.Encryption') is not null;

List workspaces that allow public IP

Identify the Azure Databricks workspaces that are configured to allow public IP access, that is, where the enableNoPublicIp parameter is set to false. This query can be used to identify potential security vulnerabilities and ensure best practices for data protection.

Postgres:

  select
    name,
    id,
    workspace_id,
    workspace_url,
    created_date_time,
    sku
  from
    azure_databricks_workspace
  where
    parameters -> 'enableNoPublicIp' ->> 'value' = 'false';

SQLite:

  select
    name,
    id,
    workspace_id,
    workspace_url,
    created_date_time,
    sku
  from
    azure_databricks_workspace
  where
    json_extract(
      json_extract(parameters, '$.enableNoPublicIp'),
      '$.value'
    ) = 'false';

Schema for azure_databricks_workspace

| Name | Type | Operators | Description |
|------|------|-----------|-------------|
| _ctx | jsonb | | Steampipe context in JSON form. |
| akas | jsonb | | Array of globally unique identifier strings (also known as) for the resource. |
| authorizations | jsonb | | The workspace provider authorizations. |
| cloud_environment | text | | The Azure Cloud Environment. |
| created_by | jsonb | | Indicates the Object ID, PUID and Application ID of entity that created the workspace. |
| created_date_time | timestamp with time zone | | Specifies the date and time when the workspace is created. |
| id | text | | Fully qualified resource ID for the resource. |
| location | text | | The geo-location where the resource lives. |
| managed_resource_group_id | text | | The managed resource group ID. |
| name | text | = | The resource name. |
| parameters | jsonb | | The workspace's custom parameters. |
| provisioning_state | text | | The workspace provisioning state. |
| region | text | | The Azure region/location in which the resource is located. |
| resource_group | text | = | The resource group which holds this resource. |
| sku | jsonb | | The SKU of the resource. |
| sp_connection_name | text | =, !=, ~~, ~~*, !~~, !~~* | Steampipe connection name. |
| sp_ctx | jsonb | | Steampipe context in JSON form. |
| storage_account_identity | jsonb | | The details of Managed Identity of Storage Account |
| subscription_id | text | =, !=, ~~, ~~*, !~~, !~~* | The Azure Subscription ID in which the resource is located. |
| tags | jsonb | | A map of tags for the resource. |
| title | text | | Title of the resource. |
| type | text | | The type of the resource. |
| ui_definition_uri | text | | The blob URI where the UI definition file is located. |
| updated_by | jsonb | | Indicates the Object ID, PUID and Application ID of entity that last updated the workspace. |
| workspace_id | text | | The unique identifier of the databricks workspace in databricks control plane. |
| workspace_url | text | | The workspace URL which is of the format 'adb-{workspaceId}.{random}.azuredatabricks.net'. |
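The Operators column lists = for name and resource_group, which suggests that equality filters on these columns can be handed to the plugin rather than applied after all workspaces are listed. A minimal sketch of a single-workspace lookup follows; 'my-workspace' and 'my-resource-group' are hypothetical placeholder values, not names taken from this documentation.

```sql
-- Look up one workspace by name and resource group.
-- 'my-workspace' and 'my-resource-group' are placeholders; substitute your own.
select
  name,
  resource_group,
  region,
  provisioning_state,
  workspace_url
from
  azure_databricks_workspace
where
  name = 'my-workspace'
  and resource_group = 'my-resource-group';
```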

Export

This table is available as a standalone Exporter CLI. Steampipe exporters are stand-alone binaries that allow you to extract data using Steampipe plugins without a database.

You can download the tarball for your platform from the Releases page, but it is simplest to install them with the steampipe_export_installer.sh script:

/bin/sh -c "$(curl -fsSL https://steampipe.io/install/export.sh)" -- azure

You can pass the configuration to the command with the --config argument:

steampipe_export_azure --config '<your_config>' azure_databricks_workspace