Table: azure_data_lake_store - Query Azure Data Lake Store using SQL
Azure Data Lake Store is a hyper-scale repository for big data analytic workloads in Azure. It allows you to store and analyze petabyte-size files and trillions of objects. Azure Data Lake Store offers high-speed integration with Azure HDInsight, Azure Data Factory, and Azure Machine Learning.
Table Usage Guide
The azure_data_lake_store table provides insights into Data Lake Store accounts, the data storage and analytics service in Azure. As a data engineer or data scientist, explore details about your Azure Data Lake Stores through this table, including their properties, encryption settings, and firewall rules. Use it to manage and monitor your data lake stores and to check that they are configured according to your organization's security and compliance policies.
Examples
Basic info
Explore the general information of your Azure Data Lake Store resources to understand their current state and type. This can help in monitoring the provisioning status and managing these resources effectively.
select
  name,
  id,
  type,
  provisioning_state
from
  azure_data_lake_store;
List data lake stores with encryption disabled
Explore which Azure data lake stores have disabled encryption, a potential security risk. This can be useful in auditing and improving your data security measures.
select
  name,
  id,
  type,
  provisioning_state
from
  azure_data_lake_store
where
  encryption_state = 'Disabled';
List data lake stores with firewall disabled
Identify the Azure Data Lake Stores whose firewall is disabled so that you can spot potential security risks and take the necessary precautions. This is useful for confirming that network access controls are in place and for reducing the risk of data breaches.
select
  name,
  id,
  type,
  provisioning_state
from
  azure_data_lake_store
where
  firewall_state = 'Disabled';
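List data lake stores with encryption or firewall disabled
The two checks above can be combined into a single audit query. This is a minimal sketch rather than one of the original examples: it assumes that 'Disabled' is the value reported for both states (as in the queries above) and uses only columns listed in the schema. It also returns both state columns so the failing setting is visible in the result.

```sql
-- Sketch: stores failing either security check.
-- Assumes 'Disabled' is the reported value for both state columns.
select
  name,
  resource_group,
  encryption_state,
  firewall_state
from
  azure_data_lake_store
where
  encryption_state = 'Disabled'
  or firewall_state = 'Disabled';
```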
Schema for azure_data_lake_store
Name | Type | Operators | Description |
---|---|---|---|
_ctx | jsonb | | Steampipe context in JSON form. |
account_id | text | | The unique identifier associated with this data lake store account. |
akas | jsonb | | Array of globally unique identifier strings (also known as) for the resource. |
cloud_environment | text | | The Azure Cloud Environment. |
creation_time | timestamp with time zone | | The account creation time. |
current_tier | text | | The commitment tier in use for current month. |
default_group | text | | The default owner group for all new folders and files created in the data lake store account. |
diagnostic_settings | jsonb | | A list of active diagnostic settings for the data lake store. |
encryption_config | jsonb | | The key vault encryption configuration. |
encryption_provisioning_state | text | | The current state of encryption provisioning for this data lake store account. |
encryption_state | text | | The current state of encryption for this data lake store account. |
endpoint | text | | The full cname endpoint for this account. |
firewall_allow_azure_ips | jsonb | | The current state of allowing or disallowing IPs originating within azure through the firewall. If the firewall is disabled, this is not enforced. |
firewall_rules | jsonb | | The list of firewall rules associated with this data lake store account. |
firewall_state | text | | The current state of the IP address firewall for this data lake store account. |
id | text | | The resource identifier. |
identity | jsonb | | The key vault encryption identity, if any. |
last_modified_time | timestamp with time zone | | The account last modified time. |
name | text | = | The resource name. |
new_tier | text | | The commitment tier to use for next month. |
provisioning_state | text | | The provisioning status of the data lake store account. |
region | text | | The Azure region/location in which the resource is located. |
resource_group | text | = | The resource group which holds this resource. |
sp_connection_name | text | =, !=, ~~, ~~*, !~~, !~~* | Steampipe connection name. |
sp_ctx | jsonb | | Steampipe context in JSON form. |
state | text | | The state of the data lake store account. |
subscription_id | text | =, !=, ~~, ~~*, !~~, !~~* | The Azure Subscription ID in which the resource is located. |
tags | jsonb | | A map of tags for the resource. |
title | text | | Title of the resource. |
trusted_id_provider_state | text | | The current state of the trusted identity provider feature for this data lake store account. |
trusted_id_providers | jsonb | | The list of trusted identity providers associated with this data lake store account. |
type | text | | The resource type. |
virtual_network_rules | jsonb | | The list of virtual network rules associated with this data lake store account. |
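The jsonb columns in this schema can be expanded with standard PostgreSQL JSON functions. The query below is a hedged sketch, not part of the original documentation: it assumes firewall_rules holds a JSON array (the schema describes it as a list) and makes no assumption about the fields inside each rule, so each rule is returned as raw jsonb. The case expression guards against values that are not arrays.

```sql
-- Sketch: one row per firewall rule, per data lake store.
-- Assumes firewall_rules is a JSON array; non-array values yield no rows.
select
  name,
  rule
from
  azure_data_lake_store,
  jsonb_array_elements(
    case
      when jsonb_typeof(firewall_rules) = 'array' then firewall_rules
      else '[]'::jsonb
    end
  ) as rule;
```

Stores with no firewall rules do not appear in the result, because the expansion produces no rows for them.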
Export
This table is available as a standalone Exporter CLI. Steampipe exporters are stand-alone binaries that allow you to extract data using Steampipe plugins without a database.
You can download the tarball for your platform from the Releases page, but it is simplest to install the exporter with the steampipe_export_installer.sh script:
/bin/sh -c "$(curl -fsSL https://steampipe.io/install/export.sh)" -- azure
You can pass the configuration to the command with the --config argument:
steampipe_export_azure --config '<your_config>' azure_data_lake_store