Documentation

Azure Blob Storage

Updated on

Jan 29, 2024

IMPORTANT: This article covers setup of warehouse for load data from Improvado, not customer data warehouse from which data is being extracted. This article doesn't cover setup of customer data warehouse for Data Prep as well.

Required information

  • Title
  • Account URL
  • ~Account URL must satisfy the following regular expression: ```https://[a-z0-9]*\.blob\.core\.windows\.net$```
  • ~The ```[a-z0-9]``` part of Account URL must be between 3 and 24 characters in length
  • SAS Token

{% docs-informer info title="Important"%}

Learn how to create a SAS Token with this guide.

{% docs-informer-end%}

Encryption

Possible options:

  • No encryption (default cloud storage encryption is still enabled)
  • Customer-provided keys

Encryption Key

If you have selected the Default Cloud Storage encryption type, you will not be able to edit this field.

Otherwise, you should enter your AES-256 key, encoded in standard Base64 or resource name of Cloud KMS key used to encrypt the blob’s contents. For more info, see Azure Blob Storage encryption docs.

Folder

Possible parameters:

```/{{data_source}}/{{data_table_title}}/{{report_type}}/{{YYYY}}/{{MM}}/{{DD}}/{{timestamp}}```

  • ```{{data_source}}``` is a data provider, integration, connector
  • ```{{data_table_title}}``` is an object that contains all extraction orders with the same granularity (dimensional schema)
  • ```{{report_type}}``` is a set of such fields as metrics, properties, dimensions, etc.
  • ```{{timestamp}}``` is the date and time when data load started

If you use ```/{{YYYY}}/{{MM}}/{{DD}}``` settings, the data will be added to folders daily. Each new record will not delete the previous one, even for data that contains no date.

File format

Possible formats:

  • csv
  • csv+gzip
  • json
  • json+gzip
  • parquet

File name

Possible parameters:

```{{filename}}-{{YYYY}}-{{MM}}-{{DD}}```

  • ```{{filename}}``` is the same as destination table name

IMPORTANT: you cannot use {{DD}} for partition by month

  • ```{{filename}}-{{YYYY}}-{{MM}}-{{DD}}``` – for partition by day
  • ```{{filename}}-{{YYYY}}-{{MM}}``` – for partition by month

Also, you can use “_” instead of “-” or do not use any symbols at all, for example:

  • ```{{filename}}_{{YYYY}}-{{MM}}-{{DD}}```
  • ```{{filename}}{{YYYY}}{{MM}}{{DD}}```

Separator

Possible delimiters that can separate data in your file:

  • comma
  • semicolon
  • tab

Partition by

Possible ways of splitting data:

  • Day
  • Month

Use static IP

Select Yes for Use static IP option if you allow Improvado to connect your database by the static IPs mentioned on the Destination connection page.

Select No if you have permitted access to your database from any IP. In this case, Improvado will connect your database using dynamic IPs not listed on the Destination connection page.

Schema information

Setup guide

Settings

No items found.

Troubleshooting

Troubleshooting guides

Check out troubleshooting guides for
Azure Blob Storage
here

Limits

Frequently asked questions

No items found.
☶ On this page
Description
Related articles
No items found.
No items found.

Questions?

Improvado team is always happy to help with any other questions you might have! Send us an email.

Contact your Customer Success Manager or raise a request in Improvado Service Desk.