AV | Google Cloud Storage

Integrations Guide > Design > Source and Target Connections > Google Cloud Storage

Was this helpful?

Google Cloud Storage

The Google Cloud Storage connector provides access to Google Cloud Storage buckets and objects in the Google Cloud Platform.

Prerequisites

You must have the Google Cloud account and credentials.

Connection Details

Specify the following details to define a connection:

Property	Description
Credentials	(Optional) The JSON credentials of the service account used to access the Cloud Storage service. This is the content of the credentials file that can be downloaded from the Cloud Storage console. Note: To define a connection, the concerned GCP account must have the permission for storage.buckets.list. Your credentials are authenticated successfully only if you have the permissions.

Source Details

Specify the following source details:

Property	Description
Bucket	Specifies the name of the cloud storage bucket to access. This is a mandatory property.
Folder	(Optional) Specifies the folder containing the object. Clicking on the refresh button lists folders within the current folder. Hierarchies of the folders are navigated by repeatedly clicking on the refresh button.
Object	Specifies the name of the object to access. This is a mandatory property.
Read Chunk Size (Bytes)	(Optional) Specifies the size of chunks, in bytes, in which to retrieve data from the source object. When the property is not specified, or is set to a non-positive value, the entire object data is retrieved in a single chunk.
Data Format	Specifies the data format. This is a mandatory property. Select one of the following as per the data format type: • CSV: The following properties must be configured: – Header: If enabled, specifies whether the CSV data contains a header row. – Quote: Specifies the quote character. Default value is ". – Delimiter: Specifies the field delimiter character. Default value is ,. – Escape: Specifies the escape character. – Separator: Specifies the row separator characters. Default value is \r\n. – Character Set: Specifies the character set of the data. Default value is UTF-8. – Sample Size: Specifies the number of records to analyze when determining the data structure. Default value is 100. • Excel: The following properties must be configured: – Header: If enabled, specifies whether the CSV data contains a header row. – Worksheet Index: Specifies the index of the worksheet. The index of the first worksheet is 1 (default). – Sample Size: Specifies the number of records to analyze when determining the data structure. Default value is 100. • Avro: The following properties must be configured: – Sample Size: Specifies the number of records to analyze when determining the data structure. Default value is 100. – Select Source Tables: Select which arrays of objects in the AVRO data you wish to map to tables in the target. • Parquet • JSON: The following properties must be configured: – Find Array: If enabled, specifies that the output will iterate on the first array found in the JSON document. – Array Path: Specifies the path of the JSON array within the document. Omit if the document is an array. For example,"/resources". – Sample Size: Specifies the number of records to analyze when determining the data structure. Default value is 100. – Select Source Tables: Select which arrays of objects in the JSON data you wish to map to tables in the target.
Limit	(Optional) Specifies the maximum number of records to return.
Batch Size	(Optional) Specifies the number of records to return per batch.

Additional Information

Limitations: The Google Cloud Storage connector only supports authentication for Google service accounts. Authentication for Google end-user accounts and API keys is not supported.

Last modified date: 05/15/2024