GCP Big Query

GCP Big Query Destination Plugin

Google BigQuery is a fully-managed, cloud-native data warehouse that allows businesses to store, analyze, and query large datasets in a scalable and cost-effective manner. It is part of the Google Cloud Platform (GCP) suite of services.

BigQuery provides a serverless architecture, meaning that users do not need to worry about infrastructure provisioning, management, or tuning. It can process terabytes of data in seconds and petabytes of data in minutes, making it suitable for organizations that need to quickly process and analyze large amounts of data.

The GCP Big Query Suite Destination Plugin in Calyptia Core allows you to easily configure your pipeline to send your log data and metrics to GCP Big Query.

Configuration Parameters

The following are configuration Parameters for GCP Big Query Destination Plugin.

KeyDescription

Google Service Credentials Path

The Service Credentials file allows Calyptia Core to communicate directly with Google Cloud Services, please read the following on how to set up service credentials: https://cloud.google.com/logging/docs/agent/logging/authorization#create-service-account

Google Project Id

The project id containing the BigQuery dataset to stream into. If the service file is provided then the project id is taken from there

Existing Data Set ID

The dataset id of the BigQuery dataset to write into. This dataset must exist in your project.

Existing Table ID

The table id of the BigQuery table to write into. This table must exist in the specified dataset and the schema must match the output.

The following are Advanced configuration Parameters for GCP Big Query Destination Plugin.

KeyDescription

Skip Invalid Rows

If on then insert all valid rows of a request, even if invalid rows exist. The default value is false, which causes the entire request to fail if any invalid rows exist.

Ignore Unknown Values

Accept rows that contain values that do not match the schema. The unknown values are ignored. Default is false, which treats unknown values as errors.

Enable Workload Identity Federation

Enables workload identity federation as an alternative authentication method. Cannot be used with service account credentials file or environment variable. AWS is the only identity provider currently supported

GCP Region for BigQuery

GCP Region for BigQuery

GCP Project Number

GCP project number where the identity provider was created. Used to construct the full resource name of the identity provider.

GCP Pool Id

GCP workload identity pool where the identity provider was created. Used to construct the full resource name of the identity provider.

GCP Provider Id

GCP workload identity provider. Used to construct the full resource name of the identity provider. Currently only AWS accounts are supported.

The following are Security and TLS configuration Parameters for GCP Big Query Destination Plugin.

KeyDescription

TLS

Enable or Disable TLS/SSL support

TLS Certificate Validation

Turn TLS/SSL certificate validation on / off, TLS must be on for this setting to be enabled.

TLS Debug Level

Set TLS debug verbosity level. It accepts the following values: 0 (No debug), 1 (Error), 2 (State change), 3 (Informational), and 4 Verbose

CA Certificate File Path

Absolute path to CA certificate file

Certificate File Path

Absolute path to Certificate file

Private key File Path

Absolute path to private Key file

Private Key Path Password

Optional password for tls.key_file file

TLS SNI Hostname Extension

Hostname to be used for TLS SNI Extension

The following are Advanced Networking configuration Parameters for GCP Big Query Destination Plugin.

KeyDescription

DNS Mode

Select the primary DNS connection type (TCP or UDP)

DNS Resolver

Select the primary DNS connection type (TCP or UDP)

Prefer IPv4

Prioritize IPv4 DNS results when trying to establish a connection

Keepalive

Enable or disable Keepalive support

Keepalive Idle Timeout

Set maximum time allowed for an idle Keepalive connection

Max Connect Timeout

Set maximum time allowed to establish a connection, this time includes the TLS handshake

Max Connect Timeout Log Error

On connection timeout, specify if it should log an error. When disabled, the timeout is logged as a debug message

Max Keepalive Recycle

Set maximum number of times a keepalive connection can be used before it is retired.

Source Address

Specify network address to bind for data traffic

Last updated