Ami*_*mit 3 google-bigquery google-cloud-platform terraform terraform-provider-gcp
我是 GCP 和 Terraform 的新手。我正在开发 terraform 脚本来配置大约 50 个 BQ 数据集,每个数据集至少有 10 个表。所有表都没有相同的架构。
我已经开发了脚本来创建数据集和表,但我面临着向表添加架构的挑战,我需要帮助。我正在利用 terraform 变量来构建脚本。
这是我的代码。我需要集成逻辑来创建表架构。
variable "test_bq_dataset" {
  type = list(object({
    id       = string
    location = string
  }))
}
variable "test_bq_table" {
  type = list(object({
    dataset_id = string
    table_id   = string
  }))
}
test_bq_dataset = [{
  id       = "ds1"
  location = "US"
  },
  {
    id       = "ds2"
    location = "US"
  }
]
test_bq_table = [{
  dataset_id = "ds1"
  table_id   = "table1"
  },
  {
    dataset_id = "ds2"
    table_id   = "table2"
  },
  {
    dataset_id = "ds1"
    table_id   = "table3"
  }
]
resource "google_bigquery_dataset" "dataset" {
  count      = length(var.test_bq_dataset)
  dataset_id = var.test_bq_dataset[count.index]["id"]
  location   = var.test_bq_dataset[count.index]["location"]
  labels = {
    "environment" = "development"
  }
}
resource "google_bigquery_table" "table" {
  count = length(var.test_bq_table)
  dataset_id = var.test_bq_table[count.index]["dataset_id"]
  table_id   = var.test_bq_table[count.index]["table_id"]
  labels = {
    "environment" = "development"
  }
  depends_on = [
    google_bigquery_dataset.dataset,
  ]
}
我尝试了所有可能性来为数据集中的表创建模式。然而没有一个起作用。
想必您的所有表都应该具有相同的架构......
我会尝试这种方式
在里面
resource "google_bigquery_table" "table"
例如,在标签之后,您可以添加:
schema = file("${path.root}/subdirectories-path/table_schema.json")
哪里的
${path.root}- 是您 terraform 文件的位置subdirectories-path- 零个或多个子目录table_schema.json- 带有模式的 json 文件==>更新 14/02/2021
根据请求显示表架构不同的示例...对原始问题的最小修改。
变量.tf
variable "project_id" {
  description = "The target project"
  type        = string
  default     = "ishim-sample"
}
variable "region" {
  description = "The region where resources are created => europe-west2"
  type        = string
  default     = "europe-west2"
}
variable "zone" {
  description = "The zone in the europe-west region for resources"
  type        = string
  default     = "europe-west2-b"
}
# ===========================
variable "test_bq_dataset" {
  type = list(object({
    id       = string
    location = string
  }))
}
variable "test_bq_table" {
  type = list(object({
    dataset_id = string
    table_id   = string
    schema_id  = string
  }))
}
terraform.tfvars
test_bq_dataset = [
  {
    id       = "ds1"
    location = "EU"
  },
  {
    id       = "ds2"
    location = "EU"
  }
]
test_bq_table = [
  {
    dataset_id = "ds1"
    table_id   = "table1"
    schema_id  = "table-schema-01.json"
  },
  {
    dataset_id = "ds2"
    table_id   = "table2"
    schema_id  = "table-schema-02.json"
  },
  {
    dataset_id = "ds1"
    table_id   = "table3"
    schema_id  = "table-schema-03.json"
  },
  {
    dataset_id = "ds2"
    table_id   = "table4"
    schema_id  = "table-schema-04.json"
  }
]
json 架构文件示例 - table-schema-01.json
[
  {
    "name": "table_column_01",
    "mode": "REQUIRED",
    "type": "STRING",
    "description": ""
  },
  {
    "name": "_gcs_file_path",
    "mode": "REQUIRED",
    "type": "STRING",
    "description": "The GCS path to the file for loading."
  },
  {
    "name": "_src_file_ts",
    "mode": "REQUIRED",
    "type": "TIMESTAMP",
    "description": "The source file modification timestamp."
  },
  {
    "name": "_src_file_name",
    "mode": "REQUIRED",
    "type": "STRING",
    "description": "The file name of the source file."
  },
    {
    "name": "_firestore_doc_id",
    "mode": "REQUIRED",
    "type": "STRING",
    "description": "The hash code (based on the file name and its content, so each file has a unique hash) used as a Firestore document id."
  },
  {
    "name": "_ingested_ts",
    "mode": "REQUIRED",
    "type": "TIMESTAMP",
    "description": "The timestamp when this record was processed during ingestion into the BigQuery table."
  }
]
主.tf
provider "google" {
  project = var.project_id
  region  = var.region
  zone    = var.zone
}
resource "google_bigquery_dataset" "test_dataset_set" {
  project    = var.project_id
  count      = length(var.test_bq_dataset)
  dataset_id = var.test_bq_dataset[count.index]["id"]
  location   = var.test_bq_dataset[count.index]["location"]
  labels = {
    "environment" = "development"
  }
}
resource "google_bigquery_table" "test_table_set" {
  project    = var.project_id
  count      = length(var.test_bq_table)
  dataset_id = var.test_bq_table[count.index]["dataset_id"]
  table_id   = var.test_bq_table[count.index]["table_id"]
  schema     = file("${path.root}/bq-schema/${var.test_bq_table[count.index]["schema_id"]}")
  labels = {
    "environment" = "development"
  }
  depends_on = [
    google_bigquery_dataset.test_dataset_set,
  ]
}
项目目录结构- 屏幕截图
请记住子目录名称 - “bq-schema”,因为它在“main.tf”文件中“google_bigquery_table”资源的“schema”属性中使用。
BigQuery 控制台- 屏幕截图
“terraform apply”命令的结果。
| 归档时间: | 
 | 
| 查看次数: | 3891 次 | 
| 最近记录: |