Snowflake: get a list of mismatched columns between two tables (SQL)

Kau*_*ber 1 sql snowflake-cloud-data-platform

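I've been doing some research but haven't found much. I need to compare two tables to get the list of columns that exist in table 1 but not in table 2. I'm using Snowflake. So far I've found this answer: postgresql - get a list of columns Difference Between 2 table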

The problem is that when I run the code I get this error:

SQL compilation error: invalid identifier TRANSIENT_STAGE_TABLE

If I run the parts of the code on their own, they work fine. For example, if I run:

SELECT column_name
FROM information_schema.columns 
WHERE table_schema = 'your_schema' AND table_name = 'table2'

I do get a list of column names, but when I chain it to the second expression, the error above is returned. Any hints about what is going on? Thanks

Mar*_*ski 7

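The query from the original post should work. Perhaps you are missing a single quote somewhere? The error reports TRANSIENT_STAGE_TABLE as an invalid identifier, which suggests the table name was parsed as an identifier rather than as a string literal. See this example: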

create or replace table xxx1(i int, j int);
create or replace table xxx2(i int, k int);

-- Query from the original post
SELECT column_name
FROM information_schema.columns 
WHERE table_name = 'XXX1'
    AND column_name NOT IN
    (
        SELECT column_name
        FROM information_schema.columns 
        WHERE table_name = 'XXX2'
    );
-------------+
 COLUMN_NAME |
-------------+
 J           |
-------------+

You can also write a slightly more complex query to see all mismatched columns across both tables:

with 
s1 as (
  select table_name, column_name 
  from information_schema.columns 
  where table_name = 'XXX1'), 
s2 as (
  select table_name, column_name 
  from information_schema.columns 
  where table_name = 'XXX2') 
select * from s1 full outer join s2 on s1.column_name = s2.column_name;
------------+-------------+------------+-------------+
 TABLE_NAME | COLUMN_NAME | TABLE_NAME | COLUMN_NAME |
------------+-------------+------------+-------------+
 XXX1       | I           | XXX2       | I           |
 XXX1       | J           | [NULL]     | [NULL]      |
 [NULL]     | [NULL]      | XXX2       | K           |
------------+-------------+------------+-------------+

Of course, you can add WHERE s1.column_name IS NULL or s2.column_name IS NULL to find only the missing columns.
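For reference, here is a sketch of what the filtered version might look like, reusing the XXX1 and XXX2 tables from the example above. The output alias present_in is just an illustrative name, not something Snowflake provides.

```sql
-- Sketch: keep only columns that exist in exactly one of the two tables.
with
s1 as (
  select table_name, column_name
  from information_schema.columns
  where table_name = 'XXX1'),
s2 as (
  select table_name, column_name
  from information_schema.columns
  where table_name = 'XXX2')
select
  coalesce(s1.table_name, s2.table_name)   as present_in,  -- illustrative alias: the table that has the column
  coalesce(s1.column_name, s2.column_name) as column_name
from s1
full outer join s2 on s1.column_name = s2.column_name
-- a NULL on either side means the column is missing from that table
where s1.column_name is null or s2.column_name is null;
```

With the two example tables this should list J as present only in XXX1 and K as present only in XXX2. If the same table name exists in more than one schema of the database, you would probably also want a table_schema condition in both CTEs, as in the query from the question.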

You can also easily extend it to detect differences in column types.
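One possible way to do that, as a rough sketch: pull data_type from information_schema.columns alongside the column name and compare it for columns that exist in both tables. The aliases xxx1_type and xxx2_type are made up for this example.

```sql
-- Sketch: columns present in both tables whose declared types differ.
with
s1 as (
  select column_name, data_type
  from information_schema.columns
  where table_name = 'XXX1'),
s2 as (
  select column_name, data_type
  from information_schema.columns
  where table_name = 'XXX2')
select
  s1.column_name,
  s1.data_type as xxx1_type,  -- illustrative alias
  s2.data_type as xxx2_type   -- illustrative alias
from s1
join s2 on s1.column_name = s2.column_name
where s1.data_type <> s2.data_type;
```

For the two example tables this returns no rows, since the only shared column, I, has the same type in both. Note that data_type holds only the base type name; if precision or length matters, columns such as numeric_precision, numeric_scale, and character_maximum_length would need to be compared as well.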