Joh*_*tud 1 sql sql-server string-aggregation
我有以下形状的数据:
BOM -- 500 rows, 4 cols
PartProject -- 2.6mm rows, 4 cols
Project -- 1000 rows, 5 cols
Part -- 200k rows, 18 cols
Run Code Online (Sandbox Code Playgroud)
然而,当我尝试这样做时string_agg,我的代码将花费 10 多分钟才能在 500 行上执行。我该如何改进这个查询(数据不可用)。
select
BOM.*,
childParentPartProjectName
into #tt2 -- tt for some testing
from #tt1 AS BOM -- tt for some testing
-- cross applys for string agg many to one
CROSS APPLY (
SELECT childParentPartProjectName = STRING_AGG(PROJECT_childParentPart.NAME, ', ') WITHIN GROUP (ORDER BY PROJECT_childParentPart.NAME)
FROM (
SELECT DISTINCT PROJECT3.NAME
FROM [dbo].[Project] PROJECT3
LEFT JOIN [dbo].[Part] P3 on P3.ITEM_NUMBER = BOM.childParentPart
LEFT JOIN [dbo].[PartProject] PP3 on PP3.SOURCE_ID = P3.ID
WHERE PP3.RELATED_ID = PROJECT3.ID and P3.CURRENT = 1
) PROJECT_childParentPart ) PROJECT3
Run Code Online (Sandbox Code Playgroud)
您拥有的子查询(子查询内)有一种代码“气味”,它是有意编写的,但不正确。
首先,子查询中有 2个,但是,别名为和LEFT JOIN的表都必须具有非值;如果没有找到相关行,这是不可能的。这意味着s 是隐式s。P3PP3NULLJOININNER JOIN
接下来,当从多个表中进行查询时,您将DISTINCT针对单个列进行操作;这似乎是错误的。非常昂贵,并且您使用它的事实意味着要么不是唯一的,要么由于您的隐式s 您得到了重复的行。我认为是后者。因此,您很可能实际上应该使用 an ,而不是s。SELECTDISTINCTNAMEINNER JOINEXISTSLEFT JOINsINNER JOIN
以下内容很大程度上是猜测,但我怀疑它的性能会更高。
SELECT BOM.*, --Replace this with an explicit list of the columns you need
SA.childParentPartProjectName
INTO #tt2
FROM #tt1 BOM
CROSS APPLY (SELECT STRING_AGG(Prj.NAME, ', ') WITHIN GROUP (ORDER BY Prj.NAME) AS childParentPartProjectName
FROM dbo.Project Prj --Don't use an alias that is longer than the object name
WHERE EXISTS (SELECT 1
FROM dbo.Part P
JOIN dbo.PartProject PP ON P.ID = PP.SOURCE_ID
WHERE PP.Related_ID = Prg.ID
AND P.ITEM_NUMBER = BOM.childParentPart
AND P.Current = 1)) SA;
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
2083 次 |
| 最近记录: |