Sla*_*uma 6 sql t-sql sql-server sql-server-express sql-server-2008-r2
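I have a problem with a SQL database query that suddenly (but typically about every three weeks) becomes slow.
The setup is as follows:
The table the query mainly selects from (Orders) has about 24000 records; the other five joined tables are small (100 records or fewer).
Orders has a varbinary(MAX) column Report that contains binary data (PDF documents) with an average size of about 200 to 300 kB (but occasionally up to 2 MB).
More than 90% of these 24000 orders have this column filled; for the others it is NULL. In other words, more than 90% of the 6 GB database size is binary data.
The query in question has the following structure: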
SELECT TOP (30) [Project2].[OrderID] AS [OrderID]
    -- around 20 columns more
FROM ( SELECT [Project2].[OrderID] AS [OrderID],
           -- around 20 columns more
           row_number() OVER (ORDER BY [Project2].[OrderID] ASC) AS [row_number]
       FROM ( SELECT [Filter1].[OrderID] AS [OrderID]
                  -- around 20 columns more
              FROM ( SELECT [Extent1].[OrderID] AS [OrderID]
                         -- around 20 columns more
                     FROM [dbo].[Orders] AS [Extent1]
                     INNER JOIN -- small table
                     LEFT OUTER JOIN -- small table
                     LEFT OUTER JOIN -- small table
                     LEFT OUTER JOIN -- small table
                     LEFT OUTER JOIN -- small table
                     WHERE ([Extent1].[Status] IS NOT NULL)
                       AND (4 = CAST( [Extent1].[Status] AS int))
                       AND ([Extent1].[SomeDateTime] IS NULL)
                       AND ([Extent1].[Report] IS NULL)
                   ) AS [Filter1]
              OUTER APPLY (SELECT TOP (1) [Project1].[C1] AS [C1]
                           FROM ( SELECT CAST( [Extent7].[CreationDateTime] AS datetime2) AS [C1],
                                      [Extent7].[CreationDateTime] AS [CreationDateTime]
                                  FROM [dbo].[OtherTable] AS [Extent7]
                                  WHERE [Filter1].[OrderID] = [Extent7].[OrderID]
                                ) AS [Project1]
                           ORDER BY [Project1].[CreationDateTime] DESC
                          ) AS [Limit1]
            ) AS [Project2]
     ) AS [Project2]
WHERE [Project2].[row_number] > 0
ORDER BY [Project2].[OrderID] ASC
It is generated by Entity Framework from a LINQ-to-Entities query. The query occurs in a few variants that differ only in the first WHERE clause:
Five variants with
WHERE ([Extent1].[Status] IS NOT NULL)
AND (X = CAST( [Extent1].[Status] AS int))
where X can be between 0 and 4. These queries have never been a problem.
Two variants (*) with
WHERE ([Extent1].[Status] IS NOT NULL)
AND (4 = CAST( [Extent1].[Status] AS int))
AND ([Extent1].[SomeDateTime] IS NULL)
AND ([Extent1].[Report] IS NULL)
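or with ... IS NOT NULL ... in the last line. Only these two queries show the problem described below.
The "phenomena" are:
Another observation:
Somehow I suspect that the whole problem has to do with the memory limit (1 GB) of the Express edition and with the varbinary(MAX) column, although I only use that column in the WHERE clause to check whether its value is NULL or not NULL. The Report column itself is not one of the selected columns.
Since I will run into the Express edition's limit (10 GB mdf file size) next year anyway, I am considering some changes:
... in the Orders table
Question: What could be the reason the query suddenly becomes slow? Could one of the changes I am planning solve the problem, or are there other solutions?
Edit
Following bhamby's hint in the comments below, I set SET STATISTICS TIME ON in SSMS before running the query again. When the query becomes slow again, I get a high value for SQL Server parse and compile time, namely CPU time = 27.3 sec and Elapsed time = 81.9 sec. The execution time of the query itself is only CPU time = 0.06 sec and Elapsed time = 2.8 sec. Running the query a second time after that gives CPU time = 0.06 sec and Elapsed time = 0.08 sec for SQL Server parse and compile time.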
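The measurement described above could be reproduced in an SSMS query window roughly as follows. This is only a sketch: the SELECT is a simplified stand-in for the generated query (the alias `o` and the reduced column list are assumptions for illustration), built from the table and filter columns shown in the question.

```sql
-- Report parse/compile time and execution time separately in the Messages tab.
SET STATISTICS TIME ON;

-- Simplified stand-in for the generated query (assumption: the real
-- Entity Framework statement would be pasted here instead).
SELECT TOP (30) o.[OrderID]
FROM [dbo].[Orders] AS o
WHERE o.[Status] IS NOT NULL
  AND 4 = CAST(o.[Status] AS int)
  AND o.[SomeDateTime] IS NULL
  AND o.[Report] IS NULL
ORDER BY o.[OrderID] ASC;

-- Switch the timing output off again for this session.
SET STATISTICS TIME OFF;
```

A large gap between "SQL Server parse and compile time" and "SQL Server Execution Times" in the output, as in the numbers above, suggests the time is being spent compiling the plan rather than reading the data.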
This just seems wasteful:
SELECT TOP (1) [Project1].[C1] AS [C1]
FROM ( SELECT CAST( [Extent7].[CreationDateTime] AS datetime2) AS [C1],
           [Extent7].[CreationDateTime] AS [CreationDateTime]
       FROM [dbo].[OtherTable] AS [Extent7]
       WHERE [Filter1].[OrderID] = [Extent7].[OrderID]
     ) AS [Project1]
ORDER BY [Project1].[CreationDateTime] DESC
It is the same as
SELECT max( CAST( [Extent7].[CreationDateTime] AS datetime2) ) AS [C1]
FROM [dbo].[OtherTable] AS [Extent7]
WHERE [Filter1].[OrderID] = [Extent7].[OrderID]
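To illustrate why the two forms give the same value, here is a small self-contained sketch. The table variable `@OtherTable`, its sample rows, the `datetime` column type, and the variable `@OrderID` are all assumptions made up for the demonstration; they are not taken from the real schema.

```sql
-- Hypothetical sample data standing in for [dbo].[OtherTable].
DECLARE @OtherTable TABLE (
    [OrderID]          int      NOT NULL,
    [CreationDateTime] datetime NOT NULL  -- assumption: the real type is unknown
);

INSERT INTO @OtherTable ([OrderID], [CreationDateTime])
VALUES (1, '2012-01-05T10:00:00'),
       (1, '2012-03-01T08:30:00'),
       (2, '2012-02-10T12:00:00');

DECLARE @OrderID int;
SET @OrderID = 1;

-- Form generated by Entity Framework: newest row via TOP (1) ... ORDER BY DESC.
SELECT TOP (1) CAST(t.[CreationDateTime] AS datetime2) AS [C1]
FROM @OtherTable AS t
WHERE t.[OrderID] = @OrderID
ORDER BY t.[CreationDateTime] DESC;

-- Aggregate form: MAX over the same rows.
SELECT MAX(CAST(t.[CreationDateTime] AS datetime2)) AS [C1]
FROM @OtherTable AS t
WHERE t.[OrderID] = @OrderID;
```

Both statements return the later of the two timestamps for order 1. When no row matches, the TOP (1) form returns no row while MAX returns a single NULL; inside an OUTER APPLY both end up as NULL in the outer row.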
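Why are the dates not stored as datetime?
I don't like that OUTER APPLY.
I would create a #temp table that is filled once and join to it.
Be sure to declare [OrderID] as the PK.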
SELECT [Extent7].[OrderID], max( CAST( [Extent7].[CreationDateTime] AS datetime2) ) AS [C1]
FROM [dbo].[OtherTable] AS [Extent7]
GROUP BY [Extent7].[OrderID]
You could have a loop join going on.
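A minimal sketch of that #temp table might look like this. The `int` type for `OrderID` and the filter on NULL keys are assumptions, since the schema of OtherTable is not shown; the SELECT is the GROUP BY query from above.

```sql
-- Temp table holding the latest CreationDateTime per order.
-- The PRIMARY KEY on OrderID gives the optimizer a unique index to join on.
CREATE TABLE #temp (
    [OrderID] int       NOT NULL PRIMARY KEY,  -- assumption: OrderID is an int
    [C1]      datetime2 NULL
);

-- Filled once, instead of evaluating a correlated subquery per order.
INSERT INTO #temp ([OrderID], [C1])
SELECT [Extent7].[OrderID],
       MAX(CAST([Extent7].[CreationDateTime] AS datetime2)) AS [C1]
FROM [dbo].[OtherTable] AS [Extent7]
WHERE [Extent7].[OrderID] IS NOT NULL  -- assumption: guards the PK if the column is nullable
GROUP BY [Extent7].[OrderID];
```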
Next I would put this into a #temp2 so that you can be sure it only runs once.
Again, be sure to declare OrderID as the PK.
SELECT [Extent1].[OrderID] AS [OrderID]
-- around 20 columns more
FROM [dbo].[Orders] AS [Extent1]
INNER JOIN -- small table
LEFT OUTER JOIN -- small table
LEFT OUTER JOIN -- small table
LEFT OUTER JOIN -- small table
LEFT OUTER JOIN -- small table
WHERE ([Extent1].[Status] IS NOT NULL)
AND (4 = CAST( [Extent1].[Status] AS int))
AND ([Extent1].[SomeDateTime] IS NULL)
AND ([Extent1].[Report] IS NULL)
If Orders only has 24,000 rows, then something stupid is going on if the query takes more than a few seconds.
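Putting the second step together, a sketch of #temp2 and the final join could look as follows. It assumes that #temp (with [OrderID] as PK and the column [C1]) has already been filled from the GROUP BY query above, that OrderID is an `int`, and that only the first page of 30 rows is wanted; the aliases `o` and `t` are made up for the example, and the joins to the small tables are left out because the question does not show them.

```sql
-- Filtered orders, materialized once.
CREATE TABLE #temp2 (
    [OrderID] int NOT NULL PRIMARY KEY  -- assumption: OrderID is an int
    -- around 20 columns more
);

INSERT INTO #temp2 ([OrderID])
SELECT [Extent1].[OrderID]
FROM [dbo].[Orders] AS [Extent1]
-- joins to the five small tables go here
WHERE [Extent1].[Status] IS NOT NULL
  AND 4 = CAST([Extent1].[Status] AS int)
  AND [Extent1].[SomeDateTime] IS NULL
  AND [Extent1].[Report] IS NULL;

-- First page of 30 orders with the latest CreationDateTime per order.
-- LEFT OUTER JOIN keeps orders without a matching row, like the OUTER APPLY did.
SELECT TOP (30) o.[OrderID], t.[C1]
FROM #temp2 AS o
LEFT OUTER JOIN #temp AS t
    ON t.[OrderID] = o.[OrderID]
ORDER BY o.[OrderID] ASC;
```

Since both temp tables are keyed on OrderID, the final statement is a simple join between two small indexed sets, which leaves the optimizer far less to work out than the nested generated query.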