Sco*_*ain 2 sql-server-2005 sql-server clustered-index insert bulk
我正在尝试将数据从一个数据库系统转换为另一个数据库系统。我需要传输和格式化的表之一包含超过 1000 万行。
我正在运行以下脚本来做到这一点:
USE [Cvti101687]
go
truncate table [IDAT_MR_NOTEHISTORY]
go
IF EXISTS (SELECT * FROM sys.indexes WHERE object_id = OBJECT_ID(N'[dbo].[IDAT_MR_NOTEHISTORY]') AND name = N'PK_iMR_NOTES')
ALTER TABLE [dbo].[IDAT_MR_NOTEHISTORY] DROP CONSTRAINT [PK_iMR_NOTES]
GO
USE [101687_test2]
GO
declare @ChunkCounter int
declare @ChunkSize int
set @ChunkCounter = 0
set @ChunkSize = 50000
--hack to do a do-while loop in sql
while 1 = 1
begin
insert into cvti101639..IDAT_MR_NOTEHISTORY with (tablock)
SELECT newid(), '00000000-0000-0000-0000-000000000000', '00000000-0000-0000-0000-000000000000', '00000000-0000-0000-0000-000000000000'
,'00000000-0000-0000-0000-000000000000', ISNULL(TargetClientAccountNum, [REC_CLINUM]), ISNULL(TargetPetAccountNum, ' ')
,ISNULL(pck_desc, 'Converted Medical Record'), isnull([REC_DOCTOR], '99999'), isnull([REC_ENTRY],''), isnull(REC_ENTRY,''),[rec_dattim], 0
FROM [RECORD]
left join [WVSSPCK] on pck_code = 99 and pck_link = [REC_TAG]
left join cvti101639..idat_patients p on p.oldAccountNum = REC_CASE
where rec_deleted = 0 and (rec_rtype = 0 or rec_rtype is null)
and (TargetClientAccountNum is not null or [REC_CLINUM] is not null)
and rec_id >= @ChunkCounter and rec_id < @ChunkCounter + @ChunkSize
--If no rows inserted, break out of the loop
if(@@rowcount = 0)
break;
set @ChunkCounter = @ChunkCounter + @ChunkSize
end
go
USE [Cvti101687]
go
ALTER TABLE [dbo].[IDAT_MR_NOTEHISTORY] ADD CONSTRAINT [PK_iMR_NOTES] PRIMARY KEY CLUSTERED
(
[cvtGUID] ASC
)
Run Code Online (Sandbox Code Playgroud)
为了加快速度,我尝试删除目标表上的聚集索引并在最后重建它(是的,我知道它是 GUID 上的聚集索引,我只是转换人员,我没有发言权数据库是如何设计的)。
但是,当从插入的 while 循环的其中一个迭代中查看实际执行计划时,我发现它正在对我已经删除的索引执行聚集索引插入。此外,它在聚集索引插入之前执行的排序成本为每个块的 69%。
|--Clustered Index Insert(OBJECT:([Cvti101639].[dbo].[IDAT_MR_NOTEHISTORY].[PK_iMR_NOTES]), SET:([Cvti101639].[dbo].[IDAT_MR_NOTEHISTORY].[cvtGUID] = RaiseIfNull([Expr1012]),[Cvti101639].[dbo].[IDAT_MR_NOTEHISTORY].[cvtClientGUID] = [Expr1013],[Cvti
|--Parallelism(Gather Streams, ORDER BY:([Expr1012] ASC))
|--Sort(ORDER BY:([Expr1012] ASC))
|--Compute Scalar(DEFINE:([Expr1012]=newid(), [Expr1013]={guid'00000000-0000-0000-0000-000000000000'}, [Expr1014]={guid'00000000-0000-0000-0000-000000000000'}, [Expr1015]={guid'00000000-0000-0000-0000-000000000000'}, [Expr1016]={guid'
(snip)
Run Code Online (Sandbox Code Playgroud)
这是完成分块循环的前几次迭代后的实际执行计划的副本。只需获取 xml 并将其另存为.sqlplan
文件,它应该能够在 Management Studio 中打开。
我做错了什么导致这不做堆插入?
归档时间: |
|
查看次数: |
164 次 |
最近记录: |