我要求将相邻记录返回到本机SQL中序列中的缺失记录.如果缺少序列中的第一个条目,则仅返回下一个条目.无需在序列中查找缺失的结尾.
这是在SQL Server 12.0.2000.8中运行
有问题的列的结构:
BatchId(nvarchar(50), null)
CreateDate(datetime, null)
UserId(varchar(50), null)
Batch(varchar(50), null)
Run Code Online (Sandbox Code Playgroud)
" - "后BatchId中的最后一个数字确定序列.BatchId与Batch相关联.批量更改时,BatchId上的序列应重置为1.
BatchId CreateDate UserId Batch
#########################################################
9K182855 - 1 2017-09-26 17:57:20.977 9K182855 8
9K182855 - 2 2017-09-26 18:20:57.693 9K182855 8
9K182855 - 1 2017-09-27 11:04:46.177 9K182855 9
9K182855 - 2 2017-09-27 11:19:32.990 9K182855 9
Run Code Online (Sandbox Code Playgroud)
我用来获取数据的查询
select BatchID, CreateDate, UserId, Batch from Results
where CreateDate > dateadd(day,-2,getdate())
and Batch between 0 and 9
order by UserId, CreateDate, Batch;
Run Code Online (Sandbox Code Playgroud)
这是好的数据
BatchId CreateDate UserId Batch
#########################################################
4L182855 - 1 2017-09-28 14:04:46.177 4L182855 9
4L182855 - 2 2017-09-28 15:19:32.990 4L182855 9
4L182855 - 3 2017-09-28 16:30:27.953 4L182855 9
4L182855 - 4 2017-09-28 17:57:20.977 4L182855 9
4L182855 - 5 2017-09-28 18:20:57.693 4L182855 9
4L182855 - 1 2017-09-29 11:04:46.177 4L182855 0
4L182855 - 2 2017-09-29 11:19:32.990 4L182855 0
4L182855 - 3 2017-09-29 11:30:27.953 4L182855 0
4L182855 - 4 2017-09-29 11:57:20.977 4L182855 0
4L182855 - 5 2017-09-29 12:00:57.693 4L182855 0
4L182855 - 6 2017-09-29 12:04:46.177 4L182855 0
4L182855 - 7 2017-09-29 12:19:32.990 4L182855 0
4L182855 - 8 2017-09-29 12:30:27.953 4L182855 0
4L182855 - 9 2017-09-29 13:57:20.977 4L182855 0
4L182855 - 10 2017-09-29 14:20:57.693 4L182855 0
Run Code Online (Sandbox Code Playgroud)
这是MISSING数据
BatchId CreateDate UserId Batch
#########################################################
4L182855 - 1 2017-09-28 14:04:46.177 4L182855 9
4L182855 - 2 2017-09-28 15:19:32.990 4L182855 9
4L182855 - 4 2017-09-28 17:57:20.977 4L182855 9
4L182855 - 5 2017-09-28 18:20:57.693 4L182855 9
4L182855 - 1 2017-09-29 11:04:46.177 4L182855 0
4L182855 - 2 2017-09-29 11:19:32.990 4L182855 0
4L182855 - 3 2017-09-29 11:30:27.953 4L182855 0
4L182855 - 4 2017-09-29 11:57:20.977 4L182855 0
4L182855 - 5 2017-09-29 12:00:57.693 4L182855 0
4L182855 - 6 2017-09-29 12:04:46.177 4L182855 0
4L182855 - 7 2017-09-29 12:19:32.990 4L182855 0
4L182855 - 8 2017-09-29 12:30:27.953 4L182855 0
4L182855 - 10 2017-09-29 14:20:57.693 4L182855 0
Run Code Online (Sandbox Code Playgroud)
要求是返回下面的行,它们与丢失的记录相邻
BatchId CreateDate UserId Batch
#########################################################
4L182855 - 2 2017-09-28 15:19:32.990 4L182855 9
4L182855 - 4 2017-09-28 17:57:20.977 4L182855 9
4L182855 - 8 2017-09-29 12:30:27.953 4L182855 0
4L182855 - 10 2017-09-29 14:20:57.693 4L182855 0
Run Code Online (Sandbox Code Playgroud)
我可以用Python或者可能通过CLR用户定义函数来完成.但是,我不确定它在本机SQL中是否可行.如果可以,请赐教.
使用stuff()截断batchid得到批处理序列,并lead()与lag()从对计算的前面和后面的行获取值BatchSeq:
select s.BatchId, s.CreateDate, s.UserId, s.Batch
from (
select t.*
, PrevSeq = lag(x.BatchSeq) over (partition by Batch order by CreateDate)
, x.BatchSeq
, NextSeq = lead(x.BatchSeq) over (order by CreateDate)
from results t
cross apply (values (convert(int,stuff(t.batchid,1,charindex('- ',t.batchid)+1,'')))
) x (BatchSeq)
) s
where BatchSeq - isnull(PrevSeq,0) != 1
or (BatchSeq - NextSeq !=-1 and NextSeq != 1)
order by createdate
Run Code Online (Sandbox Code Playgroud)
rextester演示:http://rextester.com/ZCBLP37968
收益:
+---------------+---------------------+----------+-------+
| BatchId | CreateDate | UserId | Batch |
+---------------+---------------------+----------+-------+
| 4L182855 - 2 | 2017-09-28 15:19:32 | 4L182855 | 9 |
| 4L182855 - 4 | 2017-09-28 17:57:20 | 4L182855 | 9 |
| 4L182855 - 8 | 2017-09-29 12:30:27 | 4L182855 | 0 |
| 4L182855 - 10 | 2017-09-29 14:20:57 | 4L182855 | 0 |
+---------------+---------------------+----------+-------+
Run Code Online (Sandbox Code Playgroud)
这也适用于丢失的第一条记录:http://rextester.com/BLAD55913
| 归档时间: |
|
| 查看次数: |
65 次 |
| 最近记录: |