Best way to interpolate values in SQL

Cyr*_*don 13 sql interpolation sql-server-2005 linear-interpolation

I have a table of rates for specific dates:

          Rates

Id  |     Date      |  Rate
----+---------------+-------
 1  |   01/01/2011  |  4.5
 2  |   01/04/2011  |  3.2
 3  |   04/06/2011  |  2.4
 4  |   30/06/2011  |  5

I would like the output rate to be based on a simple linear interpolation.

So, if I enter 17/06/2011:

Date        Rate
----------  -----
01/01/2011  4.5
01/04/2011  3.2
04/06/2011  2.4
17/06/2011  
30/06/2011  5.0

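The linear interpolation is (5 + 2.4) / 2 = 3.7, because 17/06/2011 lies exactly halfway between 04/06/2011 and 30/06/2011 (13 days on either side).

Is there a way to do this with a simple query (SQL Server 2005), or does this kind of thing need to be done programmatically (C#...)?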
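For dates that are not exactly halfway, the same calculation can be written in general form. The symbols here are introduced only for illustration and do not appear in the question: (d_0, r_0) and (d_1, r_1) are the known points on either side of the input date d, and date differences are counted in days.

```latex
\begin{aligned}
r(d) &= r_0 + (r_1 - r_0)\,\frac{d - d_0}{d_1 - d_0} \\
     &= 2.4 + (5 - 2.4)\cdot\frac{13}{26} \\
     &= 3.7
\end{aligned}
```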

ype*_*eᵀᴹ 21

Something like this (corrected):

SELECT CASE WHEN next.Date IS NULL  THEN prev.Rate
            WHEN prev.Date IS NULL  THEN next.Rate
            WHEN next.Date = prev.Date  THEN prev.Rate
              ELSE ( DATEDIFF(d, prev.Date, @InputDate) * next.Rate 
                   + DATEDIFF(d, @InputDate, next.Date) * prev.Rate
                   ) / DATEDIFF(d, prev.Date, next.Date)
       END AS interpolationRate 
FROM
  ( SELECT TOP 1 
        Date, Rate 
    FROM Rates
    WHERE Date <= @InputDate
    ORDER BY Date DESC
  ) AS prev
  CROSS JOIN
  ( SELECT TOP 1 
        Date, Rate 
    FROM Rates
    WHERE Date >= @InputDate
    ORDER BY Date ASC
  ) AS next
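To try the query against the sample data from the question, a minimal harness might look like the sketch below. The temp table name #Rates and the column types are my assumptions (the question does not give the table definition), and the statements are kept to syntax that should work on SQL Server 2005.

```sql
-- Hypothetical test harness: the temp table name and column types are assumptions.
IF OBJECT_ID('tempdb..#Rates') IS NOT NULL DROP TABLE #Rates;

CREATE TABLE #Rates (
    Id   INT          NOT NULL PRIMARY KEY,
    Date DATETIME     NOT NULL,  -- DATETIME, since the DATE type only arrived in SQL Server 2008
    Rate DECIMAL(9,4) NOT NULL   -- decimal, so the final division is not integer division
);

-- One INSERT per row; multi-row VALUES lists require SQL Server 2008 or later.
INSERT INTO #Rates (Id, Date, Rate) VALUES (1, '20110101', 4.5);
INSERT INTO #Rates (Id, Date, Rate) VALUES (2, '20110401', 3.2);
INSERT INTO #Rates (Id, Date, Rate) VALUES (3, '20110604', 2.4);
INSERT INTO #Rates (Id, Date, Rate) VALUES (4, '20110630', 5);

-- DECLARE with an initializer is also 2008+, so the value is assigned in a separate SET.
DECLARE @InputDate DATETIME;
SET @InputDate = '20110617';

-- Same query as in the answer, pointed at the temp table.
SELECT CASE WHEN next.Date IS NULL      THEN prev.Rate
            WHEN prev.Date IS NULL      THEN next.Rate
            WHEN next.Date = prev.Date  THEN prev.Rate
            ELSE ( DATEDIFF(d, prev.Date, @InputDate) * next.Rate
                 + DATEDIFF(d, @InputDate, next.Date) * prev.Rate
                 ) / DATEDIFF(d, prev.Date, next.Date)
       END AS interpolationRate
FROM
  ( SELECT TOP 1 Date, Rate
    FROM #Rates
    WHERE Date <= @InputDate
    ORDER BY Date DESC
  ) AS prev
  CROSS JOIN
  ( SELECT TOP 1 Date, Rate
    FROM #Rates
    WHERE Date >= @InputDate
    ORDER BY Date ASC
  ) AS next;
-- For the sample data this yields 3.7: (13 * 5 + 13 * 2.4) / 26.
```

If Rate were stored as an integer column, the division would be integer division, so casting one operand to a decimal type would be needed.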


小智 5

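The catch with using CROSS JOIN here is that it returns no records if either of the two subqueries has no rows (1 * 0 = 0), so the query may break. A better way is to use a FULL OUTER JOIN with an inequality condition (to avoid getting more than one row):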

( SELECT TOP 1 
        Date, Rate 
    FROM Rates
    WHERE Date <= @InputDate
    ORDER BY Date DESC
  ) AS prev
  FULL OUTER JOIN
  ( SELECT TOP 1 
        Date, Rate 
    FROM Rates
    WHERE Date >= @InputDate
    ORDER BY Date ASC
  ) AS next
ON (prev.Date <> next.Date)  -- or compare Rate instead, depending on which column is unique
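The fragment above only shows the FROM clause, so here is a sketch of how it might fit into the complete query. Note one deliberate substitution of mine: the join condition is ON 1 = 1 rather than the inequality. With prev.Date <> next.Date, an input date that exactly matches a stored date makes prev and next the same row, the condition is false, and the full outer join then returns two rows instead of one. The temp table #Rates and its column types are assumptions, as the question does not define the table.

```sql
-- Hypothetical test harness: the temp table name and column types are assumptions.
IF OBJECT_ID('tempdb..#Rates') IS NOT NULL DROP TABLE #Rates;

CREATE TABLE #Rates (
    Id   INT          NOT NULL PRIMARY KEY,
    Date DATETIME     NOT NULL,
    Rate DECIMAL(9,4) NOT NULL
);

INSERT INTO #Rates (Id, Date, Rate) VALUES (1, '20110101', 4.5);
INSERT INTO #Rates (Id, Date, Rate) VALUES (2, '20110401', 3.2);
INSERT INTO #Rates (Id, Date, Rate) VALUES (3, '20110604', 2.4);
INSERT INTO #Rates (Id, Date, Rate) VALUES (4, '20110630', 5);

-- An input date after the last stored date, so the "next" subquery is empty.
DECLARE @InputDate DATETIME;
SET @InputDate = '20110801';

SELECT CASE WHEN next.Date IS NULL      THEN prev.Rate
            WHEN prev.Date IS NULL      THEN next.Rate
            WHEN next.Date = prev.Date  THEN prev.Rate
            ELSE ( DATEDIFF(d, prev.Date, @InputDate) * next.Rate
                 + DATEDIFF(d, @InputDate, next.Date) * prev.Rate
                 ) / DATEDIFF(d, prev.Date, next.Date)
       END AS interpolationRate
FROM
  ( SELECT TOP 1 Date, Rate
    FROM #Rates
    WHERE Date <= @InputDate
    ORDER BY Date DESC
  ) AS prev
  FULL OUTER JOIN
  ( SELECT TOP 1 Date, Rate
    FROM #Rates
    WHERE Date >= @InputDate
    ORDER BY Date ASC
  ) AS next
  ON 1 = 1;  -- always true: pairs the two rows when both exist, keeps the lone row otherwise
-- Here only prev exists (30/06/2011, rate 5), so the CASE falls back to prev.Rate.
```

Outside the stored date range this returns the nearest known rate rather than extrapolating, which follows from the first two CASE branches.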


car*_*10m 5

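As @Mark already pointed out, the CROSS JOIN has its limitations. As soon as the target value falls outside the range of the defined values, no records are returned.

Also, the solutions above are limited to a single result. For my project I needed to interpolate a whole list of x values, and I came up with the following solution. Maybe it is of interest to other readers too?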

-- generate some grid data values in table #ddd:
CREATE TABLE #ddd (id int,x float,y float, PRIMARY KEY(id,x));
INSERT INTO  #ddd VALUES (1,3,4),(1,4,5),(1,6,3),(1,10,2),
                         (2,1,4),(2,5,6),(2,6,5),(2,8,2);
SELECT * FROM #ddd;                         

-- target x-values in table #vals (results are to go into column yy):
CREATE TABLE #vals (xx float PRIMARY KEY,yy float null, itype int);
INSERT INTO  #vals (xx) VALUES (1),(3),(4.3),(9),(12);

-- do the actual interpolation
WITH valstyp AS (
  SELECT id ii,xx,
         CASE WHEN min(x)<xx THEN CASE WHEN max(x)>xx THEN 1 ELSE 2 END ELSE 0 END flag,
         min(x) xmi,max(x) xma 
  FROM #vals INNER JOIN #ddd ON id=1 GROUP BY xx,id
), ipol AS (
  SELECT v.*,(b.x-xx)/(b.x-a.x) f,a.y ya,b.y yb 
  FROM valstyp v 
  INNER JOIN #ddd a ON a.id=ii AND a.x=(SELECT max(x) FROM #ddd WHERE id=ii 
             AND (flag=0 AND x=xmi OR flag=1 AND x<xx OR flag=2 AND x<xma))
  INNER JOIN #ddd b ON b.id=ii AND b.x=(SELECT min(x) FROM #ddd WHERE id=ii 
             AND (flag=0 AND x>xmi OR flag=1 AND x>xx OR flag=2 AND x=xma))
)
UPDATE v SET yy=ROUND(f*ya+(1-f)*yb,8),itype=flag FROM #vals v INNER JOIN ipol i ON i.xx=v.xx;

-- list the interpolated results table:
SELECT * FROM #vals

When you run the script above, you get the following grid data points in table #ddd:

id x  y 
-- -- - 
1  3  4 
1  4  5 
1  6  3 
1  10 2 
2  1  4 
2  5  6 
2  6  5 
2  8  2 
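
(The table contains grid points for two identities, id=1 and id=2. In my example I reference only the 1-group, by using the condition id=1 in the valstyp CTE. This can be changed to suit your requirements.)

And this is the results table #vals, with the interpolated data in column yy: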

xx  yy   itype 
--- ---- ----- 
1   2    0     
3   4    0     
4.3 4.7  1     
9   2.25 1     
12  1.5  2     

The last column, itype, indicates the type of interpolation/extrapolation that was used to calculate the value:
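To make the formula in the UPDATE statement concrete, two rows of the result table can be checked by hand. The script computes f = (b.x - xx) / (b.x - a.x) and then yy = f*ya + (1-f)*yb, where (a.x, ya) and (b.x, yb) are the two grid points selected for each xx. The arithmetic below is my own check against the id=1 grid points, not part of the original answer.

```latex
\begin{aligned}
xx = 4.3 &: \quad f = \frac{6 - 4.3}{6 - 4} = 0.85,
  & yy &= 0.85 \cdot 5 + 0.15 \cdot 3 = 4.7 \\
xx = 12 &: \quad f = \frac{10 - 12}{10 - 6} = -0.5,
  & yy &= -0.5 \cdot 3 + 1.5 \cdot 2 = 1.5
\end{aligned}
```

For xx = 12 the factor f falls outside the range 0 to 1, which is what turns the same formula into an extrapolation along the last segment, from (6, 3) to (10, 2).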

0:  extrapolation to lower end
1:  interpolation within given data range
2:  extrapolation to higher end

This working example can be found here.