Cursor is really faster than set-based solution for weighted moving average?

Mon Dec 10, 2007 by Peter Larsson in optimization, sql-server-2008, algorithms, sql-server-2005, sql-server-2000

Today, I was involved in an interesting discussion.

Someone asked for a moving average solution. I joined the discussion late.
The previous solutions and mine were all set-based and very slow. Then Jezemine come up with a CURSOR solution that looked fine and fast.

After some trial-and-errors I finally posted a set-based solution that seems to be the fastest solution yet.
http://www.sqlteam.com/forums/topic.asp?TOPIC_ID=93911

So it still seems that [proper implemented] set-based solutions are the fastest, compared to CURSORs.

Below is also an implementation of a SQL Server 2005 approach.

DECLARE @Sample TABLE (dt DATETIME, Rate FLOAT)

INSERT @Sample

SELECT CURRENT_TIMESTAMP - 10, 1 UNION ALL

SELECT CURRENT_TIMESTAMP - 9, 2 UNION ALL

SELECT CURRENT_TIMESTAMP - 8, 4 UNION ALL

SELECT CURRENT_TIMESTAMP - 7, 4 UNION ALL

SELECT CURRENT_TIMESTAMP - 6, 5 UNION ALL

SELECT CURRENT_TIMESTAMP - 5, 6 UNION ALL

SELECT CURRENT_TIMESTAMP - 4, 6 UNION ALL

SELECT CURRENT_TIMESTAMP - 3, 8 UNION ALL

SELECT CURRENT_TIMESTAMP - 2, 9 UNION ALL

SELECT CURRENT_TIMESTAMP - 1, 10 UNION ALL

SELECT CURRENT_TIMESTAMP - 0, 11 UNION ALL

SELECT CURRENT_TIMESTAMP + 1, 9

-- SQL Server 2005

SELECT DATEADD(DAY, p.ActualDate, '19000101') AS dt,

AVG(p.Rate) AS SimpleMovingAvg,

SUM(

CASE p.dt

WHEN 'dt0' THEN p.r0

WHEN 'dt1' THEN p.r1

WHEN 'dt2' THEN p.r2

ELSE 0

END

) AS WeightedMovingAvg

FROM (

SELECT Rate,

DATEDIFF(DAY, '19000101', dt) AS dt0,

0.7 * Rate AS r0,

DATEDIFF(DAY, '18991231', dt) AS dt1,

0.2 * Rate AS r1,

DATEDIFF(DAY, '18991230', dt) AS dt2,

0.1 * Rate AS r2

FROM @Sample

) AS y

UNPIVOT (

ActualDate

FOR dt IN (y.dt0, y.dt1, y.dt2)

) AS p

GROUP BY p.ActualDate

HAVING MIN(p.dt) = 'dt0'

ORDER BY p.ActualDate

-- SQL Server 2000

SELECT DATEADD(DAY, k.dt, '19000101') AS dt,

AVG(k.Rate) AS SimpleMovingAvg,

SUM(k.wr) AS WeightedMovingAvg

FROM (

SELECT DATEDIFF(DAY, '18991230', dt) AS dt,

0.1 * Rate AS wr,

Rate,

0 AS Actualdate

FROM @Sample

UNION ALL

SELECT DATEDIFF(DAY, '18991231', dt),

0.2 * Rate,

Rate,

FROM @Sample

UNION ALL

SELECT DATEDIFF(DAY, '19000101', dt),

0.7 * Rate,

Rate,

FROM @Sample

) AS k

GROUP BY k.dt

HAVING MAX(k.Actualdate) = 1

ORDER BY    k.dt

Or simply use this rewrite...

SELECT        DATEADD(DAY, v.Number, s.dt),
              AVG(1.0E0 * s.Rate)
FROM          @Sample AS s
INNER JOIN    master..spt_values AS v ON v.Type = 'P'
WHERE         v.Number < 3
GROUP BY      DATEADD(DAY, v.Number, t.dt)
HAVING        COUNT(*) = 3
ORDER BY      DATEADD(DAY, v.Number, t.dt) DESC

Legacy Comments


Jesse 2007-12-16	re: Cursor is really faster than set-based solution? I think a loop will still be faster for running totals though, since for a running total you have to look at all the rows preceding the current row, rather than just the preceding few. want to prove me wrong? ;)

Peso 2007-12-17	re: Cursor is really faster than set-based solution for weighted moving average? No... I can't. With a running total there can be any number of records. With moving average there are a finite number of revords.

Jeff Moden 2009-07-20	re: Cursor is really faster than set-based solution for weighted moving average? Jesse wrote: I think a loop will still be faster for running totals though, since for a running total you have to look at all the rows preceding the current row, rather than just the preceding few. want to prove me wrong? ;) ------------------------------------------- The only proper way to drop such a gauntlet is with code Jesse. The onus is on the challenger. ;-) --Jeff Moden