Resampling/Group By Time Series Data

lenoyjacob · February 1, 2023, 5:54pm

General formula is: SELECT TO_TIMESTAMP(UNIX_TIMESTAMP(your_timestamp_column, 'YYYY-MM-DD HH24:MI:SS.FFF') - UNIX_TIMESTAMP(your_timestamp_column,'YYYY-MM-DD HH24:MI:SS.FFF')%your_granularity_in_seconds)

So in your case, the query will for 5 minutes (300 seconds) will be:

SELECT tag, TO_TIMESTAMP(UNIX_TIMESTAMP(ts, 'YYYY-MM-DD HH24:MI:SS.FFF') - UNIX_TIMESTAMP(ts,'YYYY-MM-DD HH24:MI:SS.FFF')%300), avg(val)
FROM nikm
GROUP BY 1, 2
ORDER BY 2

Here’s a full example with some sample data:

WITH nikm AS
  (
      select TO_TIMESTAMP('2023-01-31 10:00:02', 'YYYY-MM-DD HH24:MI:SS')  as ts, 'A' as tag, 10 as val
      UNION ALL
      select TO_TIMESTAMP('2023-01-31 10:08:02', 'YYYY-MM-DD HH24:MI:SS')  as ts, 'A' as tag, 5 as val
      UNION ALL
      select TO_TIMESTAMP('2023-01-31 10:09:02', 'YYYY-MM-DD HH24:MI:SS')  as ts, 'A' as tag, 20 as val
      UNION ALL
      select TO_TIMESTAMP('2023-01-31 10:20:02', 'YYYY-MM-DD HH24:MI:SS')  as ts, 'B' as tag, 6 as val
  )
SELECT tag, TO_TIMESTAMP(UNIX_TIMESTAMP(ts, 'YYYY-MM-DD HH24:MI:SS.FFF') - UNIX_TIMESTAMP(ts,'YYYY-MM-DD HH24:MI:SS.FFF')%300) as ts_round, avg(val) as avg_val
FROM nikm
GROUP BY 1, 2
ORDER BY 2

Hope that’s what you’re looking for. Based on my previous answer here.

Topic		Replies	Views
Trimming minutes based on some condition	3	1392	August 8, 2022
Retrieve rows based on the latest timestamp for each group	4	3978	January 17, 2023
Milliseconds and Microseconds in TIMESTAMP_ADD	0	1444	July 15, 2021
Dremio truncating floating point values	2	2409	January 19, 2018
Tableau, dremio and mongodb - error: New schema found and recorded. Please reattempt the query. Multiple attempts may be necessary to fully learn the schema	2	1814	November 13, 2018

Resampling/Group By Time Series Data

Related topics