Understanding raw profile

dacopan · January 30, 2024, 8:25pm

Hello guys, can you give me some documentation or explanation of how to interpret the profile in an advanced way?

For example, in PostgreSQL we can see that a query is slow when buffers are read from disk and fast when they are already cached by the OS.

Does Dremio have similar documentation that explains each part of the raw profile in detail?

8523abb1-db11-4f1b-9f6d-c4fd1b8c8c50.zip (40,9 KB)

I’ve attached an example query profile; I’d like to understand why it is “slow”.

balaji.ramaswamy · January 31, 2024, 6:52am

@dacopan Not currently, this is something that will be done soon.

In your case, TABLE_FUNCTION 02-xx-21 is what is taking up almost 11s of the query run time. Since you have a single executor and it is only a 4-core machine, there is a delay reading 24 million records across 450 files.
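
As a rough back-of-envelope check on those numbers, here is a minimal Python sketch. It uses only the figures quoted above and assumes the scan is spread evenly across all 4 cores, which the profile does not guarantee; the variable names are illustrative only.

```python
# Figures quoted in the reply above (approximate).
records = 24_000_000      # records read by the scan
files = 450               # files read
cores = 4                 # single executor with 4 cores
scan_seconds = 11         # time attributed to TABLE_FUNCTION 02-xx-21

# Average size of the work units.
records_per_file = records / files
files_per_core = files / cores          # assumes an even split across cores

# Implied throughput of the scan.
records_per_second = records / scan_seconds
records_per_second_per_core = records_per_second / cores

print(f"records per file:          {records_per_file:,.0f}")
print(f"files per core:            {files_per_core:,.1f}")
print(f"records/s (whole scan):    {records_per_second:,.0f}")
print(f"records/s per core:        {records_per_second_per_core:,.0f}")
```

With more cores or more executors the per-core share of files would shrink, which is the reason the small machine shows up as delay in this operator.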

dacopan · January 31, 2024, 4:00pm

Thank you @balaji.ramaswamy, I’ll wait for this guide; it’ll be a powerful tool for optimizing queries.
In some cases, such as mine, the query is generated by Power BI. We only select a specific date, “2023-11-30”, and the table is partitioned by date, so the result should be approximately 700k records. Is it correct that Dremio first scans 24M records before applying the date filter?
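
To put a number on that gap, a small Python sketch comparing the records scanned with the records expected for one date partition. Both figures are the approximate ones from this thread, and `overscan_factor` is a hypothetical helper, not a Dremio API. A large factor only suggests that the date filter may not have pruned partitions; it does not prove it.

```python
def overscan_factor(scanned: int, expected: int) -> float:
    """Return how many times more records were scanned than expected."""
    if expected <= 0:
        raise ValueError("expected must be positive")
    return scanned / expected


scanned_records = 24_000_000   # reported for the scan in the profile
expected_records = 700_000     # approximate size of the 2023-11-30 partition

factor = overscan_factor(scanned_records, expected_records)
print(f"scanned about {factor:.1f}x the expected records")
```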

dotjdk · February 2, 2024, 5:49am

You can learn a lot by looking at the Apache Drill query profile documentation. It is not a 1:1 mapping, but it is helpful for understanding the overall concepts.

https://drill.apache.org/docs/query-profiles/

Carsten_Hufe · February 6, 2024, 9:22am

There is a whitepaper available, explaining how profiles can be interpreted:

dacopan · February 6, 2024, 2:17pm

Thanks @dotjdk @Carsten_Hufe, this will help me too.
