Query plan spending lots of time in Dremio 4.0

dcmsdancosta · October 17, 2019, 8:26pm

Hello everyone.

I’m trying to optimize the total execution time in the following scenario:

PDSs* -> Base table (VDS) -> Derived table (VDS, more than 20 left joins in base table)
*PDSs are in parquet files located in S3

We’ve tested some combinations:

Base table with reflection and Derived table with reflection
Base table materialized with CTAS and Derived table with reflection

The total execution time is in the best combination (base table w/ CTAS and derived table w/ reflection is the following: query plan 17 and execution 5s = total 22s.

For my purpose, the response time goal must be much less than 10s, and I believe the query plan
is making it impossible. On the other hand, use CTAS for derived tables isn’t and option due to managing complexity.

b3f7f5bc-451c-4620-b354-c4cc1766455c.zip (115,8,KB)

Is there a way of reducing and optimizing the query plan?

Cheers,

Danilo from DataSprints

dcmsdancosta · October 21, 2019, 12:24pm

Hi there,

We haven’t found a solution for this yet.

Any tips, comments, best practices are really welcome.

Thank you.

balaji.ramaswamy · October 21, 2019, 1:22pm

@dcmsdancosta

It looks like we ar spending time on the materialization (reflection replacement plans), to confirm that, kindly turn on verbose planning and send us the profile

admin-support-support key-“planner.verbose_profile”, show - enable - save

Also try to do the below and see if the planning time comes down?

admin-support-support key-“accelerator.enable_agg_join”-disable-save

dcmsdancosta · October 21, 2019, 2:22pm

@balaji.ramaswamy we’ve tested with your suggestions.

The verbose profile can be reached here: https://drive.google.com/open?id=1YGUw0P1Ewh-h1mEVtoAQQ5WYOpH-Zwlk

Thank you

dcmsdancosta · October 22, 2019, 3:07pm

Hi Balaji,

Have you read the files that I’ve shared?

Cheers,

Danilo

dcmsdancosta · October 25, 2019, 11:20am

Hello,

Balaji, do you have any suggestions on that? Things still the same.

Thank you,

Danilo

Topic		Replies	Views
Query execution time vs accelerations	5	2908	April 16, 2018
Too much time planning	4	96	September 25, 2024
Logical planning phase too slow	12	2175	March 25, 2019
Planning phase slow compared to execution	16	2000	August 14, 2020
Dremio Planning & Running Time is very high	1	995	December 3, 2020

Query plan spending lots of time in Dremio 4.0

Related topics