Reflection refresh issue: Blocked on upstream

vpontello · December 15, 2020, 8:25pm

When we perform the Reflection refresh, it is noticed that some reflections are updated extremely fast and other ones take pretty much longer to finish the procedure (for some it is taken seconds to be done and for others circa 30 minutes). When we analyzed the logs, it was realized that the main issue is almost always the “Blocked on upstream” issue. Can you give us some advice on how to deal with this issue properly? Maybe some tips on optimizing the refresh configuration to prevent this issue to happen. It is important to mention that the PDS are being loaded from a Glue Catalog.

balaji.ramaswamy · December 16, 2020, 8:15am

@vpontello

Blocked on downstream just means one of the Dremio phases is waiting on another phase to complete. This can be CPU or IO bound. The job profile should tells us where the bottleneck is. Are you able to share the query profile with us?

vpontello · December 16, 2020, 1:09pm

Hi, Balaji;
of course. Thanks a lot for the attention,
7066f05b-7ef5-4efe-9145-2a8ace643380.zip (19,0,KB)

balaji.ramaswamy · December 18, 2020, 8:10am

@vpontello

You have couple of issues

Wait time on Glue is about 9 minutes, for 420,000 records you have splits=[57850]), and since all these are remote reads, see operator metrics on HIVE_SUB_SCAN (open job profile) there is significant IO wait. You have C3 turned on , see operator metrics on HIVE_SUB_SCAN, column “NUM_CACHE_HITS” and “NUM_CACHE_MISSES” but nothing is reading of the C3 cache. Has this been configured righ? See C3 documentation below

CPU contention - The second issue is there is significant sleep time (CPU) on phase 1, expand phase 1 and see column “waiting”, are there other queries running. Do you have plans on adding on more executor. If this happens again, send us the new profile and server.log from the Dremio executor

Topic		Replies	Views
Blocked on upstream taking too long for some queries	4	1642	February 20, 2021
HIVE Queries taking more Time on Blocked on Upstream	4	1327	July 20, 2020
Refresh is very slowing time	5	1107	October 16, 2022
Identical SQL Querys - Profile Comparison	5	850	February 22, 2023
Dremio CE hangs on reflection refresh Dremio University	3	936	October 16, 2022

Reflection refresh issue: Blocked on upstream

Related topics