Dremio unbalance worker load

i9um0 · December 13, 2020, 3:41pm

Hi all,

our dremio cluster didn’t work as our expectation. we had 5 worker and only 1 worker used memory until 77% , other nodes only use under 10% of memory. is there any missing config that i was set. i set dremio-env MAX_HEAP=8GB and MAX_DIRECT=128GB.

balaji.ramaswamy · December 14, 2020, 4:52am

@i9um0

Are all nodes configured with the same hep and direct, sometimes due to data skew processing might target to single node, are you able to share with us the Dremio query profile

Share Dremio Query Profile

Thanks
Bali

i9um0 · December 14, 2020, 8:56am

Hi Bali,

All nodes have same config, is there any strategy to mitigate skew processing since those process are most ly joining 3-6 tables using single key smartphone number ?

balaji.ramaswamy · December 15, 2020, 6:55am

@i9um0

Send us the profile and we can see if there is anything that can be done, data skew due to filter may be tough to resolve

i9um0 · December 15, 2020, 7:30am

Hi Bali,

I need permission from my internal team to send profile. I just curious about dremio internal process. did it work like other distributed system that always shuffle the data using hash to distribute process?

balaji.ramaswamy · December 16, 2020, 7:03am

@i9um0

Based on the size of the tables joined, it can be either a shuffle or a broadcast

Topic		Replies	Views
Optimization and scaling question	2	1825	April 16, 2019
What is the memory config I should set for each node in the Dremio cluster	1	996	April 6, 2022
How to scale up dremio cluster?	4	592	September 5, 2023
Query Workload Distribution and Cost Calculation in Dremio	8	100	January 25, 2025
Configuring Dremio Memory Limits on Kubernetes	3	874	November 30, 2022

Dremio unbalance worker load

Related topics