I have a issue with Dremio, very often happens to have one or more Workers in Provisioning or Disconnected giving me errors like that:
Unable to acquire queue resources for query within timeout. Timeout for large queue was set at 300 seconds.
Exceeded timeout (45000) while waiting send intermediate work fragments to remote nodes. Sent 9 and only heard response back from 0 nodes
I use Dremio (Build 3.0.1-201811132128360291-804fe82, Community Edition) connected to Hive.
How can i solve my problem ?
I can’t stop and restart the workers every day.