Zero downtime maintenance

jacek · November 3, 2020, 12:17pm

Hi Dremians,
Can you clarify which operations can be performed with no downtime?
I am interested in Kubernetes deployments.
Is it possible to redeploy a K8s worker node that hosts Dremio pods with zero downtime?
Is it possible to upgrade the Dremio version without any downtime (rolling upgrade)?
If a short downtime is required, do you have any recommendations for making an application running on top of Dremio resilient (retry mechanisms, …)?

Regards

Jacek

balaji.ramaswamy · November 3, 2020, 10:44pm

@jacek

When you redeploy worker nodes, actively running queries will be cancelled. Adding worker nodes should not cause any running queries to fail, but removing workers may cause some queries to be cancelled.
Currently, Dremio does not offer rolling upgrades.
Reflections can have a retry limit set, but user queries cannot, so the application would have to retry once Dremio is up again.
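
Since user queries are not retried automatically, the retry has to live in the application. Below is a minimal sketch of one way to do that, using exponential backoff. It is not a Dremio API: `run_with_retry` and `submit_query` are hypothetical names, and the exception types treated as retryable are an assumption. Check which errors your client (JDBC, ODBC, Arrow Flight, REST) actually raises when a query is cancelled or the coordinator is unreachable.

```python
import time


def run_with_retry(submit_query, max_attempts=5, base_delay=1.0, max_delay=30.0,
                   retryable=(ConnectionError, TimeoutError), sleep=time.sleep):
    """Call submit_query() until it succeeds or max_attempts is reached.

    submit_query is a hypothetical zero-argument callable that runs one
    query through whatever client the application uses and returns its result.
    The retryable exception types are an assumption; adjust them to the
    errors your client raises during a Dremio restart.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return submit_query()
        except retryable:
            if attempt == max_attempts:
                # Out of attempts: surface the last error to the caller.
                raise
            # Exponential backoff: base_delay, 2x, 4x, ... capped at max_delay.
            delay = min(max_delay, base_delay * 2 ** (attempt - 1))
            sleep(delay)
```

This is only safe for queries that can be re-run without side effects (typically reads). The delays should be sized so that the total retry window covers the expected restart or upgrade time.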
