Job automatic retry on failure

allCag · June 14, 2022, 12:36pm

Hello,

we have been experiencing a few job (reflection or query) failures due to external reasons (Eg. error Remote backend is unreachable (502 Bad Gateway)).
The problem in such cases is that the job, such as a long running reflection, is not retried automatically.
Note: Physical datasets set to “Never Refresh” and “Never expire” and launch of PDS dependent reflections done using the API catalog refresh functionality.

Is there a “retry” functionality in Dremio in case of such job failures ?

Rgs,
Alexandre

Benny_Chow · June 16, 2022, 6:33pm

Unfortunately, there is no retry in this case. There’s another outstanding issue where if you query this reflection from the sys.reflections table, it’ll have a status of CANNOT_ACCELERATE_SCHEDULED.

This would make a good improvement since Dremio does retry reflections whose physical dataset have refresh periods configured.

allCag · June 20, 2022, 12:06pm

Tks for your feedback @Benny_Chow

Topic		Replies	Views
Reflection Refresh failed. Dremio does not retry	7	1845	February 5, 2020
Reflections automatic update	1	937	October 1, 2020
Most recent version of the reflection is unreachable	10	1643	March 15, 2021
How to restart the failed reflection automatically	14	2087	January 18, 2021
Dremio 3.2 Refresh API Not working	2	946	June 3, 2019

Job automatic retry on failure

Related topics