Hi there,
I’m rebuilding our monitoring system for Dremio.
Are there any common error keywords I should capture from the log files, so that I can raise an alert for the team to look into?
Right now I have a few:
- Full GC (Allocation Failure) → a JVM full GC was triggered
- ERROR Fabric Channel closed → the connection between nodes was closed
- java.lang.OutOfMemoryError: → OOM
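
For context, here is a minimal sketch of the kind of matching I have in mind, in Python. It only does plain substring checks on the three keywords above. The alert labels and function names (`match_alerts`, `scan_log`) are hypothetical, made up for illustration, and nothing here assumes a particular Dremio log layout.

```python
# Keywords copied from the list above, mapped to hypothetical alert labels.
ALERT_KEYWORDS = {
    "Full GC (Allocation Failure)": "jvm_full_gc",
    "ERROR Fabric Channel closed": "fabric_channel_closed",
    "java.lang.OutOfMemoryError:": "out_of_memory",
}


def match_alerts(line):
    """Return the alert labels whose keyword appears in a single log line."""
    return [label for keyword, label in ALERT_KEYWORDS.items() if keyword in line]


def scan_log(lines):
    """Yield (line_number, label, text) for every keyword hit in an iterable of lines."""
    for number, line in enumerate(lines, start=1):
        for label in match_alerts(line):
            yield number, label, line.rstrip("\n")
```

The idea would be to feed it the lines of a log file (for example `scan_log(open(path))`, where `path` is whichever log file is being watched) and forward each hit to the alerting system.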
Are there any other keywords I should pay attention to?
Much appreciated.