Metadata management bottleneck

shragj · April 8, 2021, 12:27am

We are running a cluster with a few thousand relatively small datasets that need to be refreshed at a relatively high rate (every 15, or even 5, minutes).

We are seeing issues with scaling the metadata refresh, and we are becoming concerned that Dremio clusters are bottlenecked on refreshes due to the single-master architecture, or more specifically the use of RocksDB, which only allows a single writer.

Please advise.

balaji.ramaswamy · April 8, 2021, 6:32am

@shragj

Have we narrowed down the cause to the writes to RocksDB? The delay usually comes from waiting on the source (S3, HDFS, etc.) to provide the metadata information. How did we confirm?

shragj · April 8, 2021, 1:32pm

Even if the bottleneck is currently not RocksDB, the concern is that, unlike read access during query planning, which can be scaled horizontally across multiple coordinators, metadata refresh only scales vertically. There's a limit to the I/O bandwidth a single master coordinator can achieve. Using RocksDB as the metadata store precludes horizontally scaling metadata refresh, since RocksDB is a single-writer store.
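
To put rough numbers on that concern, here is a back-of-envelope sketch. It assumes 3000 datasets (a hypothetical stand-in for "a few thousand") and, as a worst case, that refreshes are fully serialized behind one writer. That serialization is an assumption for illustration, not confirmed Dremio behavior, and `refresh_budget` is a made-up helper name.

```python
def refresh_budget(num_datasets: int, interval_s: float, writers: int = 1) -> dict:
    """Estimate the refresh load implied by a periodic refresh schedule.

    Assumes every dataset is refreshed exactly once per interval and that
    each writer handles refreshes one at a time (worst-case assumption).
    """
    if num_datasets <= 0 or interval_s <= 0 or writers <= 0:
        raise ValueError("all arguments must be positive")
    return {
        # Sustained refreshes per second needed to keep up with the schedule.
        "refreshes_per_s": num_datasets / interval_s,
        # Longest average time one refresh may take before falling behind.
        "max_avg_refresh_s": writers * interval_s / num_datasets,
    }


if __name__ == "__main__":
    # 3000 is a hypothetical dataset count; 15 and 5 minutes come from the post.
    for minutes in (15, 5):
        b = refresh_budget(3000, minutes * 60)
        print(
            f"{minutes:>2} min interval: "
            f"{b['refreshes_per_s']:.2f} refreshes/s, "
            f"at most {b['max_avg_refresh_s']:.2f} s per refresh"
        )
```

Under these assumptions, a 5-minute interval leaves about a tenth of a second per dataset end to end, including the time spent waiting on the source, so the budget can only be widened by doing refreshes in parallel.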

With that, we’re still looking at what is bottlenecking metadata refreshes in the configuration we’re currently using.

balaji.ramaswamy · April 9, 2021, 5:36am

@shragj Keep us updated on what you find, and let us know if you need help. Currently there is no horizontal scaling for the master.
