Distributed Storage Questions

summersmd · June 20, 2019, 6:18pm

What are the following folders used for:
accelerator - location of reflection files?
downloads - ?
results - ?
scratch - ?
uploads - location of uploaded files?

How can I tell which reflection file in the accelerators folder belongs to which VDS? Is there a table or file where I can reference this information? If I wanted to open a specific reflection parquet file in say, python, how would I know which file to open?

ben · June 27, 2019, 11:07pm

Hi @summersmd,

accelerator/ - this stores the reflection materialization files. You’ll see directory for each reflection id with subdirectories for each materialization id. Within those will be parquet files.
downloads/ - if you click the download button when viewing a dataset, the query is run results in the chosen format (CSV, JSON or Parquet) are written to this directory.
results/ - the results of executing queries, in Arrow buffer format, are written here. You should see directories associated with each job id. These eventually expire and get cleaned up.
scratch/ - the out-of-the-box directory we provide so you can run CREATE TABLE $scratch.<your new table> AS ... queries to… create tables. This is where the materialized results are stored as parquet files. See https://docs.dremio.com/sql-reference/sql-commands/tables.html.
uploads/ - where user uploaded files go.

How can I tell which reflection file in the accelerators folder belongs to which VDS? Is there a table or file where I can reference this information?

sys.reflections, will give you a record for each reflection ID which contains a field for the dataset to which it corresponds.

If I wanted to open a specific reflection parquet file in say, python, how would I know which file to open?

As mentioned above, accelerator/ contains directories with the following structure:

<reflection id>/<materialization id>/<partition1>/<partition2>..../<file>.parquet

Join sys.reflections with sys.materializations tables to locate the appropriate directories in accelerator/ where your reflection parquet files will be found.

You can also run a query in Dremio like this:

"__accelerator"."<reflection id>"."<materialization id>"

… and download the results (though this will be limited to 1000 records)

Topic		Replies	Views
Matching VDS to refelction parquet file	7	1285	October 30, 2022
Critical : Reflection provides wrong results	3	1082	April 27, 2018
Determining S3 path of reflection files	3	1141	April 5, 2022
Different distributed storage location for cache locations	6	2501	May 18, 2018
Multiple copies of reflection files	5	242	May 26, 2024

Distributed Storage Questions

Related topics