Reflections on hive external tables

Sneha_Krishnaswamy · December 27, 2018, 9:43am

Hi,

I’m using hive external tables created on parquet files on s3 as my physical data source. Will there be any benefit of creating raw reflections on s3 in dremio?

Also, how is the reflection refreshed? Is it refreshed completely every interval or does it identify the changed records and only refreshes the changes?

anthony · December 27, 2018, 2:12pm

You will see more performance benefits with aggregation reflections - https://docs.dremio.com/acceleration/creating-reflections.html#aggregation-reflections

The reflections are refreshed via a time interval or by using our REST API. It can either do a complete refresh or, if datasource permits, an incremental refresh - https://docs.dremio.com/acceleration/updating-reflections.html

Sneha_Krishnaswamy · December 28, 2018, 2:06am

Is there a way to create partial raw reflection? What I mean is, suppose I’ve a table with data from 2000 to 2018 and I only want to create reflection for 2018 data. Can I do that?

anthony · December 28, 2018, 2:46am

Yes an example is you would create a VDS with a query that applies that date filter (select * from table where year = 2018) and create a raw reflection on that.

Topic		Replies	Views
How does reflection work	1	60	September 14, 2024
Update partition of a reflection	19	2975	December 9, 2021
External_query() reflection refresh schedule - what is it?	2	121	June 27, 2024
How to orchestrate reflection refresh incrementally in dremio-oss incase of overwriting partition in a datapipeline	2	590	October 25, 2023
Reflection and datasource scan	11	1132	May 21, 2021

Reflections on hive external tables

Related topics