One thing to keep in mind: Data Reflections can help keep the analytics on one side of the network. For example, you could run Dremio in your Hadoop cluster and have the reflections live in HDFS, or you could deploy Dremio on AWS and have the reflections live in S3. If you’re always joining across AWS and on-prem, the data has to cross the network between the two sites on every query, so you’re going to have non-trivial network latency and your experience might be so-so.