Dremio - Data Lake Engine

I just watched the webinar “Using a Data Lake Engine to Create a Scalable and Lightning-Fast Data Pipeline”. I would like to know how the Dremio Data Lake Engine replaces ETL data pipelines built with AWS Glue, EMR, PySpark, and other third-party tools.

Any insights would be greatly appreciated.



For ETL-type workloads that are SQL-based, yes, you can use Dremio. You can run CTAS (CREATE TABLE AS SELECT) jobs that select from a source through Dremio and write Parquet back to the lake, and then run interactive, BI, and ad-hoc queries in Dremio on the generated Parquet files.
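
To make the CTAS step concrete, here is a minimal sketch in Python that only assembles the SQL text for such a job. The table paths (`lake.raw.orders`, `lake.curated.orders_clean`), the column names, and the `build_ctas` helper are all hypothetical, made up for illustration; how you submit the statement to Dremio (SQL editor, JDBC/ODBC, or another client) is left out, and whether the output lands as Parquet depends on how the target source is configured in your environment.

```python
from typing import List, Optional


def build_ctas(target_table: str,
               source_table: str,
               columns: List[str],
               where: Optional[str] = None) -> str:
    """Assemble a CREATE TABLE ... AS SELECT statement as plain text.

    All names passed in are placeholders; nothing here talks to Dremio.
    """
    # Fall back to SELECT * when no explicit column list is given.
    select_list = ", ".join(columns) if columns else "*"
    sql = f"CREATE TABLE {target_table} AS SELECT {select_list} FROM {source_table}"
    # Optional filter, i.e. the "transform" part of a simple SQL-based ETL step.
    if where:
        sql += f" WHERE {where}"
    return sql


# Hypothetical example: copy a cleaned subset of a raw table to a curated location.
ctas_sql = build_ctas(
    target_table="lake.curated.orders_clean",
    source_table="lake.raw.orders",
    columns=["order_id", "customer_id", "amount"],
    where="amount > 0",
)
print(ctas_sql)
```

Once a statement like this has run, the curated table can be queried from BI tools or ad hoc, which is the second half of the workflow described above.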

Hope this helps.