From time-to-time we face issues with Dremio not being able to read parquet files that other tools (like spark, pandas) can handle easily.
And the bad thing is, that the error messages we got from Dremio aren’t helpful at all. This also true for other errors.
e.g.: “Error in parquet reader (complex). Message: Failure in setting up reader Parquet Metadata:”
My wish…
- Please set parquet compatibility as a higher priority for development.
- Please provide error messages that speak to your developer AND the Dremio users.
Many thanks
Thomas
ben
2
Hi @thomasz,
Thanks for reporting this. Is it possible to share some examples of these files?
Hi @ben , I can share parquet files that Dremio can not read. What is the best way to send them to you ?
Hi @dfleckinger If they do not contain any private content, can you please upload them to this thread? I will create internal tickets as required.
Hi @Praveen_Kumar, they do contain private content.
I have sent them by email to @ben and @balaji.ramaswamy privately in the thread Regression in parquet reader in version 3.3.1