Im running my own research on utilization of dremio into our production data analytics. During my research, I found an issue that not sure how to solve it.
I notice that dremio unable to load data from elasticsearch that contain date with timezone. Below are the error :
Below are the format for documentSet data.iso_8601 from ElasticSearch side :
I’ve found a link that seems to be related to this issue Elasticsearch datetime problems but didn’t seem to get response on that.
So far now I’m keen to use dremio as middle man between ElasticSearch as main data lake and client BI tools. Therefore I would highly appreciate if your end have any suggestion or recommendation on above issue if it’s possible to be fix.
We have couple of enhancements we are working on to address this issue. We are actively working on getting our next MAJOR release 3.1 out and then will probably prioritize this into one of the future releases.
Would like to check with you, if i need to manipulate the datetime format from the source before storing into elasticsearch, could you share some example what kind of format that can be accepted from dremio?
Depending on which Elasticsearch version you’re using, you may have documents under the same mapping with different date formats. In early versions, you can update date format on an existing date field (see https://github.com/elastic/elasticsearch/issues/25271)
My apologies for not looking closely enough at the top of the thread. I notice that the value in the example Elastic document you provided includes “UTC” in the iso_86901 field. It looks like Elastic parses this fine, but let me verify that Dremio does as well, and I’ll get back to you.