I have an Elasticsearch database sucking in data from a Trac ticket system once an hour. I set up a Dremio connection to this ES database and enabled raw reflections.
After the next hour has gone by and new data is loaded, the columns get all mixed up.
Before:
(redacted some data)
After:
It is also interesting that the id column is now recognized as a string instead of a number. Indeed, when I try to query SELECT id FROM <virtual dataset>, I get an "Unexpected error occurred" message. Looking at the job report, this was the error:
Error
AssertionError: Type mismatch:
rowtype of new rel:
RecordType(VARCHAR(65536) CHARACTER SET "ISO-8859-1" COLLATE "ISO-8859-1$en_US$primary" id) NOT NULL
rowtype of set:
RecordType(BIGINT id) NOT NULL
Any idea what is happening here? I can provide query profiles etc. One of the reasons my company wants to deploy Dremio is to be able to work with our Elasticsearch data as new documents are added and schemas are allowed to change, so this is definitely a breaking issue for us.
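
In case it helps narrow things down, one thing that could be checked is whether the source documents themselves carry id with mixed types (a number in some batches, a string in others). That is only a guess at the cause, not something the error confirms. Below is a minimal sketch of such a check over already-fetched documents; the function names and the sample documents are hypothetical and not taken from my actual index.

```python
from collections import defaultdict


def field_types(docs):
    """Map each top-level field name to the set of Python type names seen for it."""
    seen = defaultdict(set)
    for doc in docs:
        for field, value in doc.items():
            seen[field].add(type(value).__name__)
    return dict(seen)


def mixed_type_fields(docs):
    """Return only the fields that appear with more than one type."""
    return {f: sorted(t) for f, t in field_types(docs).items() if len(t) > 1}


# Hypothetical sample: one load sends id as a number, a later one as a string.
sample_docs = [
    {"id": 101, "summary": "first ticket"},
    {"id": "102", "summary": "second ticket"},
]

print(mixed_type_fields(sample_docs))  # {'id': ['int', 'str']}
```

If a check like this came back empty for the real documents, the type change would more likely be happening somewhere between Elasticsearch and the reflection rather than in the data itself.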