I need to know the following queries on Dremio
- Support for multiple dataset using regex
- Customization of field size for json files.
- Optimization and improve performance while fetching data from dataset
- More details on Dremio reflections
- If any change in db schema, how dremio will be in sync with that schema
- Dremio can infer the schema and is there a possiblity that we can provide schema file
@ balaji.ramaswamy
Thanks for addressing most of the queries on Dremio, I appreciate your responses. I would like to know on first 2 queries.
-
I would like to query all the json file that matches the regex pattern(with same schema) in a given directory/directories. I explored and found that we can use * as the regex but its not working as expected and option was to make the folder as dataset which doesn’t support some of my use cases. Can you suggest the way to query tables using the regex.
assuming this structure:
folder/user1/{trip1.json, trips2.json, trips3.json……}
folder/user2/{trip1.json, trips2.json, trips3.json……}
folder/user3/{trip1.json, trips2.json, trips3.json……}
-
When loading the json files, I encounter issues like “Attempting to read a too large value for field with name data. Size was 35486 but limit was 32000” due to which I am unable to load those json files.
@shrikanth
#2 is a Dremio limit and we currently support only upto 32K on field width
#1 For Regex, below is our docs
https://docs.dremio.com/sql-reference/sql-functions/string.html