I can think of a couple of ways we might incorporate schema inference:
- We could keep the current workflow of creating a virtual dataset, but use schema inference to make the process easier. For tables with a large number of columns, it can be cumbersome to add all of the type information manually.
- It’s already possible to query a file/directory without first configuring the format settings. In this case, maybe it would make sense to use an inferred schema.
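To make the idea a bit more concrete, here is a rough sketch of what inferring column types from a sample of a delimited file could look like. This is purely illustrative and not how any product does it: the function names `infer_type` and `infer_schema`, the `sample_rows` parameter, and the type names `BIGINT`, `DOUBLE`, and `VARCHAR` are all assumptions made for the example.

```python
import csv
import io


def infer_type(values):
    """Return the narrowest (hypothetical) type name that fits every non-empty value."""
    non_empty = [v for v in values if v != ""]
    if not non_empty:
        # Nothing to go on, so fall back to the most general type.
        return "VARCHAR"
    # Try the narrower type first, then widen.
    for name, cast in (("BIGINT", int), ("DOUBLE", float)):
        try:
            for v in non_empty:
                cast(v)
            return name
        except ValueError:
            continue
    return "VARCHAR"


def infer_schema(text, sample_rows=100):
    """Infer a column-name -> type-name mapping from the first rows of CSV text."""
    reader = csv.reader(io.StringIO(text))
    header = next(reader)
    # Only look at a bounded sample so large files stay cheap to inspect.
    rows = [row for _, row in zip(range(sample_rows), reader)]
    columns = list(zip(*rows)) if rows else [() for _ in header]
    return {name: infer_type(col) for name, col in zip(header, columns)}


sample = "id,price,name\n1,2.5,apple\n2,3,pear\n"
print(infer_schema(sample))
```

Because it only samples the first rows, a sketch like this can guess wrong when later rows contain different values, which is one reason an inferred schema might be better offered as an editable starting point than applied silently.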
Note that I’m just brainstorming here, and this is not in any way meant to be a product roadmap. But I think this forum is a great place to get this sort of feedback.