Hi,
How do I setup the engine to discard invalid rows on a CSV file? Example would be missing columns etc…
Hi,
How do I setup the engine to discard invalid rows on a CSV file? Example would be missing columns etc…
Can you provide a specific example/sample? You can use SQL to “remove” rows by using a where
filter or a group by
as examples.
I was trying to load using the UI a data lake folder with a set of CSV files all using same structure. Apparently there are some rows on the files that dont have all of the fields (all of the column separators). So it breaks the load of the virtual dataset.