Improve error messages / documentation around scanning hive directories #17436
Labels
A-io-partitioning
Area: reading/writing (Hive) partitioned files
accepted
Ready for implementation
enhancement
New feature or an improvement of an existing feature
P-medium
Priority: medium
Description
Some Hive datasets don't work out of the box when the directory is passed to `scan_parquet`, because they contain non-data files. The error messages we currently print in that case are very cryptic (e.g. `parquet: File out of specification: The file must end with PAR1`). We should check the file extensions and raise a better error message if we see mixed extensions.
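
As a rough illustration of the proposed check, here is a minimal Python sketch that walks a directory and raises a descriptive error when it finds files with mixed extensions. The function name `check_uniform_extensions` and the wording of the message are hypothetical and are not part of Polars; the actual check would live inside the scan path rather than in user code.

```python
from collections import Counter
from pathlib import Path


def check_uniform_extensions(root: str) -> None:
    """Raise ValueError if the files under `root` do not all share one extension.

    Hypothetical helper for illustration only; not a Polars API.
    """
    # Count files per extension, recursing into Hive partition directories.
    # Files without a suffix (e.g. `_SUCCESS`) are grouped under a placeholder.
    counts = Counter(
        p.suffix or "<no extension>"
        for p in Path(root).rglob("*")
        if p.is_file()
    )
    if len(counts) > 1:
        summary = ", ".join(f"{ext} ({n})" for ext, n in sorted(counts.items()))
        raise ValueError(
            f"directory {root!r} contains files with mixed extensions: {summary}. "
            "Remove the non-data files or pass a glob pattern such as "
            f"'{root}/**/*.parquet' instead of the directory."
        )
```

With a check like this, a directory containing both `data.parquet` and a marker file such as `_SUCCESS` would fail early with a message naming the offending extensions, instead of the `PAR1` error from the Parquet reader.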