01-17-2024 01:15 PM
Hello,
Is there a way to read .xls files using auto loader or is there any workaround since excel files are not supported by the auto loader per the following document?
https://docs.databricks.com/en/ingestion/auto-loader/options.html
Thanks.
01-17-2024 02:27 PM
@YS1 - Can you please check if you can convert .xls to .csv files and ingest using autoloader?
01-17-2024 02:49 PM
@Isha I tried to read them directly as csv files using different delimiters but the files weren't parsed correctly. The problem is that you have to check for the files that weren't read by the auto loader and then convert them to csv which is not the target here of the auto loader. My main goal is to read the files as they arrive to my folder without the need to check which one was read before.
01-17-2024 04:21 PM
You can try to read the file as text file, using "wholeText" option https://docs.databricks.com/en/ingestion/auto-loader/options.html#text-options and then the whole file contents will be passed to the from_xml function and it will return an array of the records. Give it a try.
01-18-2024 08:00 AM
07-03-2024 10:45 AM
I am facing the same issue--I have a stream that I'd like to use autoloader on with an .xlsx. Is there any update to any workarounds on this issue?
07-05-2024 09:26 AM
I couldn't find any. I had to change the type of the files.
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.
If there isn’t a group near you, start one and help create a community that brings people together.
Request a New Group