cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
cancel
Showing results for 
Search instead for 
Did you mean: 

Auto Loader for Shape File

JD2
Contributor

Hello:

As you can see from below link, that it support 7 file formats. I am dealing with GeoSpatial Shape files and I want to know if Auto Loader can support Shape Files ???

Any help on this is greatly appreciated.

Thanks.

https://docs.microsoft.com/en-us/azure/databricks/spark/latest/structured-streaming/auto-loader-gen2...

avro: Avro file

 

binaryFile: Binary file

csv: CSV file

 

json: JSON file

orc: ORC file

 

parquet: Parquet file

* text: Text file

1 ACCEPTED SOLUTION

Accepted Solutions

dbkent
New Contributor III
New Contributor III

Hi @Jay DAVE​ ,

Currently, shapefiles are not a supported file-type when using auto-loader. Would you be willing to share more about your use case? I am the Product Manager responsible for Geospatial in Databricks, and I need help from customers like you to better understand what you are doing with spatial data, how often you refresh the data, how big it is, etc. Any information you can share would be great!

Kent Marten

PM Databricks

Kent Marten | Product Manager | DBSQL

View solution in original post

6 REPLIES 6

Kaniz
Community Manager
Community Manager

Hi @ JayDAVE! My name is Kaniz, and I'm the technical moderator here. Great to meet you, and thanks for your question! Let's see if your peers on the Forum have an answer to your questions first. Or else I will follow up shortly with a response.

dbkent
New Contributor III
New Contributor III

Hi @Jay DAVE​ ,

Currently, shapefiles are not a supported file-type when using auto-loader. Would you be willing to share more about your use case? I am the Product Manager responsible for Geospatial in Databricks, and I need help from customers like you to better understand what you are doing with spatial data, how often you refresh the data, how big it is, etc. Any information you can share would be great!

Kent Marten

PM Databricks

Kent Marten | Product Manager | DBSQL

Hello Kent:

Thanks for your reply. we receive .shape files from Satellites, LiDAR, Drones etc... consist of Geometry data along with other data points. this helps to get data insights based on geo location of assets where we do further EDA in Machine Learning.

It depends on requirements but data will be refresh weekly.

FYI ..... if you read this link, you will get better idea about all Oil & Gas companies are going to do with this .shape files.

https://scottpeters.house.gov/media-center/in-the-news/biden-signs-methane-reduction-resolution-by-r...

Thanks

dbkent
New Contributor III
New Contributor III

@Jay DAVE​ ,

How are you solving your analysis with geo-data today?

Are you using a GIS or spatial-ETL tool?

If you could upload shapefiles, what else would you want to do with that data? Run spatial operations against it -- like spatially joining your point datasets to your boundaries?

Can I email you and ask more questions 🙂

Kent Marten

PM Databricks

Kent Marten | Product Manager | DBSQL

-werners-
Esteemed Contributor III

You could try to use the binary file type. But the disadvantage of this is that the content of the shape files will be put into a column, that might not be what you want.

If you absolutely want to use the autoloader, maybe some thinking outside the box can help.

What if you convert the shape files to geojson or topojson (this is not hard to do) and then use the json file format for autoloader? I have not tried this but it might just work, and in the mean time Kent can do his thing 🙂

Hello Werners:

Thanks for your reply, i agree to an extent but shape files are best way to handle geometry\geography data. converting\translating is a careful consideration keeping data integrity & corruption in a mind.

For now I am using Azure event grid & function to automate processing of shape files.

Thanks

Welcome to Databricks Community: Lets learn, network and celebrate together

Join our fast-growing data practitioner and expert community of 80K+ members, ready to discover, help and collaborate together while making meaningful connections. 

Click here to register and join today! 

Engage in exciting technical discussions, join a group with your peers and meet our Featured Members.