<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Auto Loader for Shape File in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/auto-loader-for-shape-file/m-p/17575#M11562</link>
    <description>&lt;P&gt;Hello Kent:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks for your reply. We receive shapefiles from satellites, LiDAR, drones, etc. They consist of geometry data along with other data points. This helps us get insights based on the geolocation of assets, and we then do further EDA for machine learning.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;It depends on requirements, but the data will be refreshed weekly.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;FYI: if you read this link, you will get a better idea of what all the Oil &amp;amp; Gas companies are going to do with these shapefiles.&lt;/P&gt;&lt;P&gt;&lt;A href="https://scottpeters.house.gov/media-center/in-the-news/biden-signs-methane-reduction-resolution-by-rep-scott-peters" target="test_blank"&gt;https://scottpeters.house.gov/media-center/in-the-news/biden-signs-methane-reduction-resolution-by-rep-scott-peters&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
    <pubDate>Thu, 30 Sep 2021 04:50:32 GMT</pubDate>
    <dc:creator>JD2</dc:creator>
    <dc:date>2021-09-30T04:50:32Z</dc:date>
    <item>
      <title>Auto Loader for Shape File</title>
      <link>https://community.databricks.com/t5/data-engineering/auto-loader-for-shape-file/m-p/17571#M11558</link>
      <description>&lt;P&gt;Hello:&lt;/P&gt;
&lt;P&gt;As you can see from the link below, Auto Loader supports 7 file formats. I am dealing with geospatial shapefiles, and I want to know whether Auto Loader can support them.&lt;/P&gt;
&lt;P&gt;Any help on this is greatly appreciated.&lt;/P&gt;
&lt;P&gt;Thanks.&lt;/P&gt;
&lt;P&gt;&lt;A href="https://docs.microsoft.com/en-us/azure/databricks/spark/latest/structured-streaming/auto-loader-gen2#common-options" target="test_blank"&gt;https://docs.microsoft.com/en-us/azure/databricks/spark/latest/structured-streaming/auto-loader-gen2#common-options&lt;/A&gt;&lt;/P&gt;
&lt;UL&gt;&lt;LI&gt;avro: Avro file&lt;/LI&gt;&lt;LI&gt;binaryFile: Binary file&lt;/LI&gt;&lt;LI&gt;csv: CSV file&lt;/LI&gt;&lt;LI&gt;json: JSON file&lt;/LI&gt;&lt;LI&gt;orc: ORC file&lt;/LI&gt;&lt;LI&gt;parquet: Parquet file&lt;/LI&gt;&lt;LI&gt;text: Text file&lt;/LI&gt;&lt;/UL&gt;</description>
      <pubDate>Tue, 20 Jul 2021 02:16:43 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/auto-loader-for-shape-file/m-p/17571#M11558</guid>
      <dc:creator>JD2</dc:creator>
      <dc:date>2021-07-20T02:16:43Z</dc:date>
    </item>
    <item>
      <title>Re: Auto Loader for Shape File</title>
      <link>https://community.databricks.com/t5/data-engineering/auto-loader-for-shape-file/m-p/17573#M11560</link>
      <description>&lt;P&gt;Hi @Jay DAVE,&lt;/P&gt;&lt;P&gt;Currently, shapefiles are not a supported file type in Auto Loader. Would you be willing to share more about your use case? I am the Product Manager responsible for Geospatial in Databricks, and I need help from customers like you to better understand what you are doing with spatial data, how often you refresh the data, how big it is, etc. Any information you can share would be great!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Kent Marten&lt;/P&gt;&lt;P&gt;PM Databricks&lt;/P&gt;</description>
      <pubDate>Mon, 27 Sep 2021 18:46:44 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/auto-loader-for-shape-file/m-p/17573#M11560</guid>
      <dc:creator>dbkent</dc:creator>
      <dc:date>2021-09-27T18:46:44Z</dc:date>
    </item>
    <item>
      <title>Re: Auto Loader for Shape File</title>
      <link>https://community.databricks.com/t5/data-engineering/auto-loader-for-shape-file/m-p/17574#M11561</link>
      <description>&lt;P&gt;You could try the binaryFile format. The disadvantage is that the content of each shapefile will be put into a single column, which might not be what you want.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;If you absolutely want to use Auto Loader, maybe some thinking outside the box can help.&lt;/P&gt;&lt;P&gt;What if you convert the shapefiles to GeoJSON or TopoJSON (this is not hard to do) and then use the json file format with Auto Loader? I have not tried this, but it might just work, and in the meantime Kent can do his thing &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
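<!--
A minimal sketch of the convert-to-GeoJSON idea in the reply above, not something tested in this thread. It assumes the shapefiles are converted ahead of time to newline-delimited GeoJSON Features and that Auto Loader then reads them with the json format. The helper names (to_geojson_lines, read_shapefile, read_geojson_stream), the sample asset, and the use of the third-party pyshp package are illustrative assumptions, not part of the original posts.

```python
import json
from typing import Any, Dict, Iterable, Iterator, Tuple

Geometry = Dict[str, Any]
Properties = Dict[str, Any]


def to_geojson_lines(shapes: Iterable[Tuple[Geometry, Properties]]) -> Iterator[str]:
    """Yield one GeoJSON Feature per line (newline-delimited JSON)."""
    for geometry, properties in shapes:
        feature = {"type": "Feature", "geometry": geometry, "properties": properties}
        yield json.dumps(feature, sort_keys=True)


def read_shapefile(path: str) -> Iterator[Tuple[Geometry, Properties]]:
    """Read (geometry, attributes) pairs from a shapefile.

    Assumption: the third-party pyshp package (module name: shapefile) is installed.
    """
    import shapefile  # imported lazily so the rest of the sketch runs without pyshp

    reader = shapefile.Reader(path)
    for shape_record in reader.shapeRecords():
        yield shape_record.shape.__geo_interface__, shape_record.record.as_dict()


def read_geojson_stream(spark, input_path: str, schema_path: str):
    """Point Auto Loader at the converted files (hypothetical paths, needs Databricks)."""
    return (
        spark.readStream.format("cloudFiles")
        .option("cloudFiles.format", "json")
        .option("cloudFiles.schemaLocation", schema_path)
        .load(input_path)
    )


# Hypothetical sample record standing in for a parsed shapefile row.
sample = [({"type": "Point", "coordinates": [-95.37, 29.76]}, {"asset_id": "well-001"})]
lines = list(to_geojson_lines(sample))
```

Writing one Feature per line fits the line-delimited JSON that the json reader expects by default. Whether the conversion preserves everything in the original shapefile (projection, attribute types) would need to be checked per dataset.
-->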
      <pubDate>Wed, 29 Sep 2021 08:56:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/auto-loader-for-shape-file/m-p/17574#M11561</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2021-09-29T08:56:25Z</dc:date>
    </item>
    <item>
      <title>Re: Auto Loader for Shape File</title>
      <link>https://community.databricks.com/t5/data-engineering/auto-loader-for-shape-file/m-p/17575#M11562</link>
      <description>&lt;P&gt;Hello Kent:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks for your reply. We receive shapefiles from satellites, LiDAR, drones, etc. They consist of geometry data along with other data points. This helps us get insights based on the geolocation of assets, and we then do further EDA for machine learning.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;It depends on requirements, but the data will be refreshed weekly.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;FYI: if you read this link, you will get a better idea of what all the Oil &amp;amp; Gas companies are going to do with these shapefiles.&lt;/P&gt;&lt;P&gt;&lt;A href="https://scottpeters.house.gov/media-center/in-the-news/biden-signs-methane-reduction-resolution-by-rep-scott-peters" target="test_blank"&gt;https://scottpeters.house.gov/media-center/in-the-news/biden-signs-methane-reduction-resolution-by-rep-scott-peters&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Thu, 30 Sep 2021 04:50:32 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/auto-loader-for-shape-file/m-p/17575#M11562</guid>
      <dc:creator>JD2</dc:creator>
      <dc:date>2021-09-30T04:50:32Z</dc:date>
    </item>
    <item>
      <title>Re: Auto Loader for Shape File</title>
      <link>https://community.databricks.com/t5/data-engineering/auto-loader-for-shape-file/m-p/17576#M11563</link>
      <description>&lt;P&gt;Hello Werners:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks for your reply. I agree to an extent, but shapefiles are the best way to handle geometry/geography data. Converting or translating them needs careful consideration, keeping data integrity and the risk of corruption in mind.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;For now I am using Azure Event Grid &amp;amp; Functions to automate processing of the shapefiles.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Thu, 30 Sep 2021 04:55:48 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/auto-loader-for-shape-file/m-p/17576#M11563</guid>
      <dc:creator>JD2</dc:creator>
      <dc:date>2021-09-30T04:55:48Z</dc:date>
    </item>
    <item>
      <title>Re: Auto Loader for Shape File</title>
      <link>https://community.databricks.com/t5/data-engineering/auto-loader-for-shape-file/m-p/17577#M11564</link>
      <description>&lt;P&gt;@Jay DAVE,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;How are you solving your analysis with geo-data today?&lt;/P&gt;&lt;P&gt;Are you using a GIS or spatial-ETL tool?&lt;/P&gt;&lt;P&gt;If you could upload shapefiles, what else would you want to do with that data? Would you run spatial operations against it, like spatially joining your point datasets to your boundaries?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Can I email you and ask more questions? &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Kent Marten&lt;/P&gt;&lt;P&gt;PM Databricks&lt;/P&gt;</description>
      <pubDate>Thu, 30 Sep 2021 16:19:45 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/auto-loader-for-shape-file/m-p/17577#M11564</guid>
      <dc:creator>dbkent</dc:creator>
      <dc:date>2021-09-30T16:19:45Z</dc:date>
    </item>
    <item>
      <title>Re: Auto Loader for Shape File</title>
      <link>https://community.databricks.com/t5/data-engineering/auto-loader-for-shape-file/m-p/120461#M46172</link>
      <description>&lt;P&gt;Kent, we have the same use case. I will probably reach out to Bryan and ask him to schedule a call to go over our requirements.&lt;/P&gt;</description>
      <pubDate>Wed, 28 May 2025 17:48:41 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/auto-loader-for-shape-file/m-p/120461#M46172</guid>
      <dc:creator>Ramana</dc:creator>
      <dc:date>2025-05-28T17:48:41Z</dc:date>
    </item>
  </channel>
</rss>

