<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Ingesting and Transforming NetCDF Data in Delta Table on Databricks Cluster in Get Started Discussions</title>
    <link>https://community.databricks.com/t5/get-started-discussions/ingesting-and-transforming-netcdf-data-in-delta-table-on/m-p/105554#M9520</link>
    <description>&lt;P&gt;Thanks! Will proceed with custom containers then.&lt;/P&gt;</description>
    <pubDate>Tue, 14 Jan 2025 10:46:38 GMT</pubDate>
    <dc:creator>OlehSemeniuk</dc:creator>
    <dc:date>2025-01-14T10:46:38Z</dc:date>
    <item>
      <title>Ingesting and Transforming NetCDF Data in Delta Table on Databricks Cluster</title>
      <link>https://community.databricks.com/t5/get-started-discussions/ingesting-and-transforming-netcdf-data-in-delta-table-on/m-p/105534#M9518</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I need to ingest and transform historical climate data into a Delta table. The data is stored in NetCDF (.nc) format. Working with this format in Python requires specific C libraries, along with particular versions of Python libraries (e.g., numpy).&lt;/P&gt;&lt;P&gt;On my local machine, I resolved this with Anaconda, which installed the necessary libraries (xarray, netCDF4) and handled all dependencies seamlessly.&lt;/P&gt;&lt;P&gt;However, I'm running into a problem when trying to do the same on a Databricks cluster: upgrading certain libraries (e.g., numpy) causes dependency conflicts that break the cluster's functionality.&lt;/P&gt;&lt;P&gt;I came across the &lt;A href="https://docs.databricks.com/en/compute/custom-containers.html#enable" target="_new" rel="noopener"&gt;Databricks Container Service&lt;/A&gt;, which seems to allow this kind of customization through custom containers.&lt;/P&gt;&lt;P&gt;Is this the only way to install xarray and netCDF4 and to upgrade pre-installed libraries? Are there alternative approaches that handle this without compromising the cluster's stability?&lt;/P&gt;&lt;P&gt;Any help or guidance would be much appreciated!&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Tue, 14 Jan 2025 08:20:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/ingesting-and-transforming-netcdf-data-in-delta-table-on/m-p/105534#M9518</guid>
      <dc:creator>OlehSemeniuk</dc:creator>
      <dc:date>2025-01-14T08:20:25Z</dc:date>
    </item>
    <item>
      <title>Re: Ingesting and Transforming NetCDF Data in Delta Table on Databricks Cluster</title>
      <link>https://community.databricks.com/t5/get-started-discussions/ingesting-and-transforming-netcdf-data-in-delta-table-on/m-p/105551#M9519</link>
      <description>&lt;P&gt;Using custom containers is generally the most stable and flexible way to ensure that all dependencies are managed correctly and do not interfere with the cluster's functionality.&lt;/P&gt;</description>
      <pubDate>Tue, 14 Jan 2025 10:14:53 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/ingesting-and-transforming-netcdf-data-in-delta-table-on/m-p/105551#M9519</guid>
      <dc:creator>Walter_C</dc:creator>
      <dc:date>2025-01-14T10:14:53Z</dc:date>
    </item>
    <item>
      <title>Re: Ingesting and Transforming NetCDF Data in Delta Table on Databricks Cluster</title>
      <link>https://community.databricks.com/t5/get-started-discussions/ingesting-and-transforming-netcdf-data-in-delta-table-on/m-p/105554#M9520</link>
      <description>&lt;P&gt;Thanks! Will proceed with custom containers then.&lt;/P&gt;</description>
      <pubDate>Tue, 14 Jan 2025 10:46:38 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/ingesting-and-transforming-netcdf-data-in-delta-table-on/m-p/105554#M9520</guid>
      <dc:creator>OlehSemeniuk</dc:creator>
      <dc:date>2025-01-14T10:46:38Z</dc:date>
    </item>
    <item>
      <title>Re: Ingesting and Transforming NetCDF Data in Delta Table on Databricks Cluster</title>
      <link>https://community.databricks.com/t5/get-started-discussions/ingesting-and-transforming-netcdf-data-in-delta-table-on/m-p/105566#M9521</link>
      <description>&lt;P&gt;Great, please let us know if you need any assistance.&lt;/P&gt;</description>
      <pubDate>Tue, 14 Jan 2025 12:38:46 GMT</pubDate>
      <guid>https://community.databricks.com/t5/get-started-discussions/ingesting-and-transforming-netcdf-data-in-delta-table-on/m-p/105566#M9521</guid>
      <dc:creator>Walter_C</dc:creator>
      <dc:date>2025-01-14T12:38:46Z</dc:date>
    </item>
  </channel>
</rss>

