<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to update external metastore cluster configuration on the fly ? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/how-to-update-external-metastore-cluster-configuration-on-the/m-p/29730#M21437</link>
    <description>&lt;P&gt;Hi @oliv vier​&amp;nbsp; as per our doc this can be achieved  by only through &lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Spark config&lt;/LI&gt;&lt;LI&gt;Init script&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;So I think , on the fly it won't work. Thanks. But may be you can have this as feature request to our product team.&lt;/P&gt;</description>
    <pubDate>Sat, 12 Feb 2022 16:36:21 GMT</pubDate>
    <dc:creator>Atanu</dc:creator>
    <dc:date>2022-02-12T16:36:21Z</dc:date>
    <item>
      <title>How to update external metastore cluster configuration on the fly ?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-update-external-metastore-cluster-configuration-on-the/m-p/29728#M21435</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In my use case, my data is pushed to an adls gen2 container called ingest&lt;/P&gt;&lt;P&gt;After some data processing on a databricks cluster of the ingest workspace, I declare the associated table in an external metastore for this workspace&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;At the end of this processing (according to certain criteria) I push the curated data (a simple copy) to other containers (lab/qal/prd, each container contains data for a databricks workspace)&lt;/P&gt;&lt;P&gt;&amp;nbsp;and I want to declare the metastores for these 3 workspaces.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;One solution is to launch 3 tasks after this first task. Each cluster associated with these tasks is configured with the metastore of each databricks workspace. It works but this solution is cumbersome:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;need to start a cluster for each workspace&lt;/LI&gt;&lt;LI&gt;even if the table is already declared in the metastore, you have to start the cluster to check.&lt;/LI&gt;&lt;LI&gt;slow down our data process&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Another solution could be to update the cluster configuration on the fly in the first task. I tried to modify the spark session configuration with the above lines of code:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;spark.sparkContext.getConf().set("spark.hadoop.javax.jdo.option.ConnectionURL","jdbc:sqlserver://lab_env.database.windows.net:1433;database=labdatabase")&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Or&lt;/P&gt;&lt;PRE&gt;&lt;CODE&gt;spark.conf.set("spark.hadoop.javax.jdo.option.ConnectionURL","jdbc:sqlserver://lab_env.database.windows.net:1433;database=labdatabase")&lt;/CODE&gt;&lt;/PRE&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;but it seems that it doesn't work.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;My question is simple : Do you know if there is a way to change this configuration in a notebook, or if it is not possible at all&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thanking you in advance for your help&lt;/P&gt;</description>
      <pubDate>Thu, 03 Feb 2022 09:05:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-update-external-metastore-cluster-configuration-on-the/m-p/29728#M21435</guid>
      <dc:creator>Oliver_Floyd</dc:creator>
      <dc:date>2022-02-03T09:05:57Z</dc:date>
    </item>
    <item>
      <title>Re: How to update external metastore cluster configuration on the fly ?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-update-external-metastore-cluster-configuration-on-the/m-p/29730#M21437</link>
      <description>&lt;P&gt;Hi @oliv vier​&amp;nbsp; as per our doc this can be achieved  by only through &lt;/P&gt;&lt;OL&gt;&lt;LI&gt;Spark config&lt;/LI&gt;&lt;LI&gt;Init script&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;So I think , on the fly it won't work. Thanks. But may be you can have this as feature request to our product team.&lt;/P&gt;</description>
      <pubDate>Sat, 12 Feb 2022 16:36:21 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-update-external-metastore-cluster-configuration-on-the/m-p/29730#M21437</guid>
      <dc:creator>Atanu</dc:creator>
      <dc:date>2022-02-12T16:36:21Z</dc:date>
    </item>
    <item>
      <title>Re: How to update external metastore cluster configuration on the fly ?</title>
      <link>https://community.databricks.com/t5/data-engineering/how-to-update-external-metastore-cluster-configuration-on-the/m-p/29731#M21438</link>
      <description>&lt;P&gt;Hello @Atanu Sarkar​&amp;nbsp;,&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Thank you for your answer. I have created a feature request. I hope, it will be soon accepted  ^^&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 07 Mar 2022 08:16:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/how-to-update-external-metastore-cluster-configuration-on-the/m-p/29731#M21438</guid>
      <dc:creator>Oliver_Floyd</dc:creator>
      <dc:date>2022-03-07T08:16:27Z</dc:date>
    </item>
  </channel>
</rss>

