<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Data Engineering with Databricks 3.1.12 - Unable to run Classroom-Setup-01.2 in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/data-engineering-with-databricks-3-1-12-unable-to-run-classroom/m-p/89481#M37821</link>
    <description>&lt;P&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/120188"&gt;@JR61276126&lt;/a&gt;&amp;nbsp;,&amp;nbsp;&lt;/P&gt;&lt;P&gt;Yeah, just like I thought. And to answer your second question. Datbricks is hosting hive metastore in MySQL database. So that's why you need to add an outbound connection to it&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 11 Sep 2024 13:39:04 GMT</pubDate>
    <dc:creator>szymon_dybczak</dc:creator>
    <dc:date>2024-09-11T13:39:04Z</dc:date>
    <item>
      <title>Data Engineering with Databricks 3.1.12 - Unable to run Classroom-Setup-01.2</title>
      <link>https://community.databricks.com/t5/data-engineering/data-engineering-with-databricks-3-1-12-unable-to-run-classroom/m-p/89376#M37772</link>
      <description>&lt;P&gt;Receiving the following error when attempting to run the classroom setup for lesson 1.2 of the Data Engineering with Databricks 3.1.12.&amp;nbsp;&lt;BR /&gt;This has been tested with multiple accounts, both admins and non-admins.&lt;/P&gt;&lt;P&gt;Below is the error message I am receiving.&amp;nbsp;&lt;/P&gt;&lt;DIV&gt;&lt;PRE&gt;&lt;SPAN&gt;%&lt;/SPAN&gt;&lt;SPAN&gt;run .&lt;/SPAN&gt;&lt;SPAN&gt;/&lt;/SPAN&gt;&lt;SPAN&gt;Includes&lt;/SPAN&gt;&lt;SPAN&gt;/&lt;/SPAN&gt;&lt;SPAN&gt;Classroom&lt;/SPAN&gt;&lt;SPAN&gt;-&lt;/SPAN&gt;&lt;SPAN&gt;Setup&lt;/SPAN&gt;&lt;SPAN&gt;-&lt;/SPAN&gt;&lt;SPAN&gt;01.2&lt;/SPAN&gt;&lt;/PRE&gt;&lt;/DIV&gt;&lt;P&gt;&lt;STRONG&gt;&lt;SPAN class=""&gt;AnalysisException: &lt;/SPAN&gt;org.apache.hadoop.hive.ql.metadata.HiveException: java.lang.RuntimeException: Unable to instantiate org.apache.hadoop.hive.metastore.HiveMetaStoreClient&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;My databricks workspace is deployed in Azure with VNET injection.&amp;nbsp;&lt;/P&gt;&lt;P&gt;I believe the issue to be with access to the hive_metastore.&amp;nbsp; If viewing the hive_metastore in the Catalog Explorer, it is unable to see the default schema using compute in my tenant (both SQL Warehouse and All-Purpose Compute) but I can view the default schema when using Serverless SQL Warehouse.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;I found the following post with a similar issue and attempted to run the commands suggested without success.&amp;nbsp;&lt;A href="https://community.databricks.com/t5/data-engineering/unable-to-instantiate-hive-meta-store-client/td-p/49877" target="_blank" rel="noopener"&gt;https://community.databricks.com/t5/data-engineering/unable-to-instantiate-hive-meta-store-client/td-p/49877&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;&lt;STRONG&gt;%sh sudo service hive-metastore status&lt;/STRONG&gt;&lt;/PRE&gt;&lt;P&gt;&lt;STRONG&gt;Unit hive-metastore.service could not be found.&lt;/STRONG&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 10 Sep 2024 21:05:27 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-engineering-with-databricks-3-1-12-unable-to-run-classroom/m-p/89376#M37772</guid>
      <dc:creator>JR61276126</dc:creator>
      <dc:date>2024-09-10T21:05:27Z</dc:date>
    </item>
    <item>
      <title>Re: Data Engineering with Databricks 3.1.12 - Unable to run Classroom-Setup-01.2</title>
      <link>https://community.databricks.com/t5/data-engineering/data-engineering-with-databricks-3-1-12-unable-to-run-classroom/m-p/89377#M37773</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/120188"&gt;@JR61276126&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Since your workspace is deployed in azure with vent injection I assume it might be a network/firewall related issue. Could you check your driver logs also?&lt;/P&gt;</description>
      <pubDate>Tue, 10 Sep 2024 21:16:48 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-engineering-with-databricks-3-1-12-unable-to-run-classroom/m-p/89377#M37773</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2024-09-10T21:16:48Z</dc:date>
    </item>
    <item>
      <title>Re: Data Engineering with Databricks 3.1.12 - Unable to run Classroom-Setup-01.2</title>
      <link>https://community.databricks.com/t5/data-engineering/data-engineering-with-databricks-3-1-12-unable-to-run-classroom/m-p/89480#M37820</link>
      <description>&lt;P&gt;Hello&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/110502"&gt;@szymon_dybczak&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Looking at the driver logs, it appears to be an issue connecting to&amp;nbsp;consolidated-eastusc3-prod-metastore-0.mysql.database.azure.com.&lt;BR /&gt;&lt;BR /&gt;In researching this endpoint, I found the following document outlining access points for Azure Databricks with that being the endpoint for the Metastore.&amp;nbsp;&lt;A href="https://learn.microsoft.com/en-us/azure/databricks/resources/ip-domain-region" target="_blank"&gt;IP addresses and domains for Azure Databricks services and assets - Azure Databricks | Microsoft Learn&lt;/A&gt;&lt;BR /&gt;I understand that to resolve the issue, I need to open access to that endpoint, but I first have a question of why it needs to connect to a MySQL endpoint and what is stored there?&amp;nbsp; We implemented VNET injection because we want to keep our instance private, but if data is being stored outside of our tenant, that is a potential risk.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 11 Sep 2024 13:17:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-engineering-with-databricks-3-1-12-unable-to-run-classroom/m-p/89480#M37820</guid>
      <dc:creator>JR61276126</dc:creator>
      <dc:date>2024-09-11T13:17:07Z</dc:date>
    </item>
    <item>
      <title>Re: Data Engineering with Databricks 3.1.12 - Unable to run Classroom-Setup-01.2</title>
      <link>https://community.databricks.com/t5/data-engineering/data-engineering-with-databricks-3-1-12-unable-to-run-classroom/m-p/89481#M37821</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/120188"&gt;@JR61276126&lt;/a&gt;&amp;nbsp;,&amp;nbsp;&lt;/P&gt;&lt;P&gt;Yeah, just like I thought. And to answer your second question. Datbricks is hosting hive metastore in MySQL database. So that's why you need to add an outbound connection to it&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 11 Sep 2024 13:39:04 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-engineering-with-databricks-3-1-12-unable-to-run-classroom/m-p/89481#M37821</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2024-09-11T13:39:04Z</dc:date>
    </item>
    <item>
      <title>Re: Data Engineering with Databricks 3.1.12 - Unable to run Classroom-Setup-01.2</title>
      <link>https://community.databricks.com/t5/data-engineering/data-engineering-with-databricks-3-1-12-unable-to-run-classroom/m-p/89648#M37875</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/110502"&gt;@szymon_dybczak&lt;/a&gt;,&amp;nbsp;is there any way around that?&amp;nbsp; We do not plan to use hive_metastore in favor of Unity Catalog, but we need it for the purpose of allowing our staff to go through the Databricks provided learning content.&amp;nbsp;&lt;/P&gt;&lt;P&gt;If we would open this port to allow connections to this endpoint, what type of data is stored there?&amp;nbsp; I would need to be able to justify this change to our security teams and therefore need to better understand the purpose and content within that endpoint.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 12 Sep 2024 14:55:57 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-engineering-with-databricks-3-1-12-unable-to-run-classroom/m-p/89648#M37875</guid>
      <dc:creator>JR61276126</dc:creator>
      <dc:date>2024-09-12T14:55:57Z</dc:date>
    </item>
    <item>
      <title>Re: Data Engineering with Databricks 3.1.12 - Unable to run Classroom-Setup-01.2</title>
      <link>https://community.databricks.com/t5/data-engineering/data-engineering-with-databricks-3-1-12-unable-to-run-classroom/m-p/89652#M37877</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/120188"&gt;@JR61276126&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;In metastore they will only store metadata, like columns names and data types. Your actual data are stored on your storage account. If this is only for learning purposes I would say you have nothing to worry about (from security perspective)&lt;/P&gt;</description>
      <pubDate>Thu, 12 Sep 2024 15:11:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/data-engineering-with-databricks-3-1-12-unable-to-run-classroom/m-p/89652#M37877</guid>
      <dc:creator>szymon_dybczak</dc:creator>
      <dc:date>2024-09-12T15:11:54Z</dc:date>
    </item>
  </channel>
</rss>

