cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Unity Catalog metastore is down error

alesventus
Contributor

When I want to run notebook in databricks all queries, saves and read take really long and I found error message in the clusters event log that says: Metastore is down. So, I think cluster is not able to connect to the metastore right now. Could be this the reason? I tried to create another cluster but with the same result. Any help on this? Unity catalog`s Metastore is on ADLS gen2. and we use premium workspace.

After few minutes the command is finished. But simple load of delta table with few records takes 5 minutes or so.

Thanks

5 REPLIES 5

Kaniz_Fatma
Community Manager
Community Manager

Hi @alesventusYes, the cluster may becannot connect to the metastore, which could be the reason for the slow performance and error messages. You can check the eventNodeDiskSpace metric in usage_logs to see if the root file system of the driver's host VM is getting 100%, which can cause the cluster to go down.

Additionally, it is recommended to convert the table to a delta table to improve performance in Unity Catalog. You can also try increasing the number of user connections from the RDS and performing instance-type upgrades.

If the issue persists, you can check the node daemon log of the problematic instance and raise the case to the Dataplane team.

Hi Kaniz. Thanks for reply. Just want to add some information. Im not sure if this error message is related to legacy Hive metastore or to Unity Catalog. We use UC from the beginning. Our metastore is on external storage - see screenshot.

This error occurs in the clusters event log exactly 6 minutes after the start of the cluster. Every time we start the cluster that error is there.

alesventus_0-1690439546311.png

 

karthik_p
Esteemed Contributor

@alesventus what is size of your data and what type of cluster size you are using. also is your workspace region and metastore region same

Giri-Patcham
Contributor

@alesventus I think "Metastore is down"  here is related to the legacy Hive metastore. You can find the stack trace in the driver logs. 
In your case do you need Hivemetastore?

alesventus
Contributor

This issue is solely related to the VNET. Azure engineer must set up connection within VNET correctly. 

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group