cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Governance
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Can we assume the path to the managed tables in the hive_metastore is reliable?

giohappy
New Contributor III

Managed tables are stored under the /user/hive/warehouse, which is also mentioned in the documentation

In our workflow, we use that path to read the parquet files from outside (through databricks connector). Can we assume this path is reliable, or is it an "implementation detail" that might change at any time?

1 ACCEPTED SOLUTION

Accepted Solutions

Kaniz
Community Manager
Community Manager

While the current documentation mentions that managed tables are stored under the /user/hive/warehouse path, let's stick to it until any official announcements come up for any updates or changes to implementation details.

View solution in original post

5 REPLIES 5

Kaniz
Community Manager
Community Manager

Hi @giohappyThe path /user/hive/warehouse is commonly used as the default location for managed tables, according to the documentation.

 

giohappy
New Contributor III

Yes, that link was also mentioned in my question. The point is if our pipeline can always assume that the path is where the parquet files for the managed tables are expected to be, or it's just an internal detail that could change at any time.

Kaniz
Community Manager
Community Manager

Hi @giohappy, By default, managed tables are stored in the root storage location you configure when creating a metastore. Optionally specify managed table storage locations at the catalog or schema levels, overriding the root storage location. Managed tables always use the Delta table format.

When a managed table is dropped, its underlying data is deleted from your cloud tenant within 30 days.

See Managed tables.

 

giohappy
New Contributor III

In our case we haven't configured or created the metastore directly. We're relying on the default metastore, which is where the tables are written when we do:

df.write.format("delta").mode("overwrite").saveAsTable(output_table_name)

I haven't found anything saying that the path of the default metastore might change unexpectedly. By the way, I don't even found something stating the opposite ๐Ÿ™‚

Kaniz
Community Manager
Community Manager

While the current documentation mentions that managed tables are stored under the /user/hive/warehouse path, let's stick to it until any official announcements come up for any updates or changes to implementation details.