<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Experiences using managed tables in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/experiences-using-managed-tables/m-p/59478#M31420</link>
    <description>&lt;P&gt;I would recommend using managed tables for table backups and for tables used during notebook data processing that can be dropped at the end of the process, such as staging tables. I have not explored how to copy a managed table from a Dev to a QA environment. In the case of an external table, we can copy the storage folder from the Dev storage account to the QA storage account and re-create the table with its DDL.&lt;/P&gt;</description>
    <pubDate>Tue, 06 Feb 2024 15:37:00 GMT</pubDate>
    <dc:creator>Ramakrishnan83</dc:creator>
    <dc:date>2024-02-06T15:37:00Z</dc:date>
    <item>
      <title>Experiences using managed tables</title>
      <link>https://community.databricks.com/t5/data-engineering/experiences-using-managed-tables/m-p/59427#M31400</link>
      <description>&lt;P&gt;&lt;SPAN&gt;&lt;SPAN class=""&gt;We are looking into using managed tables on Databricks. As this decision won’t be easy to reverse, I am reaching out to all of you fine folks to learn more about your experience with them.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;SPAN class=""&gt;If I understand correctly, we don't have to deal with managing the storage, as Databricks will generate GUIDs for schemas and tables. Readability will be worse on the storage itself (we are using ADLS at the moment), but I don't think that matters much, as we will still have good readability within the Databricks environment.&amp;nbsp;&lt;BR /&gt;Together with the managed tables, we were thinking of using tags along with the built-in metadata so we can build and share the tree structure if needed.&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&lt;SPAN class=""&gt;What are the pros and cons of managed tables?&lt;BR /&gt;What are some things I should look into before deciding?&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 06 Feb 2024 09:37:54 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/experiences-using-managed-tables/m-p/59427#M31400</guid>
      <dc:creator>cltj</dc:creator>
      <dc:date>2024-02-06T09:37:54Z</dc:date>
    </item>
    <item>
      <title>Re: Experiences using managed tables</title>
      <link>https://community.databricks.com/t5/data-engineering/experiences-using-managed-tables/m-p/59455#M31411</link>
      <description>&lt;P&gt;Managed tables are tables that are completely managed by Databricks, i.e. if we drop the table from Databricks, the underlying files are deleted as well.&lt;BR /&gt;Ideally, they should be used in the following cases:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;If&amp;nbsp;&lt;SPAN&gt;you have temporary data that is not critical to your long-term storage or analysis.&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;If you have ad-hoc analysis scenarios where data is not required to persist beyond the scope of the analysis, you can use managed tables.&lt;/SPAN&gt;&lt;/LI&gt;&lt;LI&gt;&lt;SPAN&gt;&lt;STRONG&gt;If multiple users or teams need to access and work with the same table, it's recommended to use external tables&lt;/STRONG&gt; instead of managed tables. External tables provide more flexibility in terms of data sharing and access control.&lt;/SPAN&gt;&lt;/LI&gt;&lt;/UL&gt;</description>
      <pubDate>Tue, 06 Feb 2024 11:31:29 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/experiences-using-managed-tables/m-p/59455#M31411</guid>
      <dc:creator>Hkesharwani</dc:creator>
      <dc:date>2024-02-06T11:31:29Z</dc:date>
    </item>
    <item>
      <title>Re: Experiences using managed tables</title>
      <link>https://community.databricks.com/t5/data-engineering/experiences-using-managed-tables/m-p/59458#M31413</link>
      <description>&lt;P&gt;Thanks for your response &lt;a href="https://community.databricks.com/t5/user/viewprofilepage/user-id/99364"&gt;@Hkesharwani&lt;/a&gt;&amp;nbsp;&lt;BR /&gt;In what scenario would we need to drop tables? Can't we just avoid granting drop-table privileges to our analysts, superusers and users?&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Our current thinking is that we will manage access and the data lifecycle anyway.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;In addition, can't we just use the UNDROP command within 7 days? (We are using UC.)&lt;BR /&gt;&lt;A href="https://docs.databricks.com/en/sql/language-manual/sql-ref-syntax-ddl-undrop-table.html" target="_blank"&gt;UNDROP TABLE | Databricks on AWS&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 06 Feb 2024 11:58:34 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/experiences-using-managed-tables/m-p/59458#M31413</guid>
      <dc:creator>cltj</dc:creator>
      <dc:date>2024-02-06T11:58:34Z</dc:date>
    </item>
    <item>
      <title>Re: Experiences using managed tables</title>
      <link>https://community.databricks.com/t5/data-engineering/experiences-using-managed-tables/m-p/59476#M31419</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;SPAN class=""&gt;&lt;A class="" href="https://community.databricks.com/t5/user/viewprofilepage/user-id/94868" target="_self"&gt;&lt;SPAN class=""&gt;cltj,&lt;/SPAN&gt;&lt;/A&gt;&lt;BR /&gt;As I mentioned, you would drop tables when the data only needs to be kept temporarily. And yes, you can grant the team only the access it requires.&lt;BR /&gt;I believe the privileges reference at&amp;nbsp;&lt;A href="https://docs.databricks.com/en/sql/language-manual/sql-ref-privileges.html" target="_blank" rel="noopener"&gt;https://docs.databricks.com/en/sql/language-manual/sql-ref-privileges.html&lt;/A&gt;&amp;nbsp;will be a great help for you.&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 06 Feb 2024 14:49:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/experiences-using-managed-tables/m-p/59476#M31419</guid>
      <dc:creator>Hkesharwani</dc:creator>
      <dc:date>2024-02-06T14:49:07Z</dc:date>
    </item>
    <item>
      <title>Re: Experiences using managed tables</title>
      <link>https://community.databricks.com/t5/data-engineering/experiences-using-managed-tables/m-p/59478#M31420</link>
      <description>&lt;P&gt;I would recommend using managed tables for table backups and for tables used during notebook data processing that can be dropped at the end of the process, such as staging tables. I have not explored how to copy a managed table from a Dev to a QA environment. In the case of an external table, we can copy the storage folder from the Dev storage account to the QA storage account and re-create the table with its DDL.&lt;/P&gt;</description>
      <pubDate>Tue, 06 Feb 2024 15:37:00 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/experiences-using-managed-tables/m-p/59478#M31420</guid>
      <dc:creator>Ramakrishnan83</dc:creator>
      <dc:date>2024-02-06T15:37:00Z</dc:date>
    </item>
    <item>
      <title>Re: Experiences using managed tables</title>
      <link>https://community.databricks.com/t5/data-engineering/experiences-using-managed-tables/m-p/101538#M40714</link>
      <description>&lt;P&gt;Databricks recommends that you &lt;STRONG&gt;ALWAYS use Managed Tables&lt;/STRONG&gt; UNLESS:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Your tables are not Delta&lt;/LI&gt;&lt;LI&gt;You explicitly need to have the table files in a specific location&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Managed Tables are just better... Databricks manages:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;the upgrades (Deletion Vectors? Column Mapping? If the tables are managed, you will get those)&lt;/LI&gt;&lt;LI&gt;the layout (optimal number of files, optimal clustering to accelerate queries... all with Predictive Optimization)&lt;/LI&gt;&lt;LI&gt;things like renaming a table, dropping and undropping&lt;/LI&gt;&lt;LI&gt;plus observability and other cool stats provided by UC&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;There should be NO need whatsoever for users to know WHERE the table data is stored. Any operation on a table should be done on the table itself through the Metastore, not on the files.&lt;/P&gt;</description>
      <pubDate>Tue, 10 Dec 2024 02:09:09 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/experiences-using-managed-tables/m-p/101538#M40714</guid>
      <dc:creator>JimmyEatBrick</dc:creator>
      <dc:date>2024-12-10T02:09:09Z</dc:date>
    </item>
  </channel>
</rss>

