<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: What is the best strategy for backing up a large Databricks Delta table that is stored in Azure blob storage? in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/what-is-the-best-strategy-for-backing-up-a-large-databricks/m-p/26749#M18761</link>
    <description>&lt;P&gt;Deep Clone should do a good job for taking back up of delta tables.&lt;/P&gt;</description>
    <pubDate>Tue, 01 Mar 2022 07:41:25 GMT</pubDate>
    <dc:creator>AmanSehgal</dc:creator>
    <dc:date>2022-03-01T07:41:25Z</dc:date>
    <item>
      <title>What is the best strategy for backing up a large Databricks Delta table that is stored in Azure blob storage?</title>
      <link>https://community.databricks.com/t5/data-engineering/what-is-the-best-strategy-for-backing-up-a-large-databricks/m-p/26748#M18760</link>
      <description>&lt;P&gt;I have a large delta table that I would like to back up and I am wondering what is the best practice for backing it up. &lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;The goal is so that if there is any accidental corruption or data loss either at the Azure blob storage level or within Databricks itself I can restore the data.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Is using the Azure blob "&lt;B&gt;Point-in-time&lt;/B&gt;" restore features appropriate? On paper, it sounds like it has all the features I require. However, what is the downstream effect of using it on a delta table and will weekly OPTIMIZE cause rewrites of the data and blow out the costs?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;In other &lt;A href="https://docs.microsoft.com/en-us/azure/databricks/administration-guide/disaster-recovery#:~:text=use%20Deep%20Clone%20for%20Delta%20Tables" alt="https://docs.microsoft.com/en-us/azure/databricks/administration-guide/disaster-recovery#:~:text=use%20Deep%20Clone%20for%20Delta%20Tables" target="_blank"&gt;Azure/Databricks documentation&lt;/A&gt;, there was mention of using &lt;B&gt;Deep Clone &lt;/B&gt;for data replication.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Any thoughts appreciated.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 01 Mar 2022 05:56:37 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/what-is-the-best-strategy-for-backing-up-a-large-databricks/m-p/26748#M18760</guid>
      <dc:creator>deisou</dc:creator>
      <dc:date>2022-03-01T05:56:37Z</dc:date>
    </item>
    <item>
      <title>Re: What is the best strategy for backing up a large Databricks Delta table that is stored in Azure blob storage?</title>
      <link>https://community.databricks.com/t5/data-engineering/what-is-the-best-strategy-for-backing-up-a-large-databricks/m-p/26749#M18761</link>
      <description>&lt;P&gt;Deep Clone should do a good job for taking back up of delta tables.&lt;/P&gt;</description>
      <pubDate>Tue, 01 Mar 2022 07:41:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/what-is-the-best-strategy-for-backing-up-a-large-databricks/m-p/26749#M18761</guid>
      <dc:creator>AmanSehgal</dc:creator>
      <dc:date>2022-03-01T07:41:25Z</dc:date>
    </item>
    <item>
      <title>Re: What is the best strategy for backing up a large Databricks Delta table that is stored in Azure blob storage?</title>
      <link>https://community.databricks.com/t5/data-engineering/what-is-the-best-strategy-for-backing-up-a-large-databricks/m-p/26750#M18762</link>
      <description>&lt;P&gt;You can also set some copy process in Azure Data Factory&lt;/P&gt;</description>
      <pubDate>Tue, 01 Mar 2022 08:25:25 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/what-is-the-best-strategy-for-backing-up-a-large-databricks/m-p/26750#M18762</guid>
      <dc:creator>Hubert-Dudek</dc:creator>
      <dc:date>2022-03-01T08:25:25Z</dc:date>
    </item>
    <item>
      <title>Re: What is the best strategy for backing up a large Databricks Delta table that is stored in Azure blob storage?</title>
      <link>https://community.databricks.com/t5/data-engineering/what-is-the-best-strategy-for-backing-up-a-large-databricks/m-p/26751#M18763</link>
      <description>&lt;P&gt;big advantage of file based storage (compared to rdmbs): copy/paste &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 01 Mar 2022 08:39:24 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/what-is-the-best-strategy-for-backing-up-a-large-databricks/m-p/26751#M18763</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-03-01T08:39:24Z</dc:date>
    </item>
    <item>
      <title>Re: What is the best strategy for backing up a large Databricks Delta table that is stored in Azure blob storage?</title>
      <link>https://community.databricks.com/t5/data-engineering/what-is-the-best-strategy-for-backing-up-a-large-databricks/m-p/26752#M18764</link>
      <description>&lt;P&gt;Hi @deisou​&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Just wanted to check in if you were able to resolve your issue. If yes, would you be happy to mark the answer as best? If not, please tell us so we can help you.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Cheers!&lt;/P&gt;&lt;P&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 27 Apr 2022 16:33:07 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/what-is-the-best-strategy-for-backing-up-a-large-databricks/m-p/26752#M18764</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2022-04-27T16:33:07Z</dc:date>
    </item>
  </channel>
</rss>

