<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: delta table storage in Data Engineering</title>
    <link>https://community.databricks.com/t5/data-engineering/delta-table-storage/m-p/19383#M12977</link>
    <description>&lt;P&gt;If you do not define a storage location yourself, data is stored as managed tables, meaning in the blob storage of the Databricks subscription (which resides on the cloud provider you use).&lt;/P&gt;&lt;P&gt;If you have your own blob storage or data lake, you can (but don't have to) write your data there, as unmanaged tables.&lt;/P&gt;&lt;P&gt;Basically, you can store it anywhere you want in the cloud, as long as Databricks can access it.&lt;/P&gt;&lt;P&gt;DBFS is a semantic layer on top of the actual storage that makes working with files easier.&lt;/P&gt;&lt;P&gt;So if you have mounted three blob storage locations, for example, you can write to any of the three.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Converting to Delta in place:&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.microsoft.com/en-us/azure/databricks/spark/latest/spark-sql/language-manual/delta-convert-to-delta" alt="https://docs.microsoft.com/en-us/azure/databricks/spark/latest/spark-sql/language-manual/delta-convert-to-delta" target="_blank"&gt;https://docs.microsoft.com/en-us/azure/databricks/spark/latest/spark-sql/language-manual/delta-convert-to-delta&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;Alternatively, you could write to another location, so that the data is copied and saved in Delta Lake format.&lt;/P&gt;</description>
    <pubDate>Mon, 30 May 2022 07:55:39 GMT</pubDate>
    <dc:creator>-werners-</dc:creator>
    <dc:date>2022-05-30T07:55:39Z</dc:date>
    <item>
      <title>delta table storage</title>
      <link>https://community.databricks.com/t5/data-engineering/delta-table-storage/m-p/19382#M12976</link>
      <description>&lt;P&gt;I couldn't find this clearly explained anywhere, so I hope somebody here can shed some light on it.&lt;/P&gt;&lt;P&gt;A few questions:&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;1) Where are Delta tables stored?&lt;/P&gt;&lt;P&gt;The docs say: "Delta Lake uses versioned Parquet files to store your data &lt;B&gt;in your cloud storage&lt;/B&gt;"&lt;/P&gt;&lt;P&gt;So where exactly is it stored? Can it be on any storage I use, for instance Blob Storage, or is it somewhere on DBFS or on the Databricks cluster?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;2) If I already have data saved as Parquet on my Azure Blob Storage and want to convert it to Delta, would the change be made in place on the blob storage? Or would the data be copied somewhere else and saved as Delta only in that new location?&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;TIA&lt;/P&gt;&lt;P&gt;B&lt;/P&gt;</description>
      <pubDate>Fri, 27 May 2022 09:32:31 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/delta-table-storage/m-p/19382#M12976</guid>
      <dc:creator>Braxx</dc:creator>
      <dc:date>2022-05-27T09:32:31Z</dc:date>
    </item>
    <item>
      <title>Re: delta table storage</title>
      <link>https://community.databricks.com/t5/data-engineering/delta-table-storage/m-p/19383#M12977</link>
      <description>&lt;P&gt;If you do not define a storage location yourself, data is stored as managed tables, meaning in the blob storage of the Databricks subscription (which resides on the cloud provider you use).&lt;/P&gt;&lt;P&gt;If you have your own blob storage or data lake, you can (but don't have to) write your data there, as unmanaged tables.&lt;/P&gt;&lt;P&gt;Basically, you can store it anywhere you want in the cloud, as long as Databricks can access it.&lt;/P&gt;&lt;P&gt;DBFS is a semantic layer on top of the actual storage that makes working with files easier.&lt;/P&gt;&lt;P&gt;So if you have mounted three blob storage locations, for example, you can write to any of the three.&lt;/P&gt;&lt;P&gt;&lt;/P&gt;&lt;P&gt;Converting to Delta in place:&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.microsoft.com/en-us/azure/databricks/spark/latest/spark-sql/language-manual/delta-convert-to-delta" alt="https://docs.microsoft.com/en-us/azure/databricks/spark/latest/spark-sql/language-manual/delta-convert-to-delta" target="_blank"&gt;https://docs.microsoft.com/en-us/azure/databricks/spark/latest/spark-sql/language-manual/delta-convert-to-delta&lt;/A&gt;.&lt;/P&gt;&lt;P&gt;Alternatively, you could write to another location, so that the data is copied and saved in Delta Lake format.&lt;/P&gt;</description>
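      <!--
      A minimal sketch of the two options from the reply above: converting existing
      Parquet files where they already live, or writing a copy in Delta format to a
      new location. This is an illustration under stated assumptions, not the reply
      author's code. The function names and the example paths are hypothetical, and
      `spark` is assumed to be an existing SparkSession with Delta Lake available,
      as is the case in a Databricks notebook.

      ```python
      def convert_in_place(spark, parquet_path):
          """Convert an existing Parquet directory to Delta without copying data files.

          CONVERT TO DELTA adds a _delta_log directory next to the existing files.
          `spark` is assumed to be an active SparkSession with Delta Lake available.
          """
          statement = f"CONVERT TO DELTA parquet.`{parquet_path}`"
          spark.sql(statement)
          return statement


      def copy_as_delta(spark, source_path, target_path):
          """Read Parquet from source_path and write a Delta copy to target_path."""
          df = spark.read.parquet(source_path)
          # The default save mode raises an error if target_path already holds data.
          df.write.format("delta").save(target_path)


      # Hypothetical usage in a Databricks notebook, where `spark` is predefined:
      # convert_in_place(spark, "/mnt/mydata/events")
      # copy_as_delta(spark, "/mnt/mydata/events", "/mnt/mydata/events_delta")
      ```

      The first function leaves the data on the original storage, which matches the
      "would the change be made on the blob" case from the question. The second keeps
      the Parquet source untouched and produces a separate Delta table. If the Parquet
      data is partitioned, the CONVERT TO DELTA statement also needs a PARTITIONED BY
      clause, which this sketch omits.
      -->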
      <pubDate>Mon, 30 May 2022 07:55:39 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/delta-table-storage/m-p/19383#M12977</guid>
      <dc:creator>-werners-</dc:creator>
      <dc:date>2022-05-30T07:55:39Z</dc:date>
    </item>
    <item>
      <title>Re: delta table storage</title>
      <link>https://community.databricks.com/t5/data-engineering/delta-table-storage/m-p/19384#M12978</link>
      <description>&lt;P&gt;Thanks, very helpful.&lt;/P&gt;</description>
      <pubDate>Mon, 30 May 2022 15:01:03 GMT</pubDate>
      <guid>https://community.databricks.com/t5/data-engineering/delta-table-storage/m-p/19384#M12978</guid>
      <dc:creator>Braxx</dc:creator>
      <dc:date>2022-05-30T15:01:03Z</dc:date>
    </item>
  </channel>
</rss>

